SlideShare une entreprise Scribd logo
1  sur  29
Télécharger pour lire hors ligne
.   Biological Databses




                Presented By:
                Bhakti Thakkar
                M.L.I.Sc 2012-13
*A database is a collection of
information that is organized
so that it can easily be
accessed, managed, and
updated.


            *
*Harrod’s   Librarien’s Glossary
 defines, “Database as any grouping
 of data for a particular purpose or
 for the use of a particular set of end
 users, usually organized via fields
 and providing tools to enable
 manipulation of the data such as
 sorting, grouping, extraction and
 reporting”.

                       *
Types of Databases




                                       Source database
Reference database
 contains references or
                                           contains
        citations                    Informative text, raw data




          *
Biological
        Databses

   Biogrid database
   Rfam database
   Mir base database
   Modbase database
* The Biological General Repository for
  Interaction Datasets (BioGRID) is a curated
  biological database of protein-protein and
  genetic interactions.
* It wascreated in 2003 by Mike Tyers, Bobby-Joe
  Breitkreutz, and Chris Stark at the Samuel
  Lunenfeld Research Institute at Mount Sinai
  Hospital.
* It strives to provide a comprehensive resource
  of protein–protein and genetic interactions for
  all major model organism species



              *
*BiOGRID searches can be performed
 by clicking on the “gene” tab from
 the main search page.
* To perform search, one can simply
 enter term and search engine will
 search for matching identifiers.
*Advanced search option can be also
 used for the better results.
       *
Search options at BioGRID
                            Wildcard
                            Searches
                            Publication
                             Searches




                    *
* The BioGRID supports wildcard searching (ie.
 searching where the search is not exact but
 rather a range of possible matches) on the TAIL
 end of any keyword entered in our search field.
* To perform a wildcard search, simply enter
 your prefix (must be 3 letters or more)
 followed by a star (*).
* Examples: STE*, CDC*, YAL01*, CLN*


            *
*Publication searches help to
search using publications like
journals, magazines, aritcals
etc.




    *
*Rfam is an open access
 database, hosted at the
 Wellcome Trust Sanger
 Institute, containing information
 about RNA(Ribonucleic acid)
 families.
*Rfam contains information for
 RNA families and annotations for
 millions of RNA genes.
              *
Rfam
             Database
              Search


                      Batch
      Sequence
                    sequence



Keyword     Taxonomy      Entry Type




            *
* To Find Rfam families within your sequence of
 interest use Sequence search
* To find multiple sequences to be searched for
 matching Rfam families use Batch sequence
* To search for keywords within the textual data in the
 Rfam database use Keyword search
* For a complex expression that precisely defines the
 species distribution of families use Taxonomy search
* for Search for Rfam families by type like Gene, anti-
 toxin etc. use Entry Type search




                                 *
* miRBase is a biological database that acts as an
 archive of microRNA sequences and
 annotations.
* The miRBase registry provides a centralised
 system for assigning new names to microRNA
 genes.
* The miRBase Sequence database is the primary
 repository for published microRNA (miRNA)
 sequence and annotation data.

                              *             Database
*
miRBase has five aims:

 To provide a consistent naming system for microRNAs


 To provide a central place collecting all known microRNA
  sequences


 To provide human and computer readable information for
  each microRNA


 To provide primary evidence for each microRNA


 To aggregate and link to microRNA target information
MiRBase database search features


 By miRNA identifier or keyword

 By genomic location


 For clusters

 By tissue expression

 By sequence
miRBase provides the following services:
 The miRBase database is a searchable database of published
  miRNA sequences and annotation

 Both hairpin and mature sequences are available for searching
  and browsing, and entries can also be retrieved by
  name, keyword, references and annotation. All sequence and
  annotation data are also available for download

 Each entry in the miRBase Sequence database represents a predicted
  hairpin portion of a miRNA transcript (termed mir in the
  database), with information on the location and sequence of the
  mature miRNA sequence (termed miR). Both hairpin and mature
  sequences are available for searching and browsing, and entries can
  also be retrieved by name, keyword, references and annotation.
 The miRBase Registry provides miRNA gene hunters with unique
  names for novel miRNA genes prior to publication of results.
Modbase detabase
   MODBASE is a query able database of
   annotated protein structure models.



   MODBASE contains theoretically calculated
   models, which may contain significant
   errors, not experimentally determined
   structures..
Modbase detabase
ModBase Search : ModBase is a database of comparative protein structure models,
calculated by our modeling pipeline ModPipe.

ModBase Search

 Search type model (default)

 Sequence similarity (blast)

 Display type:-

 model detail, model overview, sequence overview.

 Search by properties.

 Database accession number

 Annotation keywords

 Gene name
Advance Search option
 Internal id
 Template pdb code
 Template or homolog pdb code
Organism:-
 Homoslapiens
 Mas musculus and all
In advance search               :-
   Select property options:-
   Model size, E-value (0-100)
   Model score (0-100)
   Protein size
   % seq. lent (0-100)
   Minimum and maximum
   Search-(and, or)
   Apply to-all models of sequence, some models of a sequence
   Hide advanced properties.
Modbase provides the following services:

 MODBASE uses the MySQL relational database management system for flexible
  querying and CHIMERA for viewing the sequences and structures
  (http://www.cgl.ucsf.edu/chimera/).



 MODBASE is updated regularly to reflect the growth in protein sequence and
  structure databases, as well as improvements in the software for calculating
  the models.


 The largest data set contains 1,26,629 models for domains in 659,495 out of
  1,182,126 unique protein sequences in the complete Swiss-Prot/TrEMBL
  database (August 25, 2003); only models based on alignments with significant
  similarity scores and models assessed to have the correct fold despite
  insignificant alignments are included.
 The 53 new structures produced by the consortium allowed us to
  characterize structurally 24,113 sequences.



 Our other resources associated with MODBASE include a comprehensive
  database of multiple protein structure alignments
biological detabase

Contenu connexe

Tendances (20)

BLAST
BLASTBLAST
BLAST
 
UniProt
UniProtUniProt
UniProt
 
Gene network and pathways
Gene network and pathwaysGene network and pathways
Gene network and pathways
 
Protein database
Protein  databaseProtein  database
Protein database
 
UPGMA
UPGMAUPGMA
UPGMA
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Structural databases
Structural databases Structural databases
Structural databases
 
Primary and secondary database
Primary and secondary databasePrimary and secondary database
Primary and secondary database
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Protein sequence databases
Protein sequence databasesProtein sequence databases
Protein sequence databases
 
Functional proteomics, and tools
Functional proteomics, and toolsFunctional proteomics, and tools
Functional proteomics, and tools
 
Fasta
FastaFasta
Fasta
 
MEGA (Molecular Evolutionary Genetics Analysis)
MEGA (Molecular Evolutionary Genetics Analysis)MEGA (Molecular Evolutionary Genetics Analysis)
MEGA (Molecular Evolutionary Genetics Analysis)
 
Blast Algorithm
Blast AlgorithmBlast Algorithm
Blast Algorithm
 
methods for protein structure prediction
methods for protein structure predictionmethods for protein structure prediction
methods for protein structure prediction
 
Scoring matrices
Scoring matricesScoring matrices
Scoring matrices
 
Scop database
Scop databaseScop database
Scop database
 
Major databases in bioinformatics
Major databases in bioinformaticsMajor databases in bioinformatics
Major databases in bioinformatics
 
Tools and database of NCBI
Tools and database of NCBITools and database of NCBI
Tools and database of NCBI
 
Gene prediction method
Gene prediction method Gene prediction method
Gene prediction method
 

En vedette

The Way to Success Timesharing
The Way to Success TimesharingThe Way to Success Timesharing
The Way to Success Timesharingolga2953
 
Christthe Redeemerin Rio
Christthe Redeemerin RioChristthe Redeemerin Rio
Christthe Redeemerin RioNasia
 
Nachos multiprogramming
Nachos multiprogrammingNachos multiprogramming
Nachos multiprogrammingJim Chang
 
Parallel Processing
Parallel ProcessingParallel Processing
Parallel ProcessingRTigger
 

En vedette (9)

O Imaginário
O ImaginárioO Imaginário
O Imaginário
 
The Way to Success Timesharing
The Way to Success TimesharingThe Way to Success Timesharing
The Way to Success Timesharing
 
Christthe Redeemerin Rio
Christthe Redeemerin RioChristthe Redeemerin Rio
Christthe Redeemerin Rio
 
La paz ahora
La paz ahoraLa paz ahora
La paz ahora
 
Enade alberto-mayol
Enade alberto-mayolEnade alberto-mayol
Enade alberto-mayol
 
Luchar por la definición de la crisis
Luchar por la definición de la crisis Luchar por la definición de la crisis
Luchar por la definición de la crisis
 
Nachos multiprogramming
Nachos multiprogrammingNachos multiprogramming
Nachos multiprogramming
 
Parallel processing extra
Parallel processing extraParallel processing extra
Parallel processing extra
 
Parallel Processing
Parallel ProcessingParallel Processing
Parallel Processing
 

Similaire à biological detabase

Data retriveal ,srg and dbget
Data retriveal ,srg and dbgetData retriveal ,srg and dbget
Data retriveal ,srg and dbgetSurendraKumar338
 
Microarrays Databases.pptx
Microarrays Databases.pptxMicroarrays Databases.pptx
Microarrays Databases.pptxMuzzamilahmed14
 
Data retreival system
Data retreival systemData retreival system
Data retreival systemShikha Thakur
 
How to Use mirtronDB
How to Use mirtronDBHow to Use mirtronDB
How to Use mirtronDBbhrfonseca
 
How to Use mirtronDB
How to Use mirtronDBHow to Use mirtronDB
How to Use mirtronDBbhrfonseca
 
How to Use mirtronDB
How to Use mirtronDBHow to Use mirtronDB
How to Use mirtronDBbhrfonseca
 
Rap db(rice annotation project data base)
Rap db(rice annotation project data base)Rap db(rice annotation project data base)
Rap db(rice annotation project data base)PrajaktaKale17
 
Bioinformatics data mining
Bioinformatics data miningBioinformatics data mining
Bioinformatics data miningSangeeta Das
 
biological databases.pptx
biological databases.pptxbiological databases.pptx
biological databases.pptxscience lover
 
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...Peter McQuilton
 
Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformaticsVinaKhan1
 
database retrival.pdf
database retrival.pdfdatabase retrival.pdf
database retrival.pdfSrimathideviJ
 

Similaire à biological detabase (20)

Data retriveal ,srg and dbget
Data retriveal ,srg and dbgetData retriveal ,srg and dbget
Data retriveal ,srg and dbget
 
Data Retrieval Systems
Data Retrieval SystemsData Retrieval Systems
Data Retrieval Systems
 
Microarrays Databases.pptx
Microarrays Databases.pptxMicroarrays Databases.pptx
Microarrays Databases.pptx
 
Data retrieval tools
Data retrieval toolsData retrieval tools
Data retrieval tools
 
Data retreival system
Data retreival systemData retreival system
Data retreival system
 
How to Use mirtronDB
How to Use mirtronDBHow to Use mirtronDB
How to Use mirtronDB
 
How to Use mirtronDB
How to Use mirtronDBHow to Use mirtronDB
How to Use mirtronDB
 
Important protein databases and proteomics softwares
Important protein databases and proteomics softwaresImportant protein databases and proteomics softwares
Important protein databases and proteomics softwares
 
How to Use mirtronDB
How to Use mirtronDBHow to Use mirtronDB
How to Use mirtronDB
 
Rap db(rice annotation project data base)
Rap db(rice annotation project data base)Rap db(rice annotation project data base)
Rap db(rice annotation project data base)
 
Protein databases
Protein databasesProtein databases
Protein databases
 
Bioinformatics data mining
Bioinformatics data miningBioinformatics data mining
Bioinformatics data mining
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
biological databases.pptx
biological databases.pptxbiological databases.pptx
biological databases.pptx
 
Introduction to databases.pptx
Introduction to databases.pptxIntroduction to databases.pptx
Introduction to databases.pptx
 
Chemspider For Mass Spectrometrists Public Version
Chemspider For Mass Spectrometrists Public VersionChemspider For Mass Spectrometrists Public Version
Chemspider For Mass Spectrometrists Public Version
 
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...
 
Harvester I
Harvester IHarvester I
Harvester I
 
Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformatics
 
database retrival.pdf
database retrival.pdfdatabase retrival.pdf
database retrival.pdf
 

biological detabase

  • 1. . Biological Databses Presented By: Bhakti Thakkar M.L.I.Sc 2012-13
  • 2. *A database is a collection of information that is organized so that it can easily be accessed, managed, and updated. *
  • 3. *Harrod’s Librarien’s Glossary defines, “Database as any grouping of data for a particular purpose or for the use of a particular set of end users, usually organized via fields and providing tools to enable manipulation of the data such as sorting, grouping, extraction and reporting”. *
  • 4. Types of Databases Source database Reference database contains references or contains citations Informative text, raw data *
  • 5. Biological Databses  Biogrid database  Rfam database  Mir base database  Modbase database
  • 6. * The Biological General Repository for Interaction Datasets (BioGRID) is a curated biological database of protein-protein and genetic interactions. * It wascreated in 2003 by Mike Tyers, Bobby-Joe Breitkreutz, and Chris Stark at the Samuel Lunenfeld Research Institute at Mount Sinai Hospital. * It strives to provide a comprehensive resource of protein–protein and genetic interactions for all major model organism species *
  • 7.
  • 8. *BiOGRID searches can be performed by clicking on the “gene” tab from the main search page. * To perform search, one can simply enter term and search engine will search for matching identifiers. *Advanced search option can be also used for the better results. *
  • 9. Search options at BioGRID Wildcard Searches Publication Searches *
  • 10. * The BioGRID supports wildcard searching (ie. searching where the search is not exact but rather a range of possible matches) on the TAIL end of any keyword entered in our search field. * To perform a wildcard search, simply enter your prefix (must be 3 letters or more) followed by a star (*). * Examples: STE*, CDC*, YAL01*, CLN* *
  • 11. *Publication searches help to search using publications like journals, magazines, aritcals etc. *
  • 12. *Rfam is an open access database, hosted at the Wellcome Trust Sanger Institute, containing information about RNA(Ribonucleic acid) families. *Rfam contains information for RNA families and annotations for millions of RNA genes. *
  • 13.
  • 14. Rfam Database Search Batch Sequence sequence Keyword Taxonomy Entry Type *
  • 15. * To Find Rfam families within your sequence of interest use Sequence search * To find multiple sequences to be searched for matching Rfam families use Batch sequence * To search for keywords within the textual data in the Rfam database use Keyword search * For a complex expression that precisely defines the species distribution of families use Taxonomy search * for Search for Rfam families by type like Gene, anti- toxin etc. use Entry Type search *
  • 16.
  • 17.
  • 18. * miRBase is a biological database that acts as an archive of microRNA sequences and annotations. * The miRBase registry provides a centralised system for assigning new names to microRNA genes. * The miRBase Sequence database is the primary repository for published microRNA (miRNA) sequence and annotation data. * Database
  • 19. *
  • 20. miRBase has five aims:  To provide a consistent naming system for microRNAs  To provide a central place collecting all known microRNA sequences  To provide human and computer readable information for each microRNA  To provide primary evidence for each microRNA  To aggregate and link to microRNA target information
  • 21. MiRBase database search features  By miRNA identifier or keyword  By genomic location  For clusters  By tissue expression  By sequence
  • 22. miRBase provides the following services:  The miRBase database is a searchable database of published miRNA sequences and annotation  Both hairpin and mature sequences are available for searching and browsing, and entries can also be retrieved by name, keyword, references and annotation. All sequence and annotation data are also available for download  Each entry in the miRBase Sequence database represents a predicted hairpin portion of a miRNA transcript (termed mir in the database), with information on the location and sequence of the mature miRNA sequence (termed miR). Both hairpin and mature sequences are available for searching and browsing, and entries can also be retrieved by name, keyword, references and annotation.  The miRBase Registry provides miRNA gene hunters with unique names for novel miRNA genes prior to publication of results.
  • 23. Modbase detabase MODBASE is a query able database of annotated protein structure models. MODBASE contains theoretically calculated models, which may contain significant errors, not experimentally determined structures..
  • 24.
  • 25. Modbase detabase ModBase Search : ModBase is a database of comparative protein structure models, calculated by our modeling pipeline ModPipe. ModBase Search  Search type model (default)  Sequence similarity (blast)  Display type:-  model detail, model overview, sequence overview.  Search by properties.  Database accession number  Annotation keywords  Gene name
  • 26. Advance Search option  Internal id  Template pdb code  Template or homolog pdb code Organism:-  Homoslapiens  Mas musculus and all In advance search :-  Select property options:-  Model size, E-value (0-100)  Model score (0-100)  Protein size  % seq. lent (0-100)  Minimum and maximum  Search-(and, or)  Apply to-all models of sequence, some models of a sequence  Hide advanced properties.
  • 27. Modbase provides the following services:  MODBASE uses the MySQL relational database management system for flexible querying and CHIMERA for viewing the sequences and structures (http://www.cgl.ucsf.edu/chimera/).  MODBASE is updated regularly to reflect the growth in protein sequence and structure databases, as well as improvements in the software for calculating the models.  The largest data set contains 1,26,629 models for domains in 659,495 out of 1,182,126 unique protein sequences in the complete Swiss-Prot/TrEMBL database (August 25, 2003); only models based on alignments with significant similarity scores and models assessed to have the correct fold despite insignificant alignments are included.
  • 28.  The 53 new structures produced by the consortium allowed us to characterize structurally 24,113 sequences.  Our other resources associated with MODBASE include a comprehensive database of multiple protein structure alignments