SlideShare une entreprise Scribd logo
1  sur  14
Biological Databases
1
Submitted
For the M.Sc. (BIOTECHNOLOGY) III-SEMESTER EXAM.DEC.2018 (REGULAR)
Subject
SEMINAR, SCIENTIFIC WRITING AND PRESENTATION
Submitted by
Shradheya R.R. Gupta
M.Sc. Biotechnology
Roll. Number
1832128
1. Content
Biological Databases
Types
A. On the basis of type:-
1. Sequence Databases
2. Structure Databases
3. Functional Databases
Conclusion
B. On the basis of order:-
1. Primary Databases
2. Secondary Databases
3. Composite Databases
2
1. Biological Databases
 Biological databases are store house of life science information.
 Information is collected from scientific experiments, published literature, high-
throughput experiment technology, and computational analysis.
3
A. On the basis of type
1. Sequence database:-
 Composed of a large collection of
nucleic acid and protein sequences.
 BLAST program is the most common
searching tool for sequence
similarity.
 Many annotations of the sequences
are based on the results of sequence
similarity searches of previously-
annotated sequences. 4
2. Structure database:-
 Main aim is to organize and annotate the protein structures.
Example:-
1. PDB
2. Databases of Macromolecules Movements
3. Functional database:-
 Physiological role of gene products - enzyme activities, mutant phenotypes,
biological pathways etc.
Examples:-
1. KEGG PATHWAY Database
2. BRENDA
3. Reactome
4. HMDB
5
B. On the basis of order
1. Primary database:-
 A primary database contains information obtained experimentally.
 Experimental results are submitted directly into the database by researchers,
and the data are essentially archival in nature.
6
A. Nucleotide Primary database:-
 Three chief databases that store and
make available raw nucleic acid
sequences.
1. GenBank:-
Located in the U.S.A.
2. DDBJ:-
Located in Japan
3. EMBL:-
Located in U.K.
 They have uniform data formats (but
not identical) and exchange data on
daily basis. 7
B. Protein Primary database:-
 PIR-PSD is a comprehensive, non- redundant and annotated data.
Classification of protein sequences based on the super family concept.
 SWISS -PROT it provides a high level of annotation.
 Both PIR-PSD and SWISS-PROT have software that enables the user to easily
search through the database to obtain only the required information.
 TrEMB it contains the translation of all coding sequences present in the EMBL
nucleotide database.
8
2. Secondary database:-
 Comprises data derived from the results of primary data.
 Secondary databases have become the molecular biologist’s reference library
over the past decade.
9
A. Nucleotide Secondary database:-
 UniGene automatically partitioning GenBank sequences into a non-redundant
set of gene-oriented clusters.
 Ensembl provide a centralized resource for geneticists, molecular biologists
and other researchers studying the genomes.
 Microbial Resource contains all the focus on one organism.
 ACeDB originally developed for the C. Elegans ( a nematode worm) genome
project. It is a repository of sequence, genetic map and phenotypic information
about the C. Elegans.
 FlyBase genome of the fruit fly D. Melanogaster to a high degree of
completeness and quality.
10
B. Protein Secondary database:-
 InterPro is a database of protein families, domains and functional sites in
which identifiable features found in known proteins can be applied to new
protein.
 UniProt database of protein sequence and functional information.
 GPCRGB database is focused on a single family protein, GPCRGB. These are
transmembrane protein used by cells to communicate with the outside world.
 CluSTr (Cluster of SWISS-PROT and TrEMBL) database offers an automatic
classification of the entries in the SWISS-PROT and TrEMBL databases into
groups of related proteins.
 COGS or Cluster of Orthologous Groups of protein database. 11
3. Composite database:-
 It is an amalgamation of different primary database sources,
which omits the need to search multiple resources.
 NCBI hosts these features to various persons involved in
research.
Examples:-
1. OMIM
Catalog of human genes, genetic disorders and related literature.
2. GENE
Molecular data and literature related to genes with extensive links to
other databases. 12
13
Conclusion
 The present challenge is to:-
1. Handle huge volume of data.
2. To improve database design.
3. Develop software for database access and manipulation.
 There is no doubt of involvement of bioinformatics in biological
sciences and betterment of human lives.
14
Thank You

Contenu connexe

Tendances (20)

Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentation
 
Protein structure visualization tools-RASMOL
Protein structure visualization tools-RASMOLProtein structure visualization tools-RASMOL
Protein structure visualization tools-RASMOL
 
Ddbj
DdbjDdbj
Ddbj
 
Genomic databases
Genomic databasesGenomic databases
Genomic databases
 
EMBL
EMBLEMBL
EMBL
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databases
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Cath
CathCath
Cath
 
PIR- Protein Information Resource
PIR- Protein Information ResourcePIR- Protein Information Resource
PIR- Protein Information Resource
 
Web based servers and softwares for genome analysis
Web based servers and softwares for genome analysisWeb based servers and softwares for genome analysis
Web based servers and softwares for genome analysis
 
Structural databases
Structural databases Structural databases
Structural databases
 
Swiss PROT
Swiss PROT Swiss PROT
Swiss PROT
 
Swiss prot database
Swiss prot databaseSwiss prot database
Swiss prot database
 
Databases
DatabasesDatabases
Databases
 
Data retrieval tools
Data retrieval toolsData retrieval tools
Data retrieval tools
 
Protein database
Protein databaseProtein database
Protein database
 
SEQUENCE ANALYSIS
SEQUENCE ANALYSISSEQUENCE ANALYSIS
SEQUENCE ANALYSIS
 
Protein databases
Protein databasesProtein databases
Protein databases
 
Kegg
KeggKegg
Kegg
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 

Similaire à Biological databases

Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...SBituila
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...BibiQuinah
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...Elufer Akram
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introductionDrGopaSarma
 
Bioinformatics biological databases
Bioinformatics biological databasesBioinformatics biological databases
Bioinformatics biological databasesSangeeta Das
 
Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformaticsVinaKhan1
 
Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEPrashantSharma807
 
BIOINFORMATICS AND DATABASES IN BIOINFORMATICS.pdf
BIOINFORMATICS  AND  DATABASES IN BIOINFORMATICS.pdfBIOINFORMATICS  AND  DATABASES IN BIOINFORMATICS.pdf
BIOINFORMATICS AND DATABASES IN BIOINFORMATICS.pdfPravanjanDash
 
COMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptx
COMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptxCOMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptx
COMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptxPravanjanDash
 
Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu KAUSHAL SAHU
 
Biological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdfBiological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdfBioinformaticsCentre
 
protein databases.ppt
protein databases.pptprotein databases.ppt
protein databases.pptSanthiyaAK
 

Similaire à Biological databases (20)

Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...
 
Biological databases.pptx
Biological databases.pptxBiological databases.pptx
Biological databases.pptx
 
Biological database
Biological databaseBiological database
Biological database
 
Introduction to databases.pptx
Introduction to databases.pptxIntroduction to databases.pptx
Introduction to databases.pptx
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 
Biological databases
Biological databases Biological databases
Biological databases
 
Bioinformatics biological databases
Bioinformatics biological databasesBioinformatics biological databases
Bioinformatics biological databases
 
Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformatics
 
Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASE
 
Introduction to Biological databases
Introduction to Biological databasesIntroduction to Biological databases
Introduction to Biological databases
 
Biological databases
Biological databasesBiological databases
Biological databases
 
BIOINFORMATICS AND DATABASES IN BIOINFORMATICS.pdf
BIOINFORMATICS  AND  DATABASES IN BIOINFORMATICS.pdfBIOINFORMATICS  AND  DATABASES IN BIOINFORMATICS.pdf
BIOINFORMATICS AND DATABASES IN BIOINFORMATICS.pdf
 
COMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptx
COMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptxCOMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptx
COMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptx
 
Protein databases
Protein databasesProtein databases
Protein databases
 
Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu
 
Biological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdfBiological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdf
 
protein databases.ppt
protein databases.pptprotein databases.ppt
protein databases.ppt
 

Plus de SHRADHEYA GUPTA

Training report on Industrial Biotechnology
Training report on Industrial BiotechnologyTraining report on Industrial Biotechnology
Training report on Industrial BiotechnologySHRADHEYA GUPTA
 
Dessertation - Early treatment of Bloodstream Infections
Dessertation - Early treatment of Bloodstream InfectionsDessertation - Early treatment of Bloodstream Infections
Dessertation - Early treatment of Bloodstream InfectionsSHRADHEYA GUPTA
 
Plant Secondary Metabolities
Plant Secondary MetabolitiesPlant Secondary Metabolities
Plant Secondary MetabolitiesSHRADHEYA GUPTA
 

Plus de SHRADHEYA GUPTA (6)

Protein Predictinon
Protein PredictinonProtein Predictinon
Protein Predictinon
 
Clinical Microbiology
Clinical MicrobiologyClinical Microbiology
Clinical Microbiology
 
Training report on Industrial Biotechnology
Training report on Industrial BiotechnologyTraining report on Industrial Biotechnology
Training report on Industrial Biotechnology
 
Dessertation - Early treatment of Bloodstream Infections
Dessertation - Early treatment of Bloodstream InfectionsDessertation - Early treatment of Bloodstream Infections
Dessertation - Early treatment of Bloodstream Infections
 
Plant Secondary Metabolities
Plant Secondary MetabolitiesPlant Secondary Metabolities
Plant Secondary Metabolities
 
Pollution
PollutionPollution
Pollution
 

Dernier

Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 

Dernier (20)

Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 

Biological databases

  • 1. Biological Databases 1 Submitted For the M.Sc. (BIOTECHNOLOGY) III-SEMESTER EXAM.DEC.2018 (REGULAR) Subject SEMINAR, SCIENTIFIC WRITING AND PRESENTATION Submitted by Shradheya R.R. Gupta M.Sc. Biotechnology Roll. Number 1832128
  • 2. 1. Content Biological Databases Types A. On the basis of type:- 1. Sequence Databases 2. Structure Databases 3. Functional Databases Conclusion B. On the basis of order:- 1. Primary Databases 2. Secondary Databases 3. Composite Databases 2
  • 3. 1. Biological Databases  Biological databases are store house of life science information.  Information is collected from scientific experiments, published literature, high- throughput experiment technology, and computational analysis. 3
  • 4. A. On the basis of type 1. Sequence database:-  Composed of a large collection of nucleic acid and protein sequences.  BLAST program is the most common searching tool for sequence similarity.  Many annotations of the sequences are based on the results of sequence similarity searches of previously- annotated sequences. 4
  • 5. 2. Structure database:-  Main aim is to organize and annotate the protein structures. Example:- 1. PDB 2. Databases of Macromolecules Movements 3. Functional database:-  Physiological role of gene products - enzyme activities, mutant phenotypes, biological pathways etc. Examples:- 1. KEGG PATHWAY Database 2. BRENDA 3. Reactome 4. HMDB 5
  • 6. B. On the basis of order 1. Primary database:-  A primary database contains information obtained experimentally.  Experimental results are submitted directly into the database by researchers, and the data are essentially archival in nature. 6
  • 7. A. Nucleotide Primary database:-  Three chief databases that store and make available raw nucleic acid sequences. 1. GenBank:- Located in the U.S.A. 2. DDBJ:- Located in Japan 3. EMBL:- Located in U.K.  They have uniform data formats (but not identical) and exchange data on daily basis. 7
  • 8. B. Protein Primary database:-  PIR-PSD is a comprehensive, non- redundant and annotated data. Classification of protein sequences based on the super family concept.  SWISS -PROT it provides a high level of annotation.  Both PIR-PSD and SWISS-PROT have software that enables the user to easily search through the database to obtain only the required information.  TrEMB it contains the translation of all coding sequences present in the EMBL nucleotide database. 8
  • 9. 2. Secondary database:-  Comprises data derived from the results of primary data.  Secondary databases have become the molecular biologist’s reference library over the past decade. 9
  • 10. A. Nucleotide Secondary database:-  UniGene automatically partitioning GenBank sequences into a non-redundant set of gene-oriented clusters.  Ensembl provide a centralized resource for geneticists, molecular biologists and other researchers studying the genomes.  Microbial Resource contains all the focus on one organism.  ACeDB originally developed for the C. Elegans ( a nematode worm) genome project. It is a repository of sequence, genetic map and phenotypic information about the C. Elegans.  FlyBase genome of the fruit fly D. Melanogaster to a high degree of completeness and quality. 10
  • 11. B. Protein Secondary database:-  InterPro is a database of protein families, domains and functional sites in which identifiable features found in known proteins can be applied to new protein.  UniProt database of protein sequence and functional information.  GPCRGB database is focused on a single family protein, GPCRGB. These are transmembrane protein used by cells to communicate with the outside world.  CluSTr (Cluster of SWISS-PROT and TrEMBL) database offers an automatic classification of the entries in the SWISS-PROT and TrEMBL databases into groups of related proteins.  COGS or Cluster of Orthologous Groups of protein database. 11
  • 12. 3. Composite database:-  It is an amalgamation of different primary database sources, which omits the need to search multiple resources.  NCBI hosts these features to various persons involved in research. Examples:- 1. OMIM Catalog of human genes, genetic disorders and related literature. 2. GENE Molecular data and literature related to genes with extensive links to other databases. 12
  • 13. 13 Conclusion  The present challenge is to:- 1. Handle huge volume of data. 2. To improve database design. 3. Develop software for database access and manipulation.  There is no doubt of involvement of bioinformatics in biological sciences and betterment of human lives.