SlideShare une entreprise Scribd logo
1  sur  22
Sucheta Tripathy
7th October 2012
   Introduction.
   History of Genome Sequencing.
   Rationale behind genome sequencing.
   How genomes are sequenced.
   What happens next.
    ◦ Assembly and Annotation.
    ◦ Sequence Submissions.
   Microbial Genome Sequencing.
   Human Genome Project.
    ◦ Encode Project.
    ◦ 1000 genomes project.
   Write a paragraph (less than 1000 characters)
    on “why you think more genomes need to be
    sequenced OR not sequenced”.
   tsucheta@iicb.res.in/tsucheta@gmail.com
   Literature search databases.
   NA and protein databases.
   Animal and plant databases
   Ensembl Genome project
   TIGR Database.
   Biotechnological databases
   Database for species identification and
    classification
   Structural databases
   Database retrieval and deposition schemes
   What are databases?
   Components.
   Types of Databases.
   Applications and Limitations.
   Journals Publishing databases.
   Database management Systems
    ◦   Mysql
    ◦   Oracle
    ◦   Postgress
    ◦   Sqlserver
    ◦   MS Access ….
   A DBMS in the backend.
    ◦ SQL scripting
    ◦ PL/SQLs
    ◦ Other scripting interfaces(C/C++/API)
   A front end UI.
    ◦ PHP
    ◦ Perl/CGI
    ◦ VB
   Files are not enough
   Searching.
   Sorting.
   Combining data types.
   Organizing.
   Managing.
   Sequence data in genbank.
   HTML files.
   Excel files.
   Regular list.
   Indexes.
   Flat files.
   Biological databases
    ◦ MetaBase ( A database of Biological databases)
    ◦ http://metadatabase.org/
   Bibliographic databases
   Chemical databases
   Numerous other databases.
   Sequence databases.
    ◦ Nucleotide
    ◦ Protein
   Structure Databases.
   Genome databases.
   Transcriptome databases
   Model organism databases.
    ◦ PlasmoDB, TAIR, FlyBase etc.
   http://asia.ensembl.org/Help/Movie?id=210
Genbank         DDBJ


          EBI
   Gbrowse
   UCSC Genome Browser
   Vista Browser
   Ensembl browser
   Integrated Genome Browser
   PUBMED
    ◦ 22.1 million records
    ◦ eTBLAST
   CABI
   SCOPUS
   Google Scholar
   Organized information.
   Maintained and upgraded.
   Visualization tools.
   So many database to look for
   Not many are updated
   Lack of proper documentation
   Database
   Nucleic Acids Research
   BMC Genomics
   Bioinformatics
   Nature
   Cell
   Plant Cell
   Pick any database of your choice and state
    why you like it. (1000 characters)

Contenu connexe

Tendances

Protein databases
Protein databasesProtein databases
Protein databases
sarumalay
 
Comparative genomics and proteomics
Comparative genomics and proteomicsComparative genomics and proteomics
Comparative genomics and proteomics
Nikhil Aggarwal
 

Tendances (20)

Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment   Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment
 
Tools of bioinforformatics by kk
Tools of bioinforformatics by kkTools of bioinforformatics by kk
Tools of bioinforformatics by kk
 
Scop database
Scop databaseScop database
Scop database
 
blast bioinformatics
blast bioinformaticsblast bioinformatics
blast bioinformatics
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
 
BLAST
BLASTBLAST
BLAST
 
Entrez databases
Entrez databasesEntrez databases
Entrez databases
 
Distance based method
Distance based method Distance based method
Distance based method
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 
Protein databases
Protein databasesProtein databases
Protein databases
 
Data Retrieval Systems
Data Retrieval SystemsData Retrieval Systems
Data Retrieval Systems
 
Gene bank by kk sahu
Gene bank by kk sahuGene bank by kk sahu
Gene bank by kk sahu
 
Introduction to databases.pptx
Introduction to databases.pptxIntroduction to databases.pptx
Introduction to databases.pptx
 
The ensembl database
The ensembl databaseThe ensembl database
The ensembl database
 
Comparative genomics and proteomics
Comparative genomics and proteomicsComparative genomics and proteomics
Comparative genomics and proteomics
 
Tree building
Tree buildingTree building
Tree building
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Protein fold recognition and ab_initio modeling
Protein fold recognition and ab_initio modelingProtein fold recognition and ab_initio modeling
Protein fold recognition and ab_initio modeling
 

En vedette

databases in bioinformatics
databases in bioinformaticsdatabases in bioinformatics
databases in bioinformatics
nadeem akhter
 
Computational biology bls 303
Computational biology bls 303Computational biology bls 303
Computational biology bls 303
Bruno Mmassy
 
Chemical File Formats for storing chemical data
Chemical File Formats for storing chemical dataChemical File Formats for storing chemical data
Chemical File Formats for storing chemical data
Abhik Seal
 
BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES
nadeem akhter
 
molecular file formats in bioinformatics
molecular file formats in bioinformaticsmolecular file formats in bioinformatics
molecular file formats in bioinformatics
nadeem akhter
 

En vedette (18)

Biological Databases
Biological DatabasesBiological Databases
Biological Databases
 
databases in bioinformatics
databases in bioinformaticsdatabases in bioinformatics
databases in bioinformatics
 
EMBL-EBI
EMBL-EBIEMBL-EBI
EMBL-EBI
 
NCBI
NCBINCBI
NCBI
 
BITS: UCSC genome browser - Part 1
BITS: UCSC genome browser - Part 1BITS: UCSC genome browser - Part 1
BITS: UCSC genome browser - Part 1
 
BITS training - UCSC Genome Browser - Part 2
BITS training - UCSC Genome Browser - Part 2BITS training - UCSC Genome Browser - Part 2
BITS training - UCSC Genome Browser - Part 2
 
GenomeBrowser
GenomeBrowserGenomeBrowser
GenomeBrowser
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...
 
Computational biology bls 303
Computational biology bls 303Computational biology bls 303
Computational biology bls 303
 
Chemical File Formats for storing chemical data
Chemical File Formats for storing chemical dataChemical File Formats for storing chemical data
Chemical File Formats for storing chemical data
 
BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES
 
Design your own test automation tool
Design your own test automation toolDesign your own test automation tool
Design your own test automation tool
 
molecular file formats in bioinformatics
molecular file formats in bioinformaticsmolecular file formats in bioinformatics
molecular file formats in bioinformatics
 
Sequence file formats
Sequence file formatsSequence file formats
Sequence file formats
 
Intro to Open Babel
Intro to Open BabelIntro to Open Babel
Intro to Open Babel
 
sequence of file formats in bioinformatics
sequence of file formats in bioinformaticssequence of file formats in bioinformatics
sequence of file formats in bioinformatics
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Biological databases
Biological databasesBiological databases
Biological databases
 

Similaire à Biological databases

Biological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdfBiological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdf
BioinformaticsCentre
 
Bioinformatic_Databases_2xcxzczxcxzxcxzc
Bioinformatic_Databases_2xcxzczxcxzxcxzcBioinformatic_Databases_2xcxzczxcxzxcxzc
Bioinformatic_Databases_2xcxzczxcxzxcxzc
AdiM27
 
Biological Database Systems
Biological Database SystemsBiological Database Systems
Biological Database Systems
Denis Shestakov
 

Similaire à Biological databases (20)

Biological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdfBiological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdf
 
Databases ii
Databases iiDatabases ii
Databases ii
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Data Base in Bioinformatics.ppt
Data Base in Bioinformatics.pptData Base in Bioinformatics.ppt
Data Base in Bioinformatics.ppt
 
Nucleic acid database
Nucleic acid databaseNucleic acid database
Nucleic acid database
 
Primary, secondary, tertiary biological database
Primary, secondary, tertiary biological databasePrimary, secondary, tertiary biological database
Primary, secondary, tertiary biological database
 
Major databases in bioinformatics
Major databases in bioinformaticsMajor databases in bioinformatics
Major databases in bioinformatics
 
PDF文档.pdf
PDF文档.pdfPDF文档.pdf
PDF文档.pdf
 
Bioinformatic databases 2
Bioinformatic databases 2Bioinformatic databases 2
Bioinformatic databases 2
 
Bioinformatic_Databases_2.ppt
Bioinformatic_Databases_2.pptBioinformatic_Databases_2.ppt
Bioinformatic_Databases_2.ppt
 
Bioinformatic_Databases_2xcxzczxcxzxcxzc
Bioinformatic_Databases_2xcxzczxcxzxcxzcBioinformatic_Databases_2xcxzczxcxzxcxzc
Bioinformatic_Databases_2xcxzczxcxzxcxzc
 
Bioinformatic databases 2
Bioinformatic databases 2Bioinformatic databases 2
Bioinformatic databases 2
 
BITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequencesBITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequences
 
EiTESAL eHealth Conference 14&15 May 2017
EiTESAL eHealth Conference 14&15 May 2017 EiTESAL eHealth Conference 14&15 May 2017
EiTESAL eHealth Conference 14&15 May 2017
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences research
 
Data base in detail
Data base in detailData base in detail
Data base in detail
 
Databases_CSS2.pptx
Databases_CSS2.pptxDatabases_CSS2.pptx
Databases_CSS2.pptx
 
Biological Database Systems
Biological Database SystemsBiological Database Systems
Biological Database Systems
 
2020 02 11_biological_databases_part1
2020 02 11_biological_databases_part12020 02 11_biological_databases_part1
2020 02 11_biological_databases_part1
 
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
 

Plus de Sucheta Tripathy

Plus de Sucheta Tripathy (20)

Gal
GalGal
Gal
 
Ramorum2016 final
Ramorum2016 finalRamorum2016 final
Ramorum2016 final
 
Primer designgeneprediction
Primer designgenepredictionPrimer designgeneprediction
Primer designgeneprediction
 
Motif andpatterndatabase
Motif andpatterndatabaseMotif andpatterndatabase
Motif andpatterndatabase
 
Snps and microarray
Snps and microarraySnps and microarray
Snps and microarray
 
Stat2013
Stat2013Stat2013
Stat2013
 
26 nov2013seminar
26 nov2013seminar26 nov2013seminar
26 nov2013seminar
 
Stat2013
Stat2013Stat2013
Stat2013
 
Presentation2013
Presentation2013Presentation2013
Presentation2013
 
Lecture7,8
Lecture7,8Lecture7,8
Lecture7,8
 
Lecture5,6
Lecture5,6Lecture5,6
Lecture5,6
 
Primer designgeneprediction
Primer designgenepredictionPrimer designgeneprediction
Primer designgeneprediction
 
Lecture 3,4
Lecture 3,4Lecture 3,4
Lecture 3,4
 
Lecture 1,2
Lecture 1,2Lecture 1,2
Lecture 1,2
 
Sequence Alignment,Blast, Fasta, MSA
Sequence Alignment,Blast, Fasta, MSASequence Alignment,Blast, Fasta, MSA
Sequence Alignment,Blast, Fasta, MSA
 
Databases Part II
Databases Part IIDatabases Part II
Databases Part II
 
Genome sequencingprojects
Genome sequencingprojectsGenome sequencingprojects
Genome sequencingprojects
 
Human encodeproject
Human encodeprojectHuman encodeproject
Human encodeproject
 
Tyler presentation
Tyler presentationTyler presentation
Tyler presentation
 
Tyler presentation
Tyler presentationTyler presentation
Tyler presentation
 

Dernier

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Krashi Coaching
 

Dernier (20)

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 

Biological databases

  • 2. Introduction.  History of Genome Sequencing.  Rationale behind genome sequencing.  How genomes are sequenced.  What happens next. ◦ Assembly and Annotation. ◦ Sequence Submissions.  Microbial Genome Sequencing.  Human Genome Project. ◦ Encode Project. ◦ 1000 genomes project.
  • 3. Write a paragraph (less than 1000 characters) on “why you think more genomes need to be sequenced OR not sequenced”.  tsucheta@iicb.res.in/tsucheta@gmail.com
  • 4. Literature search databases.  NA and protein databases.  Animal and plant databases  Ensembl Genome project  TIGR Database.  Biotechnological databases  Database for species identification and classification  Structural databases  Database retrieval and deposition schemes
  • 5. What are databases?  Components.  Types of Databases.  Applications and Limitations.  Journals Publishing databases.
  • 6. Database management Systems ◦ Mysql ◦ Oracle ◦ Postgress ◦ Sqlserver ◦ MS Access ….
  • 7. A DBMS in the backend. ◦ SQL scripting ◦ PL/SQLs ◦ Other scripting interfaces(C/C++/API)  A front end UI. ◦ PHP ◦ Perl/CGI ◦ VB
  • 8. Files are not enough  Searching.  Sorting.  Combining data types.  Organizing.  Managing.
  • 9. Sequence data in genbank.  HTML files.  Excel files.  Regular list.  Indexes.  Flat files.
  • 10. Biological databases ◦ MetaBase ( A database of Biological databases) ◦ http://metadatabase.org/  Bibliographic databases  Chemical databases  Numerous other databases.
  • 11. Sequence databases. ◦ Nucleotide ◦ Protein  Structure Databases.  Genome databases.  Transcriptome databases  Model organism databases. ◦ PlasmoDB, TAIR, FlyBase etc.
  • 12.
  • 13.
  • 14. http://asia.ensembl.org/Help/Movie?id=210
  • 15.
  • 16. Genbank DDBJ EBI
  • 17. Gbrowse  UCSC Genome Browser  Vista Browser  Ensembl browser  Integrated Genome Browser
  • 18. PUBMED ◦ 22.1 million records ◦ eTBLAST  CABI  SCOPUS  Google Scholar
  • 19. Organized information.  Maintained and upgraded.  Visualization tools.
  • 20. So many database to look for  Not many are updated  Lack of proper documentation
  • 21. Database  Nucleic Acids Research  BMC Genomics  Bioinformatics  Nature  Cell  Plant Cell
  • 22. Pick any database of your choice and state why you like it. (1000 characters)