SlideShare une entreprise Scribd logo
1  sur  14
Genome
Database
Systems
H.C Korala
114072l
Overview
• Introduction
• Background
• Major research areas
• Applications and Impact on society
• Discussion
Introduction
“Bioinformatics is an interdisciplinary field which is a
combination of many areas such as computer science,
statistics, mathematics and engineering to process
biological data ”
• What is BIOINFORMATICS ?
• GENOME
“The genome encodes all the information of the
Deoxyribonucleic Acid (DNA), the
macromolecule containing the genetic heritage
of each living being.”
• Increased electronically stored biological data
• Role of databases in the field of biology
• The study of genes and proteins has become an extremely
important area in the modern day biology and they are
better known as genomics and proteomics
• Genome databases store this information and differently
from gene databases the genome databases contain both
coding and non-coding intergenic sequences.
Background
Genome Databases
• Saccharomyces genome database
• Mouse genome database
• Human genome database
• European mutant mouse pathology database
• Mito Map
• Kyoto Encyclopedia of Genes and Genomes
Genome Databases
• High complex data
• Schema changes at a rapid pace
• Range of variability in data is high
• Complex queries
Characteristics of genome database
systems
• Non standard and unstructured data
• Complex query processing
• Data interpretation and meta data management
• Data integration across related databases
• Uniform data management solutions
Key Areas in Data Management in
Genome Database Systems
• Graph Based Genome Database Systems
• Generic genome browser
• Networked database environment for human
genome data
• The ENSEMBL genome database project
• MITOMAP
• The BIOGRID Interaction Database
• Data management for high throughput genomics
Major researches
• For research purposes
• Health Industry
• Study how variation in human genetics leads to
variation in response to drugs.
• Disease studies for large populations
• Drug Designing
Applications and Impact
on Society
• Genome database systems is one of the major research
areas in the world
• Health care, pharmaceutical organizations spend tons of
money to do researches to maintain genome databases
and implement effective and efficient tools to analyze
data.
• The main concerns in genome database systems are the
variability of the data types that they are associated with,
high throughput of genomic data, Meta data management,
data storage problem, complex query and complex
calculations are needed and data integration with different
databases
DISCUSSION
• Mainly the issues have occurred because of the
complexity of the data types in genome databases and
therefore the traditional relational DBMS concepts and
Object DBMS concepts are not the most suitable
concepts to tackle these issues.
• Researches have been carried out to discover novel
methods of representing data types in genome databases
• Some of the researches are targeting to identifying ways
which can be used to couple with the existing database
management systems so that only an extension is needed
for the usage.
• The people who use genome database systems are
scientists, academics and other people who are in the field
of genomics.
• Most of these people do not have a good knowledge in
Information Technology.
• Therefore improving the synergy between these two
fields is an important task,
Thank you !

Contenu connexe

Tendances

Tendances (20)

Genomic databases
Genomic databasesGenomic databases
Genomic databases
 
Cath
CathCath
Cath
 
OMIM Database
OMIM DatabaseOMIM Database
OMIM Database
 
Genome annotation
Genome annotationGenome annotation
Genome annotation
 
Protein protein interactions
Protein protein interactionsProtein protein interactions
Protein protein interactions
 
TrEMBL
TrEMBLTrEMBL
TrEMBL
 
dot plot analysis
dot plot analysisdot plot analysis
dot plot analysis
 
Scoring schemes in bioinformatics (blosum)
Scoring schemes in bioinformatics (blosum)Scoring schemes in bioinformatics (blosum)
Scoring schemes in bioinformatics (blosum)
 
NCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology InformationNCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology Information
 
Swiss prot database
Swiss prot databaseSwiss prot database
Swiss prot database
 
Fasta
FastaFasta
Fasta
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
European molecular biology laboratory (EMBL)
European molecular biology laboratory (EMBL)European molecular biology laboratory (EMBL)
European molecular biology laboratory (EMBL)
 
Prosite
PrositeProsite
Prosite
 
Protein database
Protein databaseProtein database
Protein database
 
Needleman-Wunsch Algorithm
Needleman-Wunsch AlgorithmNeedleman-Wunsch Algorithm
Needleman-Wunsch Algorithm
 
Whole genome shotgun sequencing
Whole genome shotgun sequencingWhole genome shotgun sequencing
Whole genome shotgun sequencing
 
Dynamic programming and pairwise sequence alignment
Dynamic programming and pairwise sequence alignmentDynamic programming and pairwise sequence alignment
Dynamic programming and pairwise sequence alignment
 
sequence of file formats in bioinformatics
sequence of file formats in bioinformaticssequence of file formats in bioinformatics
sequence of file formats in bioinformatics
 
Est database
Est databaseEst database
Est database
 

En vedette

Protein databases
Protein databasesProtein databases
Protein databases
sarumalay
 
Nucleic acid database
Nucleic acid database Nucleic acid database
Nucleic acid database
bhargvi sharma
 

En vedette (20)

Protein databases
Protein databasesProtein databases
Protein databases
 
High throughput sequencing
High throughput sequencingHigh throughput sequencing
High throughput sequencing
 
Cheminformatics
CheminformaticsCheminformatics
Cheminformatics
 
Nucleic acid database
Nucleic acid database Nucleic acid database
Nucleic acid database
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databases
 
SAGE (Serial analysis of Gene Expression)
SAGE (Serial analysis of Gene Expression)SAGE (Serial analysis of Gene Expression)
SAGE (Serial analysis of Gene Expression)
 
PowerMV
PowerMV PowerMV
PowerMV
 
Lyme disease
Lyme diseaseLyme disease
Lyme disease
 
Structural Bioinformatics - Homology modeling & its Scope
Structural Bioinformatics - Homology modeling & its ScopeStructural Bioinformatics - Homology modeling & its Scope
Structural Bioinformatics - Homology modeling & its Scope
 
Errors and Limitaions of Next Generation Sequencing
Errors and Limitaions of Next Generation SequencingErrors and Limitaions of Next Generation Sequencing
Errors and Limitaions of Next Generation Sequencing
 
PERL- Bioperl modules
PERL- Bioperl modulesPERL- Bioperl modules
PERL- Bioperl modules
 
Addressing the shortage of medical doctors in zambia
Addressing the shortage of medical doctors in zambiaAddressing the shortage of medical doctors in zambia
Addressing the shortage of medical doctors in zambia
 
Sequence database
Sequence databaseSequence database
Sequence database
 
Protein database ..... of NCBI
Protein database ..... of NCBI Protein database ..... of NCBI
Protein database ..... of NCBI
 
Clustering and Visualisation using R programming
Clustering and Visualisation using R programmingClustering and Visualisation using R programming
Clustering and Visualisation using R programming
 
Protein Data Bank
Protein Data BankProtein Data Bank
Protein Data Bank
 
PROTEIN DATABASE
PROTEIN DATABASEPROTEIN DATABASE
PROTEIN DATABASE
 
MASCOT
MASCOTMASCOT
MASCOT
 
Protein-protein interaction (PPI)
Protein-protein interaction (PPI)Protein-protein interaction (PPI)
Protein-protein interaction (PPI)
 
Cytoscape plugins - GeneMania and CentiScape
Cytoscape plugins - GeneMania and CentiScapeCytoscape plugins - GeneMania and CentiScape
Cytoscape plugins - GeneMania and CentiScape
 

Similaire à Genome Database Systems

6-005-1430-Keeppanasseril
6-005-1430-Keeppanasseril6-005-1430-Keeppanasseril
6-005-1430-Keeppanasseril
med20su
 
Database technologies in bioinformatics
Database technologies in bioinformaticsDatabase technologies in bioinformatics
Database technologies in bioinformatics
Gleb Sklyr
 

Similaire à Genome Database Systems (20)

Career oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of BioinformaticsCareer oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
introduction to bioinfromatics.pptx
introduction to bioinfromatics.pptxintroduction to bioinfromatics.pptx
introduction to bioinfromatics.pptx
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
The human genome project
The human genome projectThe human genome project
The human genome project
 
BASIC OF BIOINFORMATICS.pptx
BASIC OF BIOINFORMATICS.pptxBASIC OF BIOINFORMATICS.pptx
BASIC OF BIOINFORMATICS.pptx
 
Computer science history.pdf
Computer science history.pdfComputer science history.pdf
Computer science history.pdf
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Basics Of Bioinformatics .pptx
Basics Of Bioinformatics .pptxBasics Of Bioinformatics .pptx
Basics Of Bioinformatics .pptx
 
Basic of bioinformatics
Basic of bioinformaticsBasic of bioinformatics
Basic of bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics the tool of biotechnology
Bioinformatics the tool of biotechnologyBioinformatics the tool of biotechnology
Bioinformatics the tool of biotechnology
 
Amia tb-review-08
Amia tb-review-08Amia tb-review-08
Amia tb-review-08
 
Genomics and Bioinformatics
Genomics and BioinformaticsGenomics and Bioinformatics
Genomics and Bioinformatics
 
6-005-1430-Keeppanasseril
6-005-1430-Keeppanasseril6-005-1430-Keeppanasseril
6-005-1430-Keeppanasseril
 
Bioinformatics.pptx
Bioinformatics.pptxBioinformatics.pptx
Bioinformatics.pptx
 
Genomics and proteomics by shreeman
Genomics and proteomics by shreemanGenomics and proteomics by shreeman
Genomics and proteomics by shreeman
 
The Biodiversity Informatics Landscape
The Biodiversity Informatics LandscapeThe Biodiversity Informatics Landscape
The Biodiversity Informatics Landscape
 
Database technologies in bioinformatics
Database technologies in bioinformaticsDatabase technologies in bioinformatics
Database technologies in bioinformatics
 
Current Trends & Developments of Bioinformatics
Current Trends & Developments of BioinformaticsCurrent Trends & Developments of Bioinformatics
Current Trends & Developments of Bioinformatics
 

Dernier

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Dernier (20)

Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 

Genome Database Systems

  • 2. Overview • Introduction • Background • Major research areas • Applications and Impact on society • Discussion
  • 3. Introduction “Bioinformatics is an interdisciplinary field which is a combination of many areas such as computer science, statistics, mathematics and engineering to process biological data ” • What is BIOINFORMATICS ?
  • 4. • GENOME “The genome encodes all the information of the Deoxyribonucleic Acid (DNA), the macromolecule containing the genetic heritage of each living being.” • Increased electronically stored biological data • Role of databases in the field of biology
  • 5. • The study of genes and proteins has become an extremely important area in the modern day biology and they are better known as genomics and proteomics • Genome databases store this information and differently from gene databases the genome databases contain both coding and non-coding intergenic sequences. Background Genome Databases
  • 6. • Saccharomyces genome database • Mouse genome database • Human genome database • European mutant mouse pathology database • Mito Map • Kyoto Encyclopedia of Genes and Genomes Genome Databases
  • 7. • High complex data • Schema changes at a rapid pace • Range of variability in data is high • Complex queries Characteristics of genome database systems
  • 8. • Non standard and unstructured data • Complex query processing • Data interpretation and meta data management • Data integration across related databases • Uniform data management solutions Key Areas in Data Management in Genome Database Systems
  • 9. • Graph Based Genome Database Systems • Generic genome browser • Networked database environment for human genome data • The ENSEMBL genome database project • MITOMAP • The BIOGRID Interaction Database • Data management for high throughput genomics Major researches
  • 10. • For research purposes • Health Industry • Study how variation in human genetics leads to variation in response to drugs. • Disease studies for large populations • Drug Designing Applications and Impact on Society
  • 11. • Genome database systems is one of the major research areas in the world • Health care, pharmaceutical organizations spend tons of money to do researches to maintain genome databases and implement effective and efficient tools to analyze data. • The main concerns in genome database systems are the variability of the data types that they are associated with, high throughput of genomic data, Meta data management, data storage problem, complex query and complex calculations are needed and data integration with different databases DISCUSSION
  • 12. • Mainly the issues have occurred because of the complexity of the data types in genome databases and therefore the traditional relational DBMS concepts and Object DBMS concepts are not the most suitable concepts to tackle these issues. • Researches have been carried out to discover novel methods of representing data types in genome databases • Some of the researches are targeting to identifying ways which can be used to couple with the existing database management systems so that only an extension is needed for the usage.
  • 13. • The people who use genome database systems are scientists, academics and other people who are in the field of genomics. • Most of these people do not have a good knowledge in Information Technology. • Therefore improving the synergy between these two fields is an important task,