Towards integration of systems biology and biomedical
                             ontologies

                                             Robert Hoehndorf

                                             Department of Genetics
                                             University of Cambridge


                                               29 March 2011




Robert Hoehndorf (University of Cambridge)        Harvesting SBML      29 March 2011   1 / 28
Introduction   Motivation


  Motivation




Introduction   Ontology


  Applied ontology


           ontology (philosophy) studies the nature of existence and categories
           of being
           an ontology (computer science) is the “explicit specification of a
           conceptualization of a domain” [Gruber, 1993]
           ontologies specify the meaning of terms in a vocabulary
           formalized ontologies can be used by computers and automated
           systems
   Applied ontology is the branch of knowledge representation that focuses
   on the content.






  Open Biomedical Ontologies (OBO)

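   [Figure: diagram relating top-level categories, OBO ontologies and levels of granularity]
           top-level categories: Individual; Physical object, Quality, Function, Process
           ontologies: ChEBI Ontology, Sequence Ontology, GO-CC, Celltype, Anatomy Ontology, Phenotype Ontology, Gene Ontology
           levels of granularity: Molecule, Gene, Transcript, Organelle, Cell, Tissue, Organ, Body, Population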





  Systems biology


           Systems biology...is about putting together rather than taking
           apart, integration rather than reduction. [Denis Noble]

           multi-scale data integration
                  domains and levels of granularity
                  species
                  kinds of data
           integration of in silico, in vitro and in vivo research
                  focus on emergent properties
           simulation of biological systems
                  predict and simulate systems’ behavior






  Systems biology
  Challenges (Kitano, 2002)




           data integration
           validation
           standard languages
                  specification
                  exchange
                  results




   Can we use ontologies to address these problems?




Harvesting SBML


  MIRIAM annotations
  Annotation of SBML




           MIRIAM provides annotation of SBML entities
           ontologies are treated as meta-data
                  search
                  semantic similarity
                  documentation
           no integration with modelling language






  MIRIAM annotations
  Information flow hypothesis




          Integration of SBML and ontologies could lead to information flow
                           between models and ontologies.

   Information flow enables the use of ontologies for
           verification,
           access to data,
           integration and combination of models.






  Ontological commitment
  Rule 1: models



   Model M annotated with A1:
           M represents an object O1
           O1 can have functions
           O1 ’s functions can be realized by processes
           model components represent parts of O1

           M SubClassOf: represents some A1
           M SubClassOf: represents some (has-function some A1)
           M SubClassOf: represents some (has-function some (realized-by only A1))
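
The three axioms can be read as three cases: the annotation A1 names an object, a function, or a process. A minimal Python sketch of that case distinction follows. The helper `model_axiom` and the kind labels are hypothetical illustrations, not part of the SBML Harvester code.

```python
# Hypothetical helper: choose the Rule 1 axiom (Manchester syntax) for a model
# class, depending on what kind of entity the annotation denotes.
def model_axiom(model: str, annotation: str, kind: str) -> str:
    if kind == "object":
        filler = annotation
    elif kind == "function":
        filler = f"(has-function some {annotation})"
    elif kind == "process":
        filler = f"(has-function some (realized-by only {annotation}))"
    else:
        raise ValueError(f"unknown annotation kind: {kind}")
    return f"{model} SubClassOf: represents some {filler}"


# A model annotated with a process term, as in the BioModel 82 example.
print(model_axiom("BIOMD0000000082", "GO:0031684", "process"))
```

The kind of an annotation would have to be looked up in the ontology it comes from; here it is passed in by hand.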





  Ontological commitment
  BioModel 82




   annotated with heterotrimeric G-protein complex cycle (GO:0031684):
           represents an object O1
           O1 has a function F1
           F1 is realized by processes of the type heterotrimeric G-protein
           complex cycle

           M SubClassOf: represents some O1
           O1 SubClassOf: has-function some (realized-by only GO:0031684)






  Ontological commitment
  Rule 2: Compartments




   Compartment C annotated with A2:
           represents an object O2
           O2 is part of O1
           compartment’s species represent objects that are located in O2

           C SubClassOf:                 represents some A2
           A2 SubClassOf:                    located-in some A1






  Ontological commitment
  Compartment “Cell” in BioModel 82



   annotated with Cell (GO:0005623):
           represents an object O2
           O2 is a kind of Cell
           O2 is part-of O1

           C SubClassOf:                 represents some O2
           O2 SubClassOf:                    Cell and part-of some O1




           O2 SubClassOf: Cell and part-of some (has-function
           some (realized-by only GO:0031684))






  Ontological commitment
  Rule 3: Species




           represents an object O3
           O3 can have functions
           O3 ’s functions can be realized by processes
           O3 can have qualities (concentration, amount, charge,...)
           located in O2






  Ontological commitment
  Species GTP in “Cell” in BioModel 82



   annotated with GTP (CHEBI:15996):
           represents an object O3
           O3 is a kind of GTP
           O3 is located-in O2

           S SubClassOf:                 represents some O3
           O3 SubClassOf:                    GTP and located-in some O2
           O3 SubClassOf: GTP and located-in some (Cell and
           part-of some (has-function some (realized-by only
           GO:0031684)))
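
The nested expressions for the compartment and the species arise by replacing each placeholder class (O1, O2) with the expression it is a subclass of. A minimal Python sketch of that substitution follows, assuming the definitions are acyclic. The `definitions` table and the `expand` helper are hypothetical names used only for illustration.

```python
import re

# Placeholder classes from the BioModel 82 example, each mapped to the class
# expression it is a subclass of on the slides.
definitions = {
    "O1": "has-function some (realized-by only GO:0031684)",
    "O2": "Cell and part-of some O1",
}


def expand(expression: str, defs: dict) -> str:
    """Replace placeholder names by their parenthesised definitions.

    Repeats until no placeholder is left, so defs must not be cyclic.
    """
    pattern = re.compile(r"\b(" + "|".join(map(re.escape, defs)) + r")\b")
    while pattern.search(expression):
        expression = pattern.sub(lambda m: f"({defs[m.group(1)]})", expression)
    return expression


# Superclass expression of O3, the object represented by the species GTP.
print(expand("GTP and located-in some O2", definitions))
```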






  Ontological commitment
  Reaction




           represents an object O3 with a function F
           F is realized by P
           P has participants (inputs, outputs and modifiers) O4
           O4 are objects represented by species
           P occurs in O1






  Ontological commitment
  Reaction GTP-binding in BioModel 82


   annotated with GTP binding (GO:0005525):
           represents an object O4
           O4 has a function F4
           F4 is a kind of GTP binding
           F4 is realized by P4
           P4 has-input O3 (GTP)

           R SubClassOf: represents some (has-function some F4)
           F4 SubClassOf: GTP binding and realized-by only P4
           P4 SubClassOf: has-input some O3
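
The reaction pattern can be sketched in the same way. This is a minimal Python illustration that assumes the reaction's inputs are already known as class names. The function `reaction_axioms` and its parameters are hypothetical and do not come from the SBML Harvester code.

```python
# Hypothetical helper: emit the axioms for one annotated reaction
# in Manchester syntax, following the pattern on this slide.
def reaction_axioms(reaction, function, annotation, process, inputs):
    axioms = [
        f"{reaction} SubClassOf: represents some (has-function some {function})",
        f"{function} SubClassOf: {annotation} and realized-by only {process}",
    ]
    # One has-input axiom per species that the reaction consumes.
    axioms += [f"{process} SubClassOf: has-input some {obj}" for obj in inputs]
    return axioms


for axiom in reaction_axioms("R", "F4", "GO:0005525", "P4", ["O3"]):
    print(axiom)
```

Outputs and modifiers could be handled with further relations in the same loop; only inputs appear on the slide, so only inputs are shown.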





  Ontological commitment
  Reaction GTP-binding in BioModel 82


   [Figure: BIOMD0000000082 - Thomsen1988 AdenylateCyclase Inhibition, and the classes its components represent.
    Left: the model with compartment "cell", species DRG, GDP and GTP, the reaction "GTP binding with DRG", and a parameter.
    Right: "World of BIOMD0000000082", which has-function (realized-by) heterotrimeric G-protein complex cycle and has-part Cell, GTP and GTP binding.
    The compartment represents "Cell in World of BIOMD0000000082" (part-of the world); the species GTP represents GTP that is part-of that cell; the reaction represents "GTP binding in world of BIOMD0000000082", which has-input GTP.]






  Ontological commitment
  BioModels Result

   Ontologies:
           FMA
           ChEBI
           GO
           Celltype
           PATO
           (KEGG, Reactome)
   Result on BioModels:
           more than 300,000 classes
           more than 800,000 axioms
           90,000 complex model annotations
   http://sbmlharvester.googlecode.com



  Inconsistency
  Compartments/species annotated with functions or processes






  Inconsistency
  Biological inconsistency: Biomodel 176








   [Term]
   id: GO:0016887
   name: ATPase activity
   is_a: GO:0017111 ! nucleoside-triphosphatase activity
   intersection_of: GO:0003824 ! catalytic activity
   intersection_of: has_input CHEBI:15377 ! water
   intersection_of: has_input CHEBI:15422 ! ATP
   intersection_of: has_output CHEBI:16761 ! ADP
   intersection_of: has_output CHEBI:26020 ! phosphates





  Knowledge retrieval

   Queries, query strings and number of results:

           Contradictorily defined entities (4,899 results)
                  Nothing

           Models which represent a process involving sugar (54 results)
                  model-of some (has-part some (has-function some (realized-by only (has-participant some sugar))))

           Parts of BIOMD0000000015 that represent processes involving sugar (29 results)
                  part-of some BIOMD0000000015 and represents some (has-function some (realized-by only (has-participant some sugar)))

           Model entities that represent the cell cycle (14 results)
                  represents some (has-part some (has-function some (realized-by only 'cell cycle')))

           Model entities that represent mutagenic central nervous system drugs in the gastrointestinal system (2 results)
                  represents some (has-part some ('has role' some 'central nervous system drug' and 'has role' some mutagen and part-of some 'Gastrointestinal system'))

           Model entities that represent catalytic activity involving sugar in the endocrine pancreas (4 results)
                  represents some (has-function some (realized-by only (realizes some 'catalytic activity' and has-participant some (sugar and contained-in some (part-of some 'Endocrine pancreas')))))
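
Query strings of this shape are nested existential and universal restrictions, so they can be composed from small helpers rather than typed by hand. A minimal Python sketch follows; the helpers `some` and `only` are hypothetical conveniences and are not part of any OWL library.

```python
def _wrap(filler: str) -> str:
    # Parenthesise anything that is not a single token. Quoted multi-word
    # names get wrapped as well, which is redundant but harmless.
    return f"({filler})" if " " in filler else filler


def some(prop: str, filler: str) -> str:
    """Existential restriction: prop some filler."""
    return f"{prop} some {_wrap(filler)}"


def only(prop: str, filler: str) -> str:
    """Universal restriction: prop only filler."""
    return f"{prop} only {_wrap(filler)}"


# Processes in which some sugar participates.
sugar_process = only("realized-by", some("has-participant", "sugar"))

# Models which represent a process involving sugar.
query = some("model-of", some("has-part", some("has-function", sugar_process)))
print(query)
```

Building the strings this way keeps the parentheses balanced by construction, which is easy to get wrong in the longer queries.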




Conclusions


  Future research
  Towards integration of systems biology and biomedical ontology




           extension to other modelling frameworks (CellML, FieldML, ...)
           application to other resources
                  YeastNet
           knowledge discovery
                  ontology of functions (of chemicals)
                  model comparison
                  model composition






  Acknowledgements



           George Gkoutos
           Michel Dumontier
           Dan Cook
           Bernard de Bono
           John Gennari
           Pierre Grenon
           Sarala Wimalaratne






  Thank you!




   Biomodels, YeastNet in OWL:
   http://sbmlharvester.googlecode.com
   Modularization:
   http://el-vira.googlecode.com




Robert Hoehndorf (University of Cambridge)       Harvesting SBML   29 March 2011   28 / 28

Contenu connexe

Similaire à Towards integration of systems biology and biomedical ontologies

ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog...
ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog...ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog...
ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog...
dolleyj
 
Basic Formal Ontology (BFO) and Disease
 Basic Formal Ontology (BFO) and Disease Basic Formal Ontology (BFO) and Disease
Basic Formal Ontology (BFO) and Disease
Barry Smith
 
Biomedical ontology tutorial_atlanta_june2011_part2
Biomedical ontology tutorial_atlanta_june2011_part2Biomedical ontology tutorial_atlanta_june2011_part2
Biomedical ontology tutorial_atlanta_june2011_part2
Barry Smith
 
Formal Ontology Meets Industry: Best Practices
Formal Ontology Meets Industry: Best PracticesFormal Ontology Meets Industry: Best Practices
Formal Ontology Meets Industry: Best Practices
David Koepsell
 
Venkatesan bosc2010 onto-toolkit
Venkatesan bosc2010 onto-toolkitVenkatesan bosc2010 onto-toolkit
Venkatesan bosc2010 onto-toolkit
BOSC 2010
 

Similaire à Towards integration of systems biology and biomedical ontologies (20)

Using multiple ontologies to characterise the bioactivity of small molecules
Using multiple ontologies to characterise the bioactivity of small moleculesUsing multiple ontologies to characterise the bioactivity of small molecules
Using multiple ontologies to characterise the bioactivity of small molecules
 
Applying Ontology Design Patterns in bio-ontologies
Applying Ontology Design Patterns in bio-ontologiesApplying Ontology Design Patterns in bio-ontologies
Applying Ontology Design Patterns in bio-ontologies
 
OBO Foundry
OBO FoundryOBO Foundry
OBO Foundry
 
Accurate biochemical knowledge starting with precise structure-based criteria...
Accurate biochemical knowledge starting with precise structure-based criteria...Accurate biochemical knowledge starting with precise structure-based criteria...
Accurate biochemical knowledge starting with precise structure-based criteria...
 
University of Toronto Chemistry Librarians Workshop June 2012
University of Toronto Chemistry Librarians Workshop June 2012University of Toronto Chemistry Librarians Workshop June 2012
University of Toronto Chemistry Librarians Workshop June 2012
 
Introduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental BiologyIntroduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental Biology
 
Hyperontology for the biomedical ontologist
Hyperontology for the biomedical ontologistHyperontology for the biomedical ontologist
Hyperontology for the biomedical ontologist
 
ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog...
ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog...ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog...
ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog...
 
Julio Peironcely @ ICCS 2011
Julio Peironcely @ ICCS 2011Julio Peironcely @ ICCS 2011
Julio Peironcely @ ICCS 2011
 
Basic Formal Ontology (BFO) and Disease
 Basic Formal Ontology (BFO) and Disease Basic Formal Ontology (BFO) and Disease
Basic Formal Ontology (BFO) and Disease
 
Modularity requirements in bio-ontologies: a case study of ChEBI
Modularity requirements in bio-ontologies: a case study of ChEBIModularity requirements in bio-ontologies: a case study of ChEBI
Modularity requirements in bio-ontologies: a case study of ChEBI
 
Bio-ontologies in bioinformatics: Growing up challenges
Bio-ontologies in bioinformatics: Growing up challengesBio-ontologies in bioinformatics: Growing up challenges
Bio-ontologies in bioinformatics: Growing up challenges
 
Towards a Rapid Model Prototyping Strategy for Systems & Synthetic Biology
Towards a Rapid Model Prototyping  Strategy for Systems & Synthetic BiologyTowards a Rapid Model Prototyping  Strategy for Systems & Synthetic Biology
Towards a Rapid Model Prototyping Strategy for Systems & Synthetic Biology
 
Biomedical ontology tutorial_atlanta_june2011_part2
Biomedical ontology tutorial_atlanta_june2011_part2Biomedical ontology tutorial_atlanta_june2011_part2
Biomedical ontology tutorial_atlanta_june2011_part2
 
Formal Ontology Meets Industry: Best Practices
Formal Ontology Meets Industry: Best PracticesFormal Ontology Meets Industry: Best Practices
Formal Ontology Meets Industry: Best Practices
 
Venkatesan bosc2010 onto-toolkit
Venkatesan bosc2010 onto-toolkitVenkatesan bosc2010 onto-toolkit
Venkatesan bosc2010 onto-toolkit
 
From chemicals to minds: Integrated ontologies in the search for scientific u...
From chemicals to minds: Integrated ontologies in the search for scientific u...From chemicals to minds: Integrated ontologies in the search for scientific u...
From chemicals to minds: Integrated ontologies in the search for scientific u...
 
20090511 Manchester Biochemistry
20090511 Manchester Biochemistry20090511 Manchester Biochemistry
20090511 Manchester Biochemistry
 
Realism for Scientific Ontologies
Realism for Scientific OntologiesRealism for Scientific Ontologies
Realism for Scientific Ontologies
 
Synthetic Biology - Modeling and Optimisation
Synthetic Biology -  Modeling and OptimisationSynthetic Biology -  Modeling and Optimisation
Synthetic Biology - Modeling and Optimisation
 

Dernier

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Dernier (20)

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 

Towards integration of systems biology and biomedical ontologies

  • 1. Towards integration of systems biology and biomedical ontologies Robert Hoehndorf Department of Genetics University of Cambridge 29 March 2011 Robert Hoehndorf (University of Cambridge) Harvesting SBML 29 March 2011 1 / 28
  • 2. Introduction Motivation Motivation Robert Hoehndorf (University of Cambridge) Harvesting SBML 29 March 2011 2 / 28
  • 3. Introduction Motivation Motivation Robert Hoehndorf (University of Cambridge) Harvesting SBML 29 March 2011 3 / 28
  • 4. Introduction Ontology Applied ontology ontology (philosophy) studies the nature of existence and categories of being an ontology (computer science) is the “explicit specification of a conceptualization of a domain” [Gruber, 1993] ontologies specify the meaning of terms in a vocabulary formalized ontologies can be used by computers and automated systems Applied ontology is the branch of knowledge representation that focuses on the content. Robert Hoehndorf (University of Cambridge) Harvesting SBML 29 March 2011 4 / 28
  • 5. Introduction Ontology Open Biomedical Ontologies (OBO) Individual Physical object Quality Function Process ChEBI Ontology Molecule Gene Sequence Ontology Transcript GO-CC Organelle Celltype Gene Ontology Cell Phenotype Tissue Ontology Organ Anatomy Ontology Body Population Robert Hoehndorf (University of Cambridge) Harvesting SBML 29 March 2011 5 / 28
  • 6. Introduction Ontology Systems biology Systems biology...is about putting together rather than taking apart, integration rather than reduction. [Denis Noble] multi-scale data integration domains and levels of granularity species kinds of data integration of in silico, in vitro and in vivo research focus on emergent properties simulation of biological systems predict and simulate systems’ behavior Robert Hoehndorf (University of Cambridge) Harvesting SBML 29 March 2011 6 / 28
  • 7. Introduction Ontology Systems biology Challenges (Kitano, 2002) data integration validation standard languages specification exchange results Robert Hoehndorf (University of Cambridge) Harvesting SBML 29 March 2011 7 / 28
  • 8. Introduction Ontology Systems biology Challenges (Kitano, 2002) data integration validation standard languages specification exchange results Can we use ontologies to address these problems? Robert Hoehndorf (University of Cambridge) Harvesting SBML 29 March 2011 7 / 28
  • 9. Harvesting SBML MIRIAM annotations Annotation of SBML Robert Hoehndorf (University of Cambridge) Harvesting SBML 29 March 2011 8 / 28
  • 10. Harvesting SBML MIRIAM annotations Annotation of SBML MIRIAM provides annotation of SBML entities ontologies are treated as meta-data search semantic similarity documentation no integration with modelling language Robert Hoehndorf (University of Cambridge) Harvesting SBML 29 March 2011 9 / 28
  • 11. Harvesting SBML MIRIAM annotations Information flow hypothesis Integration of SBML and ontologies could lead to information flow between models and ontologies. Information flow enables the use of ontologies for verification, access to data, integration and combination of models. Robert Hoehndorf (University of Cambridge) Harvesting SBML 29 March 2011 10 / 28
  • 12. Harvesting SBML MIRIAM annotations Robert Hoehndorf (University of Cambridge) Harvesting SBML 29 March 2011 11 / 28
  • 13. Harvesting SBML Ontological commitment Rule 1: models Model M annotated with A1: M represents an object O1 O1 can have functions O1 ’s functions can be realized by processes model components represent parts of O1 M SubClassOf: represents some A1 M SubClassOf: represents some (has-function some A1) M SubClassOf: represents some (has-function some (realized-by only A1) Robert Hoehndorf (University of Cambridge) Harvesting SBML 29 March 2011 12 / 28
  • 14. Harvesting SBML Ontological commitment BioModel 82 annotated with heterotrimeric G-protein complex cycle (GO:0031684): represents an object O1 O1 has a function F1 F1 is realized by processes of the type heterotrimeric G-protein complex cycle M SubClassOf: represents some O1 O1 SubClassOf: (has-function some (realized-by only GO:0031684) Robert Hoehndorf (University of Cambridge) Harvesting SBML 29 March 2011 13 / 28
  • 15. Harvesting SBML: Ontological commitment. Rule 2: Compartments. A compartment C annotated with A2 represents an object O2 that is part of O1; the compartment's species represent objects that are located in O2.
      C SubClassOf: represents some A2
      A2 SubClassOf: located-in some A1
  • 17. Harvesting SBML: Ontological commitment. Compartment "Cell" in BioModel 82, annotated with Cell (GO:0005623), represents an object O2; O2 is a kind of Cell; O2 is part-of O1.
      C SubClassOf: represents some O2
      O2 SubClassOf: Cell and part-of some O1
      O2 SubClassOf: Cell and part-of some (has-function some (realized-by only GO:0031684))
  • 18. Harvesting SBML: Ontological commitment. Rule 3: Species. A species represents an object O3; O3 can have functions; O3's functions can be realized by processes; O3 can have qualities (concentration, amount, charge, ...); O3 is located in O2.
  • 19. Harvesting SBML: Ontological commitment. Species GTP in "Cell" in BioModel 82, annotated with GTP (CHEBI:15996), represents an object O3; O3 is a kind of GTP; O3 is located-in O2.
      S SubClassOf: represents some O3
      O3 SubClassOf: GTP and located-in some O2
      O3 SubClassOf: GTP and located-in some (Cell and part-of some (has-function some (realized-by only GO:0031684)))
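
The unfolded axiom for the GTP species is built by nesting the model, compartment and species expressions inside one another. The sketch below shows that composition with plain string functions; the helper names `world_of`, `compartment_expr` and `species_expr` are hypothetical and chosen only for this example.

```python
def world_of(process_class):
    """Object whose function is realized only by the annotated process (model level)."""
    return f"has-function some (realized-by only {process_class})"


def compartment_expr(compartment_class, world_expr):
    """Object of the compartment's class that is part of the modelled object."""
    return f"{compartment_class} and part-of some ({world_expr})"


def species_expr(species_class, compartment_expression):
    """Object of the species' class that is located in the compartment's object."""
    return f"{species_class} and located-in some ({compartment_expression})"


# Model annotation -> compartment annotation -> species annotation.
world = world_of("GO:0031684")
cell = compartment_expr("Cell", world)
gtp = species_expr("GTP", cell)
print(f"O3 SubClassOf: {gtp}")
```

Keeping each level as its own function mirrors the way each rule only adds one layer around the expression produced by the previous rule.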
  • 20. Harvesting SBML: Ontological commitment. Reaction. A reaction represents an object O3 with a function F; F is realized by P; P has participants O4 (inputs, outputs and modifiers); the O4 are objects represented by species; P occurs in O1.
  • 21. Harvesting SBML: Ontological commitment. Reaction GTP-binding in BioModel 82, annotated with GTP binding (GO:0005525), represents an object O4; O4 has a function F4; F4 is a kind of GTP binding; F4 is realized by P4; P4 has-input O3 (GTP).
      R SubClassOf: represents some (has-function some F4)
      F4 SubClassOf: GTP binding and realized-by only P4
      P4 SubClassOf: has-input some O3
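
The reaction pattern produces several axioms at once: one for the reaction, one for its function and one per participant. A small sketch under the assumption that only inputs are handled (outputs and modifiers would follow the same shape); the function name `reaction_axioms` and its parameters are hypothetical.

```python
def reaction_axioms(reaction, function_name, function_class, process_name, inputs):
    """Return the axioms for one annotated reaction as Manchester-style strings."""
    axioms = [
        # The reaction represents something that has the function.
        f"{reaction} SubClassOf: represents some (has-function some {function_name})",
        # The function is of the annotated class and realized only by the process.
        f"{function_name} SubClassOf: {function_class} and realized-by only {process_name}",
    ]
    # One axiom per input species' object.
    axioms += [f"{process_name} SubClassOf: has-input some {obj}" for obj in inputs]
    return axioms


for axiom in reaction_axioms("R", "F4", "'GTP binding'", "P4", ["O3"]):
    print(axiom)
```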
  • 22. Harvesting SBML: Ontological commitment. Reaction GTP-binding in BioModel 82. [Figure: graph linking BIOMD0000000082 (Thomsen1988 AdenylateCyclase Inhibition), its compartment "cell", its species (DRG, GDP, GTP), its reaction "GTP binding with DRG" and a parameter to the entities they represent in the "World of BIOMD0000000082", using the relations represents, has-part, part-of, has-function (realized-by) and has-input.]
  • 23. Harvesting SBML: Ontological commitment. BioModels result. Ontologies: FMA, ChEBI, GO, Celltype, PATO (KEGG, Reactome). Result on BioModels: more than 300,000 classes; more than 800,000 axioms; 90,000 complex model annotations. http://sbmlharvester.googlecode.com
  • 24. Harvesting SBML: Inconsistency. Compartments/species annotated with functions or processes.
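
This kind of inconsistency can be pictured as a category check: compartments and species are expected to represent objects, so an annotation with a function or process class is suspicious. The sketch below uses a hand-written lookup table in place of a reasoner; the `CATEGORY` table, the `bad_species` entry and the function name `misannotated` are assumptions made for illustration, and only the class identifiers come from the talk.

```python
# Hypothetical lookup: upper-level category of each annotation class.
CATEGORY = {
    "GO:0005623": "object",    # cell
    "CHEBI:15996": "object",   # GTP
    "GO:0005525": "function",  # GTP binding
    "GO:0031684": "process",   # heterotrimeric G-protein complex cycle
}


def misannotated(entities):
    """Return (id, annotation) for compartments/species annotated with a function or process.

    entities is an iterable of (entity_id, sbml_kind, annotation) triples.
    """
    return [
        (entity_id, annotation)
        for entity_id, sbml_kind, annotation in entities
        if sbml_kind in {"compartment", "species"}
        and CATEGORY.get(annotation) in {"function", "process"}
    ]


entities = [
    ("cell", "compartment", "GO:0005623"),
    ("GTP", "species", "CHEBI:15996"),
    ("bad_species", "species", "GO:0005525"),  # invented example of a mismatch
]
print(misannotated(entities))
```

In the approach described in the talk, such mismatches surface as unsatisfiable classes once the axioms are combined with the ontologies, rather than through an explicit table like this one.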
  • 25. Harvesting SBML: Inconsistency. Biological inconsistency: Biomodel 176.
  • 26. Harvesting SBML: Inconsistency. Biological inconsistency: Biomodel 176.
      [Term]
      id: GO:0016887
      name: ATPase activity
      is_a: GO:0017111 ! nucleoside-triphosphatase activity
      intersection_of: GO:0003824 ! catalytic activity
      intersection_of: has_input CHEBI:15377 ! water
      intersection_of: has_input CHEBI:15422 ! ATP
      intersection_of: has_output CHEBI:16761 ! ADP
      intersection_of: has_output CHEBI:26020 ! phosphates
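
To show what the ATPase activity definition requires of any process that realizes it, the sketch below parses the stanza with a few lines of Python and collects the inputs and outputs named in its `intersection_of` lines. It is a simplified reader written for this example, not a full OBO parser, and the function names `parse_stanza` and `required_participants` are hypothetical. How exactly Biomodel 176 conflicts with this definition is not stated on the slide, so no comparison against the model is attempted here.

```python
STANZA = """
[Term]
id: GO:0016887
name: ATPase activity
is_a: GO:0017111 ! nucleoside-triphosphatase activity
intersection_of: GO:0003824 ! catalytic activity
intersection_of: has_input CHEBI:15377 ! water
intersection_of: has_input CHEBI:15422 ! ATP
intersection_of: has_output CHEBI:16761 ! ADP
intersection_of: has_output CHEBI:26020 ! phosphates
"""


def parse_stanza(text):
    """Map each tag to the list of its values, dropping '! comment' suffixes."""
    tags = {}
    for line in text.strip().splitlines():
        line = line.strip()
        if not line or line.startswith("["):
            continue
        tag, _, value = line.partition(":")
        value = value.split("!")[0].strip()
        tags.setdefault(tag.strip(), []).append(value)
    return tags


def required_participants(tags):
    """Collect 'relation filler' pairs from intersection_of lines, grouped by relation."""
    required = {}
    for value in tags.get("intersection_of", []):
        parts = value.split()
        if len(parts) == 2:  # skip the genus line, which has no relation
            relation, filler = parts
            required.setdefault(relation, set()).add(filler)
    return required


tags = parse_stanza(STANZA)
print(tags["id"], tags["name"])
print(required_participants(tags))
```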
  • 27. Harvesting SBML: Knowledge retrieval.
      Query | Query string | # results
      Contradictorily defined entities | Nothing | 4,899
      Models which represent a process involving sugar | model-of some (has-part some (has-function some (realized-by only (has-participant some sugar)))) | 54
      Parts of BIOMD0000000015 that represent processes involving sugar | part-of some BIOMD0000000015 and represents some (has-function some (realized-by only (has-participant some sugar))) | 29
      Model entities that represent the cell cycle | represents some (has-part some (has-function some (realized-by only 'cell cycle'))) | 14
      Model entities that represent mutagenic central nervous system drugs in the gastrointestinal system | represents some (has-part some ('has role' some 'central nervous system drug' and 'has role' some mutagen and part-of some 'Gastrointestinal system')) | 2
      Model entities that represent catalytic activity involving sugar in the endocrine pancreas | represents some (has-function some (realized-by only (realizes some 'catalytic activity' and has-participant some (sugar and contained-in some (part-of some 'Endocrine pancreas'))))) | 4
  • 28. Conclusions: Future research. Towards integration of systems biology and biomedical ontology: extension to other modelling frameworks (CellML, FieldML, ...); application to other resources (YeastNet); knowledge discovery; ontology of functions (of chemicals); model comparison; model composition.
  • 29. Conclusions: Acknowledgements. George Gkoutos, Michel Dumontier, Dan Cook, Bernard de Bono, John Gennari, Pierre Grenon, Sarala Wimalaratne.
  • 30. Conclusions. Thank you! BioModels and YeastNet in OWL: http://sbmlharvester.googlecode.com. Modularization: http://el-vira.googlecode.com