SlideShare a Scribd company logo
1 of 11
Integration of GO, Pathway data and Interaction data Chris Mungall Peter D’Eustachio
The GO was originally intended to integrate databases How are we doing? Interoperability of genomic databases is limited by this lack of progress, and it is this major obstacle that the Gene Ontology (GO) Consortium was formed to address Gene Ontology: Tool for theUnificationofBiology. Nat Genet 2000 SGD FB GOA
GO The GO was originally intended to integrate databases How are we doing? Not as well as we could! GO SGD FB GOA Pathway Commons IMEX Reactome Cyc … BioGRID Intact …
Integration enhances analyses and reduces workload Division of labor leave specialized curation to specialized systems biology databases but data needs to be re-combined to prevent siloing GO is an invaluable single-stop shop for term enrichment etc Can we quantify how integrating with systems biology databases helps users? Yes! We can do the experiment: GO term enrichment analysis on all MolSigDB withReactome annotations Also include Reactome inputs/outputs, not currently in GOA withoutReactomeannotations
Integration enhances analyses  GOA+R: Many p-values will significantly improved Recapitulated biologically valid results that would have been suppressed had one single resource been used Examples: Genes down-regulated in Alzheimers
How are we currently integrating systems biology datasets? Interaction data Currently Intact, soon IMEX “protein binding” and “self-protein binding” only (+with) Pathway data Currently Reactomeonly Loses much of what is in Reactome E,g,inputs and outputs  Manually curated GO<->Reactome links incomplete not always to the most specific term labor-intensive become stale over time other pathway databases? This can be improved!
Automating integration using cross-product definitions – pathway databases [Term] id: GO:0015871 name: choline transport intersection_of: GO:0006810 ! transport intersection_of:results_in_transport_ofCHEBI:15354 ! choline
Automating integration using cross-products – pathway databases We can also automatically map: catalysis terms [165*] transport [373] binding [133] phosphorylation and other modifications metabolism [278] signaling … All this relies on different cross-product files Any pathway database that exports BioPax-OWL can be used E.ghumancyc, mousecyc, pathwaycommons, … *Numbers for Reactome-human
Automating integration using cross-products – interaction databases FIGF VEGFR binds has_function is_a [Term] id: GO:0043184 name: vascular endothelial growth factor receptor 2 binding intersection_of: GO:0005488           ! binding intersection_of:results_in_binding_ofPRO:000002112! VEGFR 2
Automated Integration: Results Reactome Evaluation in progress Many manually assigned equivalencies recapitulated Inferred equivalencies differed in some cases sometimes better than manually assigned sometimes required info not in biopax export ongoing discussions BioGrid not evaluated (all trivial) inferred annotations improve some enrichment results E.g. Brentani angiogenesis gene sets, increased enrichment for VEGFR binding Obvious but useful as proof of concept
Conclusions and future work We can be more efficient: Coordinate with systems bio databases to divide labor Prevent siloing through semi-automated integration GO acts as a high-level ‘window’ on systems biology databases Still to be done: Make integration tool production-ready Reconcile existing mis-alignments, particularly signaling highly inconsistent between GO and Reactome Explore open questions – e.g. auto-generate terms? Finish cross-products, they are vital particular PRO, CHEBI

More Related Content

Viewers also liked (13)

Short Introduction of software engineering for bioinformatics
Short Introduction of software engineering for bioinformatics Short Introduction of software engineering for bioinformatics
Short Introduction of software engineering for bioinformatics
 
20160530 journal club_jqo
20160530 journal club_jqo20160530 journal club_jqo
20160530 journal club_jqo
 
Kegg database resources
Kegg database resources Kegg database resources
Kegg database resources
 
Functional And Pathway Analysis 2010
Functional And Pathway Analysis 2010Functional And Pathway Analysis 2010
Functional And Pathway Analysis 2010
 
Protein Database
Protein DatabaseProtein Database
Protein Database
 
B.Sc. Biochem II Biomolecule I U 3.1 Structure of Proteins
B.Sc. Biochem II Biomolecule I U 3.1 Structure of ProteinsB.Sc. Biochem II Biomolecule I U 3.1 Structure of Proteins
B.Sc. Biochem II Biomolecule I U 3.1 Structure of Proteins
 
David
DavidDavid
David
 
Protein structure classification
Protein structure classificationProtein structure classification
Protein structure classification
 
Biological databases
Biological databasesBiological databases
Biological databases
 
PROTEIN DATABASE
PROTEIN DATABASEPROTEIN DATABASE
PROTEIN DATABASE
 
databases in bioinformatics
databases in bioinformaticsdatabases in bioinformatics
databases in bioinformatics
 
Protein classification
Protein classificationProtein classification
Protein classification
 
Classification and properties of protein
Classification and properties of proteinClassification and properties of protein
Classification and properties of protein
 

Similar to Go pathway-interaction-integration

SageCite demonstrator overview
SageCite demonstrator overviewSageCite demonstrator overview
SageCite demonstrator overviewmonicaduke
 
BioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadataBioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadataPhilip Cheung
 
NetBioSIG2013-Talk Robin Haw
NetBioSIG2013-Talk Robin Haw NetBioSIG2013-Talk Robin Haw
NetBioSIG2013-Talk Robin Haw Alexander Pico
 
Metabolic pathway mapping against KEGG, Reactome, HMDB and CPDB
Metabolic pathway mapping against KEGG, Reactome, HMDB and CPDBMetabolic pathway mapping against KEGG, Reactome, HMDB and CPDB
Metabolic pathway mapping against KEGG, Reactome, HMDB and CPDBDinesh Barupal
 
The Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resourcesThe Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resourcesMelanie Courtot
 
Pathways2GO: Converting BioPax pathways to GO-CAMs
Pathways2GO: Converting BioPax pathways to GO-CAMsPathways2GO: Converting BioPax pathways to GO-CAMs
Pathways2GO: Converting BioPax pathways to GO-CAMsBenjamin Good
 
IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...
IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...
IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...IRJET Journal
 
Ecocyc database
Ecocyc databaseEcocyc database
Ecocyc databaseShiv Kumar
 
Scientific Workflows: what do we have, what do we miss?
Scientific Workflows: what do we have, what do we miss?Scientific Workflows: what do we have, what do we miss?
Scientific Workflows: what do we have, what do we miss?Paolo Romano
 
ChEBI and genome scale metabolic reconstructions
ChEBI and genome scale metabolic reconstructionsChEBI and genome scale metabolic reconstructions
ChEBI and genome scale metabolic reconstructionsNeil Swainston
 
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...Neil Swainston
 
Haider Embrace Bosc2008
Haider Embrace Bosc2008Haider Embrace Bosc2008
Haider Embrace Bosc2008bosc_2008
 
Ieee projects 2012 2013 - Bio Informatics
Ieee projects 2012 2013 - Bio InformaticsIeee projects 2012 2013 - Bio Informatics
Ieee projects 2012 2013 - Bio InformaticsK Sundaresh Ka
 
Integrating Pathway Databases with Gene Ontology Causal Activity Models
Integrating Pathway Databases with Gene Ontology Causal Activity ModelsIntegrating Pathway Databases with Gene Ontology Causal Activity Models
Integrating Pathway Databases with Gene Ontology Causal Activity ModelsBenjamin Good
 
Bio-protocol4446.GO_KEGG_RICE.pdf
Bio-protocol4446.GO_KEGG_RICE.pdfBio-protocol4446.GO_KEGG_RICE.pdf
Bio-protocol4446.GO_KEGG_RICE.pdfssuserb500f8
 

Similar to Go pathway-interaction-integration (20)

SageCite demonstrator overview
SageCite demonstrator overviewSageCite demonstrator overview
SageCite demonstrator overview
 
BioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadataBioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadata
 
NetBioSIG2013-Talk Robin Haw
NetBioSIG2013-Talk Robin Haw NetBioSIG2013-Talk Robin Haw
NetBioSIG2013-Talk Robin Haw
 
Metabolic pathway mapping against KEGG, Reactome, HMDB and CPDB
Metabolic pathway mapping against KEGG, Reactome, HMDB and CPDBMetabolic pathway mapping against KEGG, Reactome, HMDB and CPDB
Metabolic pathway mapping against KEGG, Reactome, HMDB and CPDB
 
The Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resourcesThe Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resources
 
Pathways2GO: Converting BioPax pathways to GO-CAMs
Pathways2GO: Converting BioPax pathways to GO-CAMsPathways2GO: Converting BioPax pathways to GO-CAMs
Pathways2GO: Converting BioPax pathways to GO-CAMs
 
bioinformatic.pptx
bioinformatic.pptxbioinformatic.pptx
bioinformatic.pptx
 
IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...
IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...
IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...
 
Ecocyc database
Ecocyc databaseEcocyc database
Ecocyc database
 
Scientific Workflows: what do we have, what do we miss?
Scientific Workflows: what do we have, what do we miss?Scientific Workflows: what do we have, what do we miss?
Scientific Workflows: what do we have, what do we miss?
 
ChEBI and genome scale metabolic reconstructions
ChEBI and genome scale metabolic reconstructionsChEBI and genome scale metabolic reconstructions
ChEBI and genome scale metabolic reconstructions
 
GiTools
GiToolsGiTools
GiTools
 
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
 
Haider Embrace Bosc2008
Haider Embrace Bosc2008Haider Embrace Bosc2008
Haider Embrace Bosc2008
 
Ieee projects 2012 2013 - Bio Informatics
Ieee projects 2012 2013 - Bio InformaticsIeee projects 2012 2013 - Bio Informatics
Ieee projects 2012 2013 - Bio Informatics
 
Integrating Pathway Databases with Gene Ontology Causal Activity Models
Integrating Pathway Databases with Gene Ontology Causal Activity ModelsIntegrating Pathway Databases with Gene Ontology Causal Activity Models
Integrating Pathway Databases with Gene Ontology Causal Activity Models
 
bioinformatics enabling knowledge generation from agricultural omics data
bioinformatics enabling knowledge generation from agricultural omics databioinformatics enabling knowledge generation from agricultural omics data
bioinformatics enabling knowledge generation from agricultural omics data
 
FYP report
FYP reportFYP report
FYP report
 
Bio-protocol4446.GO_KEGG_RICE.pdf
Bio-protocol4446.GO_KEGG_RICE.pdfBio-protocol4446.GO_KEGG_RICE.pdf
Bio-protocol4446.GO_KEGG_RICE.pdf
 
Bio4j
Bio4jBio4j
Bio4j
 

More from Chris Mungall

MADICES Mungall 2022.pptx
MADICES Mungall 2022.pptxMADICES Mungall 2022.pptx
MADICES Mungall 2022.pptxChris Mungall
 
Scaling up semantics; lessons learned across the life sciences
Scaling up semantics; lessons learned across the life sciencesScaling up semantics; lessons learned across the life sciences
Scaling up semantics; lessons learned across the life sciencesChris Mungall
 
LinkML Intro July 2022.pptx PLEASE VIEW THIS ON ZENODO
LinkML Intro July 2022.pptx PLEASE VIEW THIS ON ZENODOLinkML Intro July 2022.pptx PLEASE VIEW THIS ON ZENODO
LinkML Intro July 2022.pptx PLEASE VIEW THIS ON ZENODOChris Mungall
 
Ontology Access Kit_ Workshop Intro Slides.pptx
Ontology Access Kit_ Workshop Intro Slides.pptxOntology Access Kit_ Workshop Intro Slides.pptx
Ontology Access Kit_ Workshop Intro Slides.pptxChris Mungall
 
LinkML Intro (for Monarch devs)
LinkML Intro (for Monarch devs)LinkML Intro (for Monarch devs)
LinkML Intro (for Monarch devs)Chris Mungall
 
LinkML presentation to Yosemite Group
LinkML presentation to Yosemite GroupLinkML presentation to Yosemite Group
LinkML presentation to Yosemite GroupChris Mungall
 
Experiences in the biosciences with the open biological ontologies foundry an...
Experiences in the biosciences with the open biological ontologies foundry an...Experiences in the biosciences with the open biological ontologies foundry an...
Experiences in the biosciences with the open biological ontologies foundry an...Chris Mungall
 
All together now: piecing together the knowledge graph of life
All together now: piecing together the knowledge graph of lifeAll together now: piecing together the knowledge graph of life
All together now: piecing together the knowledge graph of lifeChris Mungall
 
Collaboratively Creating the Knowledge Graph of Life
Collaboratively Creating the Knowledge Graph of LifeCollaboratively Creating the Knowledge Graph of Life
Collaboratively Creating the Knowledge Graph of LifeChris Mungall
 
Representation of kidney structures in Uberon
Representation of kidney structures in UberonRepresentation of kidney structures in Uberon
Representation of kidney structures in UberonChris Mungall
 
SparqlProg (BioHackathon 2019)
SparqlProg (BioHackathon 2019)SparqlProg (BioHackathon 2019)
SparqlProg (BioHackathon 2019)Chris Mungall
 
Ontology Development Kit: Bio-Ontologies 2019
Ontology Development Kit: Bio-Ontologies 2019Ontology Development Kit: Bio-Ontologies 2019
Ontology Development Kit: Bio-Ontologies 2019Chris Mungall
 
US2TS: Reasoning over multiple open bio-ontologies to make machines and human...
US2TS: Reasoning over multiple open bio-ontologies to make machines and human...US2TS: Reasoning over multiple open bio-ontologies to make machines and human...
US2TS: Reasoning over multiple open bio-ontologies to make machines and human...Chris Mungall
 
Uberon: opening up to community contributions
Uberon: opening up to community contributionsUberon: opening up to community contributions
Uberon: opening up to community contributionsChris Mungall
 
Modeling exposure events and adverse outcome pathways using ontologies
Modeling exposure events and adverse outcome pathways using ontologiesModeling exposure events and adverse outcome pathways using ontologies
Modeling exposure events and adverse outcome pathways using ontologiesChris Mungall
 
Causal reasoning using the Relation Ontology
Causal reasoning using the Relation OntologyCausal reasoning using the Relation Ontology
Causal reasoning using the Relation OntologyChris Mungall
 
US2TS presentation on Gene Ontology
US2TS presentation on Gene OntologyUS2TS presentation on Gene Ontology
US2TS presentation on Gene OntologyChris Mungall
 
Introduction to the BioLink datamodel
Introduction to the BioLink datamodelIntroduction to the BioLink datamodel
Introduction to the BioLink datamodelChris Mungall
 
Computing on Phenotypes AMP 2015
Computing on Phenotypes AMP 2015Computing on Phenotypes AMP 2015
Computing on Phenotypes AMP 2015Chris Mungall
 

More from Chris Mungall (20)

MADICES Mungall 2022.pptx
MADICES Mungall 2022.pptxMADICES Mungall 2022.pptx
MADICES Mungall 2022.pptx
 
Scaling up semantics; lessons learned across the life sciences
Scaling up semantics; lessons learned across the life sciencesScaling up semantics; lessons learned across the life sciences
Scaling up semantics; lessons learned across the life sciences
 
LinkML Intro July 2022.pptx PLEASE VIEW THIS ON ZENODO
LinkML Intro July 2022.pptx PLEASE VIEW THIS ON ZENODOLinkML Intro July 2022.pptx PLEASE VIEW THIS ON ZENODO
LinkML Intro July 2022.pptx PLEASE VIEW THIS ON ZENODO
 
Ontology Access Kit_ Workshop Intro Slides.pptx
Ontology Access Kit_ Workshop Intro Slides.pptxOntology Access Kit_ Workshop Intro Slides.pptx
Ontology Access Kit_ Workshop Intro Slides.pptx
 
LinkML Intro (for Monarch devs)
LinkML Intro (for Monarch devs)LinkML Intro (for Monarch devs)
LinkML Intro (for Monarch devs)
 
LinkML presentation to Yosemite Group
LinkML presentation to Yosemite GroupLinkML presentation to Yosemite Group
LinkML presentation to Yosemite Group
 
Experiences in the biosciences with the open biological ontologies foundry an...
Experiences in the biosciences with the open biological ontologies foundry an...Experiences in the biosciences with the open biological ontologies foundry an...
Experiences in the biosciences with the open biological ontologies foundry an...
 
All together now: piecing together the knowledge graph of life
All together now: piecing together the knowledge graph of lifeAll together now: piecing together the knowledge graph of life
All together now: piecing together the knowledge graph of life
 
Collaboratively Creating the Knowledge Graph of Life
Collaboratively Creating the Knowledge Graph of LifeCollaboratively Creating the Knowledge Graph of Life
Collaboratively Creating the Knowledge Graph of Life
 
Representation of kidney structures in Uberon
Representation of kidney structures in UberonRepresentation of kidney structures in Uberon
Representation of kidney structures in Uberon
 
SparqlProg (BioHackathon 2019)
SparqlProg (BioHackathon 2019)SparqlProg (BioHackathon 2019)
SparqlProg (BioHackathon 2019)
 
Ontology Development Kit: Bio-Ontologies 2019
Ontology Development Kit: Bio-Ontologies 2019Ontology Development Kit: Bio-Ontologies 2019
Ontology Development Kit: Bio-Ontologies 2019
 
US2TS: Reasoning over multiple open bio-ontologies to make machines and human...
US2TS: Reasoning over multiple open bio-ontologies to make machines and human...US2TS: Reasoning over multiple open bio-ontologies to make machines and human...
US2TS: Reasoning over multiple open bio-ontologies to make machines and human...
 
Uberon: opening up to community contributions
Uberon: opening up to community contributionsUberon: opening up to community contributions
Uberon: opening up to community contributions
 
Modeling exposure events and adverse outcome pathways using ontologies
Modeling exposure events and adverse outcome pathways using ontologiesModeling exposure events and adverse outcome pathways using ontologies
Modeling exposure events and adverse outcome pathways using ontologies
 
Causal reasoning using the Relation Ontology
Causal reasoning using the Relation OntologyCausal reasoning using the Relation Ontology
Causal reasoning using the Relation Ontology
 
US2TS presentation on Gene Ontology
US2TS presentation on Gene OntologyUS2TS presentation on Gene Ontology
US2TS presentation on Gene Ontology
 
Introduction to the BioLink datamodel
Introduction to the BioLink datamodelIntroduction to the BioLink datamodel
Introduction to the BioLink datamodel
 
Computing on Phenotypes AMP 2015
Computing on Phenotypes AMP 2015Computing on Phenotypes AMP 2015
Computing on Phenotypes AMP 2015
 
ENVO GSC 2015
ENVO GSC 2015ENVO GSC 2015
ENVO GSC 2015
 

Go pathway-interaction-integration

  • 1. Integration of GO, Pathway data and Interaction data Chris Mungall Peter D’Eustachio
  • 2. The GO was originally intended to integrate databases How are we doing? Interoperability of genomic databases is limited by this lack of progress, and it is this major obstacle that the Gene Ontology (GO) Consortium was formed to address Gene Ontology: Tool for theUnificationofBiology. Nat Genet 2000 SGD FB GOA
  • 3. GO The GO was originally intended to integrate databases How are we doing? Not as well as we could! GO SGD FB GOA Pathway Commons IMEX Reactome Cyc … BioGRID Intact …
  • 4. Integration enhances analyses and reduces workload Division of labor leave specialized curation to specialized systems biology databases but data needs to be re-combined to prevent siloing GO is an invaluable single-stop shop for term enrichment etc Can we quantify how integrating with systems biology databases helps users? Yes! We can do the experiment: GO term enrichment analysis on all MolSigDB withReactome annotations Also include Reactome inputs/outputs, not currently in GOA withoutReactomeannotations
  • 5. Integration enhances analyses GOA+R: Many p-values will significantly improved Recapitulated biologically valid results that would have been suppressed had one single resource been used Examples: Genes down-regulated in Alzheimers
  • 6. How are we currently integrating systems biology datasets? Interaction data Currently Intact, soon IMEX “protein binding” and “self-protein binding” only (+with) Pathway data Currently Reactomeonly Loses much of what is in Reactome E,g,inputs and outputs Manually curated GO<->Reactome links incomplete not always to the most specific term labor-intensive become stale over time other pathway databases? This can be improved!
  • 7. Automating integration using cross-product definitions – pathway databases [Term] id: GO:0015871 name: choline transport intersection_of: GO:0006810 ! transport intersection_of:results_in_transport_ofCHEBI:15354 ! choline
  • 8. Automating integration using cross-products – pathway databases We can also automatically map: catalysis terms [165*] transport [373] binding [133] phosphorylation and other modifications metabolism [278] signaling … All this relies on different cross-product files Any pathway database that exports BioPax-OWL can be used E.ghumancyc, mousecyc, pathwaycommons, … *Numbers for Reactome-human
  • 9. Automating integration using cross-products – interaction databases FIGF VEGFR binds has_function is_a [Term] id: GO:0043184 name: vascular endothelial growth factor receptor 2 binding intersection_of: GO:0005488 ! binding intersection_of:results_in_binding_ofPRO:000002112! VEGFR 2
  • 10. Automated Integration: Results Reactome Evaluation in progress Many manually assigned equivalencies recapitulated Inferred equivalencies differed in some cases sometimes better than manually assigned sometimes required info not in biopax export ongoing discussions BioGrid not evaluated (all trivial) inferred annotations improve some enrichment results E.g. Brentani angiogenesis gene sets, increased enrichment for VEGFR binding Obvious but useful as proof of concept
  • 11. Conclusions and future work We can be more efficient: Coordinate with systems bio databases to divide labor Prevent siloing through semi-automated integration GO acts as a high-level ‘window’ on systems biology databases Still to be done: Make integration tool production-ready Reconcile existing mis-alignments, particularly signaling highly inconsistent between GO and Reactome Explore open questions – e.g. auto-generate terms? Finish cross-products, they are vital particular PRO, CHEBI

Editor's Notes

  1. The Gene Ontology was created as a response to the need to address the need for interoperability in genomic databases in the wake of the sequencing of the first metazoan genomes. In the paper Gene Ontology: tool for the unification of biology published nearly ten years ago, Ashburner et al state: Progress in the way that biologists describe and conceptualize the shared biological elements has not kept pace with sequencing . . . Interoperability of genomic databases is limited by this lack of progress, and it is this major obstacle that the Gene Ontology (GO) Consortium was formed to address [25].The GO has since become the de-facto terminological standard for functional annotation, and its success is evident in the popularity of GO-based class enrichment analyses. However, the intervening ten years have witnessed an explosion of interest in systems biology, with a concomitant increase in the number of databases providing information on interactions and pathways, including Reactome, Nature Signaling, PANTHER [26], BIND, BioGRID and HumanCyc (the EcoCyc metabolic pathway database preceded GO [27]). These databases each have their own individual data models and schemas, creating an interoperability problem. This has partly been mitigated by the adoption of BioPAX as a standard exchange format, which allows the aggregation of multiple pathway databases in single “one-stop shopping” warehouses, such as the Pathway Knowledge Base [28], Pathway Commons, and WikiPathways. However, the data is still only partially integrated, and if a researcher wishes to obtain a comprehensive view of a pathway they must still examine multiple records, in addition to GO annotations
  2. loss in pathway db transition
  3. their data models capture more.
  4. also via col16
  5. PRO does not yet have Ras etc