KNOWLEDGE BEACONS
Semantic Web Services for distributed question answering
QUESTIONS
• Why do variants that cause sickle cell anemia protect against malaria?
• In people with variants found in any of the 22 known FA genes, is there increased
incidence of aplastic anemia (or other diseases)?
• What venomous species have resulted in drugs approved by the FDA?
• What cellular processes in which tissues are impacted in a patient-based EMR?
• Why does ingestion of GlcNAc ameliorate symptoms of NGLY1 deficiency?
DISTRIBUTED KNOWLEDGE
MANAGEMENT
• Just for today, the answer is not “put it all in wikidata” (or Monarch)
• Assume that we have to dynamically access knowledge sources (KS) that we don’t
control.
• Example: genetic data
DISTRIBUTED GENETIC
KNOWLEDGE
• A beacon is a web service that any institution can implement to share
genetic data. A beacon answers questions of the form "Do you have
information about the following mutation?" and responds with one of
"Yes" or "No", among potentially more information.
• Experiment to test willingness to share genetic data in the simplest of
all technical contexts
• Input: chr16:28883241 A>G
• Output: yes, I have data about that SNP, or no, I don’t
• 90 beacons online
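
The yes/no exchange above can be sketched in a few lines of Python. This is a toy, in-memory stand-in and not the real Beacon protocol: `ToyBeacon`, `parse_variant`, and the accepted variant string format are assumptions made for illustration.

```python
import re
from typing import NamedTuple


class Variant(NamedTuple):
    chrom: str
    pos: int
    ref: str
    alt: str


# Assumed input shape, e.g. "chr16:28883241 A>G" (a space after "chr" is tolerated).
_VARIANT_RE = re.compile(r"^chr\s*(\w+):(\d+)\s+([ACGT]+)>([ACGT]+)$")


def parse_variant(text: str) -> Variant:
    """Parse a variant string into its chromosome, position, ref and alt parts."""
    match = _VARIANT_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised variant: {text!r}")
    chrom, pos, ref, alt = match.groups()
    return Variant(chrom, int(pos), ref, alt)


class ToyBeacon:
    """Hypothetical beacon: answers only whether it holds data on a variant."""

    def __init__(self, name: str, known_variants):
        self.name = name
        self._known = {parse_variant(v) for v in known_variants}

    def has_variant(self, query: str) -> bool:
        return parse_variant(query) in self._known


beacon = ToyBeacon("demo", ["chr16:28883241 A>G"])
print(beacon.has_variant("chr16:28883241 A>G"))  # yes
print(beacon.has_variant("chr 11:5227002 A>T"))  # no
```

The beacon reveals nothing beyond presence or absence, which is what keeps the sharing experiment technically simple.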
STAGE SET FOR DISTRIBUTED
GENETIC KNOWLEDGE BASES
chr16:28883241 A>G
QUESTION
•Why do variants that cause sickle cell anemia
protect against malaria?
THE TRANSLATOR
chr 11:5227002 A>T
causes
Sickle cell
anemia
pmid:20552021
Malaria
protects against
pmid:17668374
?
• How are they
related?
• Why 1 cause and
1 protect?
TRANSLATOR
• “A key output of this effort is a ‘product’ that represents a unified interface to the
multiple groups, methods and approaches.” (NCATS)
• We assume from the outset that the required information is going to come from
different places.
• The goal is to build a system to access that information smoothly
• Step 1: where is it, and who is willing to share? (beacon)
KNOWLEDGE
BEACONS
• A knowledge beacon is a web service that any institution can
implement to share knowledge. A knowledge beacon answers
questions of the form "Do you have information about the
following concept?" and responds with semantic
relationships.
• Input: DOID:10923 (Sickle cell anemia)
• Output:
DOID:10923 isa autosomal recessive disease
DOID:10923 genetic association HBB
DOID:10923 has Phenotype cerebral hemorrhage
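
A knowledge beacon's response can be pictured as a list of subject-predicate-object statements. The sketch below is a minimal illustration, not the beacon API itself: the `Statement` class and `statements_about` function are hypothetical names, and the pairing of each predicate with its object is read off the slide's diagram order.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Statement:
    subject: str
    predicate: str
    obj: str


# Toy store keyed by concept CURIE; contents mirror the slide's example output.
TOY_KNOWLEDGE = {
    "DOID:10923": [
        Statement("DOID:10923", "isa", "autosomal recessive disease"),
        Statement("DOID:10923", "genetic association", "HBB"),
        Statement("DOID:10923", "has Phenotype", "cerebral hemorrhage"),
    ]
}


def statements_about(curie: str):
    """Return the statements whose subject is the given concept ([] if unknown)."""
    return list(TOY_KNOWLEDGE.get(curie, []))


for s in statements_about("DOID:10923"):
    print(f"{s.subject} --{s.predicate}--> {s.obj}")
```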
IMPLEMENTATION
K BEACON NETWORK
API: same as single BEACON +
• filter by provider
• list providers
• merge responses
CLIENTS (human/machine)
BLACKBOARD
KB4
Jupyter.. etc.
Swagger (Smart) API
• Auto-generated service stubs
Basic methods:
• list concepts by keyword
• get concept by CURIE
• get exact matching concept
identifiers
• get statements (semantic
relationships) by concept id
• get evidence for statements by
statement id
KNOWLEDGE BEACONS
https://github.com/STARInformatics/translator-knowledge-beacon
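
The network operations listed above (list providers, filter by provider, merge responses) can be sketched as follows. This is an in-memory illustration under stated assumptions, not the published API: `InMemoryBeacon`, `BeaconNetwork`, and the assignment of triples to providers are all made up for the example.

```python
from typing import Dict, Iterable, List, Optional, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)


class InMemoryBeacon:
    """Hypothetical beacon holding a fixed list of triples."""

    def __init__(self, triples: Iterable[Triple]):
        self._triples = list(triples)

    def get_statements(self, concept_id: str) -> List[Triple]:
        # A statement matches if the concept is its subject or its object.
        return [t for t in self._triples if concept_id in (t[0], t[2])]


class BeaconNetwork:
    """Same query shape as a single beacon, plus provider listing and filtering."""

    def __init__(self, beacons: Dict[str, InMemoryBeacon]):
        self._beacons = dict(beacons)

    def list_providers(self) -> List[str]:
        return sorted(self._beacons)

    def get_statements(
        self, concept_id: str, providers: Optional[Iterable[str]] = None
    ) -> Dict[Triple, List[str]]:
        """Merge responses, recording which providers asserted each triple."""
        selected = self.list_providers() if providers is None else list(providers)
        merged: Dict[Triple, List[str]] = {}
        for name in selected:
            for triple in self._beacons[name].get_statements(concept_id):
                merged.setdefault(triple, []).append(name)
        return merged


# Illustrative content only; which provider holds which triple is an assumption.
network = BeaconNetwork({
    "monarch": InMemoryBeacon([
        ("DOID:10923", "has Phenotype", "cerebral hemorrhage"),
    ]),
    "wikidata": InMemoryBeacon([
        ("DOID:10923", "has Phenotype", "cerebral hemorrhage"),
        ("DOID:10923", "genetic association", "HBB"),
    ]),
})

for triple, sources in network.get_statements("DOID:10923").items():
    print(triple, "from", sources)
```

Keeping the provider names alongside each merged triple is one simple way to preserve provenance when the same statement arrives from several beacons.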
ANSWER HOW ARE THEY RELATED?:
BY BEACON TRAVERSAL
chr 11:5227002 A>T cerebral hemorrhage
has Phenotype
has Phenotype
Monarch
has Phenotype
renal insufficiency
Monarch
has subclass
HPO
Cerebral Malaria
has subclass
DO
causes
Sickle cell
anemia
MyVariant
Malaria
protects against
Wikidata
Abnormal renal
physiology
has Phenotype
Monarch
Enacted in any client,
provenance, evidence tracked
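
Beacon traversal amounts to a graph search over statements gathered from several sources. Below is a minimal breadth-first sketch that keeps the source of every hop. The edge list is an approximate reading of the diagram, trimmed to the phenotype route for illustration, and `find_path` is a hypothetical helper, not part of any beacon client.

```python
from collections import deque

# (subject, predicate, object, source) - an assumed, simplified reading of the slide.
EDGES = [
    ("sickle cell anemia", "has phenotype", "cerebral hemorrhage", "Monarch"),
    ("cerebral malaria", "has phenotype", "cerebral hemorrhage", "Monarch"),
    ("malaria", "has subclass", "cerebral malaria", "DO"),
]


def build_index(edges):
    """Index edges by node in both directions so traversal can follow either way."""
    index = {}
    for subj, pred, obj, source in edges:
        index.setdefault(subj, []).append((obj, pred, source))
        index.setdefault(obj, []).append((subj, f"inverse of {pred}", source))
    return index


def find_path(edges, start, goal):
    """Breadth-first search; returns a list of (from, predicate, to, source) hops."""
    index = build_index(edges)
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        node, path = queue.popleft()
        if node == goal:
            return path
        for neighbour, pred, source in index.get(node, []):
            if neighbour not in seen:
                seen.add(neighbour)
                queue.append((neighbour, path + [(node, pred, neighbour, source)]))
    return None  # no connection found


for hop in find_path(EDGES, "sickle cell anemia", "malaria"):
    print(hop)
```

Because each hop carries its source, a client can report where every link in an answer came from.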
THAT’S STEP 1
• We have knowledge coming in from multiple sources
• Integrated into a coherent framework
• Allowing connections to be formed and some questions to be answered
WAIT BUT WHY
• Sickle cell shows up with 187 phenotypes,
• cerebral malaria with 52,
• malaria with 163
• There are likely thousands of paths that connect them.
• Just from Monarch
Why do variants that cause sickle cell
anemia protect against malaria?
57 YEARS FROM
OBSERVATION TO
EXPLANATION
(stops damage to brain,
does not influence
pathogen load)
ANSWERING
THE WHY
QUESTION
A Sickle cell
anemia
pathway
Hb
HBB gene
variants
HBB gene
variant
Heme
Nrf2,
HO-1
CO
Sickle red
blood cells
Hb
Malaria
infect red
blood cell
Heme
Cerebral
Malaria
cerebral hemorrhage
Dying from
Malaria
makes too
much
makes too
much
too much
causes
causes
causes
(Dying from
Sickle cell)
Ferreira (2011)
Blocks ability to
release Heme
Blocks ability to
release Heme
DATA CRUNCHING SERVICES
• pathfinding, ranking: assuming those connections exist, how to find them?
• set analysis – given groups of e.g. patients, infer characteristics like ‘decreased heme
production’
• patient data: get to data needed to do this
• relation inference: simple (ontological expansion), complex (analytical pipelines:
homology, expression analysis etc.)
• more..
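
As one concrete case of the "simple" relation inference above, ontological expansion can be sketched as propagating a phenotype annotation up a subclass hierarchy. This is an illustration under assumptions: `expand_phenotypes` is a hypothetical helper, each term is assumed to have at most one parent, and the example pairing (renal insufficiency under abnormal renal physiology, annotated to sickle cell anemia) is an assumed reading of the traversal diagram.

```python
def expand_phenotypes(has_phenotype, subclass_of):
    """Infer (subject, ancestor) pairs implied by the subclass hierarchy.

    has_phenotype: set of (subject, phenotype) pairs that were asserted.
    subclass_of:   dict mapping a term to its single parent term (an assumption).
    """
    inferred = set()
    for subject, phenotype in has_phenotype:
        current = phenotype
        visited = set()  # guards against cycles in malformed hierarchies
        while current in subclass_of and current not in visited:
            visited.add(current)
            current = subclass_of[current]
            inferred.add((subject, current))
    # Report only what is new relative to the asserted annotations.
    return inferred - set(has_phenotype)


asserted = {("sickle cell anemia", "renal insufficiency")}
hierarchy = {"renal insufficiency": "abnormal renal physiology"}

print(expand_phenotypes(asserted, hierarchy))
```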
TRANSLATOR
• The major consideration for Translator queries should be the extent to which they
are integrative and translational. That is, we would like to work with queries that not
only utilize multiple domains of knowledge, but do so in a way that goes beyond
simply pulling from the first source, filtering by the second, filtering by the third, and
so on. It should utilize unique relationships between multiple data sources.
“THOSE WHO CANNOT REMEMBER THE
PAST ARE CONDEMNED TO REPEAT IT.”
• SADI
• WSMO
• OWL-S
• BioMOBY
• SAWSDL
• SSWAP
• caBIO
• myGrid
• TAMBIS
George Santayana The Life of Reason, 1905
2001
THANKS
• Chris Mungall
• Richard Bruskiewich
• API specification
• https://github.com/STARInformatics/translator-knowledge-beacon
• Early implementation over SemMedDB (wikidata version)
• http://default-environment.kmmdmp4hsz.us-east-1.elasticbeanstalk.com/api/swagger-ui.html
• Greg Stupp
• Beginning implementation over wikidata in Garbanzo
• http://52.15.200.208:5000/#/translator
IMPORTANT DETAILS FOR
DISTRIBUTED SYSTEMS
• registries
• concept identity
• syntax
• relation semantics
• service stability
• provenance
• evidence
• credit
QUERY = SERVICE ORCHESTRATION
• Specifying goals and inference strategies

Contenu connexe

Tendances

Jsm madduri-august-2015
Jsm madduri-august-2015Jsm madduri-august-2015
Jsm madduri-august-2015Ravi Madduri
 
Link Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked DataLink Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked DataMichel Dumontier
 
Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...
 Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ... Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...
Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...Syed Ahmad Chan Bukhari, PhD
 
Role of Amyloid Burden in cognitive decline
Role of Amyloid Burden in cognitive decline Role of Amyloid Burden in cognitive decline
Role of Amyloid Burden in cognitive decline Ravi Madduri
 
MiAIRR:Minimum information about an Adaptive Immune Receptor Repertoire Seque...
MiAIRR:Minimum information about an Adaptive Immune Receptor Repertoire Seque...MiAIRR:Minimum information about an Adaptive Immune Receptor Repertoire Seque...
MiAIRR:Minimum information about an Adaptive Immune Receptor Repertoire Seque...Syed Ahmad Chan Bukhari, PhD
 
CI4CC sustainability-panel
CI4CC sustainability-panelCI4CC sustainability-panel
CI4CC sustainability-panelRavi Madduri
 
Human Studies Database Project (demo)
Human Studies Database Project (demo)Human Studies Database Project (demo)
Human Studies Database Project (demo)Ida Sim
 
SCRY @ ISWC'15, Diversity++
SCRY @ ISWC'15, Diversity++SCRY @ ISWC'15, Diversity++
SCRY @ ISWC'15, Diversity++Bas Stringer
 
2015 balti-and-bioinformatics
2015 balti-and-bioinformatics2015 balti-and-bioinformatics
2015 balti-and-bioinformaticsc.titus.brown
 
Services For Science April 2009
Services For Science April 2009Services For Science April 2009
Services For Science April 2009Ian Foster
 
2010 CASCON - Towards a integrated network of data and services for the life ...
2010 CASCON - Towards a integrated network of data and services for the life ...2010 CASCON - Towards a integrated network of data and services for the life ...
2010 CASCON - Towards a integrated network of data and services for the life ...Michel Dumontier
 
Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Michel Dumontier
 
wolstencroft-ogf20-astro
wolstencroft-ogf20-astrowolstencroft-ogf20-astro
wolstencroft-ogf20-astrowebuploader
 
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...Kathleen Jagodnik
 
openSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association StudiesopenSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association StudiesBastian Greshake
 
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Amit Sheth
 
HyQue: Evaluating scientific Hypotheses using semantic web technologies
HyQue: Evaluating scientific Hypotheses using semantic web technologiesHyQue: Evaluating scientific Hypotheses using semantic web technologies
HyQue: Evaluating scientific Hypotheses using semantic web technologiesMichel Dumontier
 
Databases and Ontologies: Where do we go from here?
Databases and Ontologies:  Where do we go from here?Databases and Ontologies:  Where do we go from here?
Databases and Ontologies: Where do we go from here?Maryann Martone
 

Tendances (20)

Jsm madduri-august-2015
Jsm madduri-august-2015Jsm madduri-august-2015
Jsm madduri-august-2015
 
Link Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked DataLink Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked Data
 
Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...
 Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ... Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...
Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...
 
Role of Amyloid Burden in cognitive decline
Role of Amyloid Burden in cognitive decline Role of Amyloid Burden in cognitive decline
Role of Amyloid Burden in cognitive decline
 
MiAIRR:Minimum information about an Adaptive Immune Receptor Repertoire Seque...
MiAIRR:Minimum information about an Adaptive Immune Receptor Repertoire Seque...MiAIRR:Minimum information about an Adaptive Immune Receptor Repertoire Seque...
MiAIRR:Minimum information about an Adaptive Immune Receptor Repertoire Seque...
 
CI4CC sustainability-panel
CI4CC sustainability-panelCI4CC sustainability-panel
CI4CC sustainability-panel
 
Human Studies Database Project (demo)
Human Studies Database Project (demo)Human Studies Database Project (demo)
Human Studies Database Project (demo)
 
SCRY @ ISWC'15, Diversity++
SCRY @ ISWC'15, Diversity++SCRY @ ISWC'15, Diversity++
SCRY @ ISWC'15, Diversity++
 
2015 balti-and-bioinformatics
2015 balti-and-bioinformatics2015 balti-and-bioinformatics
2015 balti-and-bioinformatics
 
Services For Science April 2009
Services For Science April 2009Services For Science April 2009
Services For Science April 2009
 
2010 CASCON - Towards a integrated network of data and services for the life ...
2010 CASCON - Towards a integrated network of data and services for the life ...2010 CASCON - Towards a integrated network of data and services for the life ...
2010 CASCON - Towards a integrated network of data and services for the life ...
 
Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...
 
wolstencroft-ogf20-astro
wolstencroft-ogf20-astrowolstencroft-ogf20-astro
wolstencroft-ogf20-astro
 
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
 
2016 bmdid-mappings
2016 bmdid-mappings2016 bmdid-mappings
2016 bmdid-mappings
 
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
 
openSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association StudiesopenSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association Studies
 
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
 
HyQue: Evaluating scientific Hypotheses using semantic web technologies
HyQue: Evaluating scientific Hypotheses using semantic web technologiesHyQue: Evaluating scientific Hypotheses using semantic web technologies
HyQue: Evaluating scientific Hypotheses using semantic web technologies
 
Databases and Ontologies: Where do we go from here?
Databases and Ontologies:  Where do we go from here?Databases and Ontologies:  Where do we go from here?
Databases and Ontologies: Where do we go from here?
 

Similaire à Knowledge Beacons

Big Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use CasesBig Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use CasesJosef Scheiber
 
Friend NRNB 2012-12-13
Friend NRNB 2012-12-13Friend NRNB 2012-12-13
Friend NRNB 2012-12-13Sage Base
 
2016 ngs health_lecture
2016 ngs health_lecture2016 ngs health_lecture
2016 ngs health_lectureDan Gaston
 
Ontologies: What Librarians Need to Know
Ontologies: What Librarians Need to KnowOntologies: What Librarians Need to Know
Ontologies: What Librarians Need to KnowBarry Smith
 
Gene Wiki and Mark2Cure update for BD2K
Gene Wiki and Mark2Cure update for BD2KGene Wiki and Mark2Cure update for BD2K
Gene Wiki and Mark2Cure update for BD2KBenjamin Good
 
Next Gen Sequencing and Associated Big Data / AI problem
Next Gen Sequencing and Associated Big Data / AI problemNext Gen Sequencing and Associated Big Data / AI problem
Next Gen Sequencing and Associated Big Data / AI problemSubhendu Dey
 
Friend harvard 2013-01-30
Friend harvard 2013-01-30Friend harvard 2013-01-30
Friend harvard 2013-01-30Sage Base
 
E.Gombocz: Semantics in a Box (SemTech 2013-04-30)
E.Gombocz: Semantics in a Box (SemTech 2013-04-30)E.Gombocz: Semantics in a Box (SemTech 2013-04-30)
E.Gombocz: Semantics in a Box (SemTech 2013-04-30)Erich Gombocz
 
UCSD / DBMI seminar 2015-02-6
UCSD / DBMI seminar 2015-02-6UCSD / DBMI seminar 2015-02-6
UCSD / DBMI seminar 2015-02-6Andrew Su
 
Fanconi Anemia Research Symposium 2017 Hoatlin
Fanconi Anemia Research Symposium 2017 HoatlinFanconi Anemia Research Symposium 2017 Hoatlin
Fanconi Anemia Research Symposium 2017 HoatlinMaureen Hoatlin
 
Combining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
Combining Explicit and Latent Web Semantics for Maintaining Knowledge GraphsCombining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
Combining Explicit and Latent Web Semantics for Maintaining Knowledge GraphsPaul Groth
 
Machine learning, health data & the limits of knowledge
Machine learning, health data & the limits of knowledgeMachine learning, health data & the limits of knowledge
Machine learning, health data & the limits of knowledgePaul Agapow
 
Amia tb-review-08
Amia tb-review-08Amia tb-review-08
Amia tb-review-08Russ Altman
 
Big Data & ML for Clinical Data
Big Data & ML for Clinical DataBig Data & ML for Clinical Data
Big Data & ML for Clinical DataPaul Agapow
 
GEMC: Central Nervous System Infections
GEMC: Central Nervous System InfectionsGEMC: Central Nervous System Infections
GEMC: Central Nervous System InfectionsOpen.Michigan
 
Ontology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesOntology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesConnected Data World
 
Equivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholderEquivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholdermhaendel
 
Accessing Biomedical and Health Information
Accessing Biomedical and Health InformationAccessing Biomedical and Health Information
Accessing Biomedical and Health Informationmonicaduke
 

Similaire à Knowledge Beacons (20)

Big Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use CasesBig Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use Cases
 
Friend NRNB 2012-12-13
Friend NRNB 2012-12-13Friend NRNB 2012-12-13
Friend NRNB 2012-12-13
 
2016 ngs health_lecture
2016 ngs health_lecture2016 ngs health_lecture
2016 ngs health_lecture
 
Ontologies: What Librarians Need to Know
Ontologies: What Librarians Need to KnowOntologies: What Librarians Need to Know
Ontologies: What Librarians Need to Know
 
Gene Wiki and Mark2Cure update for BD2K
Gene Wiki and Mark2Cure update for BD2KGene Wiki and Mark2Cure update for BD2K
Gene Wiki and Mark2Cure update for BD2K
 
Next Gen Sequencing and Associated Big Data / AI problem
Next Gen Sequencing and Associated Big Data / AI problemNext Gen Sequencing and Associated Big Data / AI problem
Next Gen Sequencing and Associated Big Data / AI problem
 
Dr. Eliot Siegel: Watson and Deep QA Software in Pursuit of Personalized Medi...
Dr. Eliot Siegel: Watson and Deep QA Software in Pursuit of Personalized Medi...Dr. Eliot Siegel: Watson and Deep QA Software in Pursuit of Personalized Medi...
Dr. Eliot Siegel: Watson and Deep QA Software in Pursuit of Personalized Medi...
 
Friend harvard 2013-01-30
Friend harvard 2013-01-30Friend harvard 2013-01-30
Friend harvard 2013-01-30
 
E.Gombocz: Semantics in a Box (SemTech 2013-04-30)
E.Gombocz: Semantics in a Box (SemTech 2013-04-30)E.Gombocz: Semantics in a Box (SemTech 2013-04-30)
E.Gombocz: Semantics in a Box (SemTech 2013-04-30)
 
Mining public domain data as a basis for drug repurposing
Mining public domain data as a basis for drug repurposingMining public domain data as a basis for drug repurposing
Mining public domain data as a basis for drug repurposing
 
UCSD / DBMI seminar 2015-02-6
UCSD / DBMI seminar 2015-02-6UCSD / DBMI seminar 2015-02-6
UCSD / DBMI seminar 2015-02-6
 
Fanconi Anemia Research Symposium 2017 Hoatlin
Fanconi Anemia Research Symposium 2017 HoatlinFanconi Anemia Research Symposium 2017 Hoatlin
Fanconi Anemia Research Symposium 2017 Hoatlin
 
Combining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
Combining Explicit and Latent Web Semantics for Maintaining Knowledge GraphsCombining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
Combining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
 
Machine learning, health data & the limits of knowledge
Machine learning, health data & the limits of knowledgeMachine learning, health data & the limits of knowledge
Machine learning, health data & the limits of knowledge
 
Amia tb-review-08
Amia tb-review-08Amia tb-review-08
Amia tb-review-08
 
Big Data & ML for Clinical Data
Big Data & ML for Clinical DataBig Data & ML for Clinical Data
Big Data & ML for Clinical Data
 
GEMC: Central Nervous System Infections
GEMC: Central Nervous System InfectionsGEMC: Central Nervous System Infections
GEMC: Central Nervous System Infections
 
Ontology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesOntology Services for the Biomedical Sciences
Ontology Services for the Biomedical Sciences
 
Equivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholderEquivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholder
 
Accessing Biomedical and Health Information
Accessing Biomedical and Health InformationAccessing Biomedical and Health Information
Accessing Biomedical and Health Information
 

Plus de Benjamin Good

Representing and reasoning with biological knowledge
Representing and reasoning with biological knowledgeRepresenting and reasoning with biological knowledge
Representing and reasoning with biological knowledgeBenjamin Good
 
Integrating Pathway Databases with Gene Ontology Causal Activity Models
Integrating Pathway Databases with Gene Ontology Causal Activity ModelsIntegrating Pathway Databases with Gene Ontology Causal Activity Models
Integrating Pathway Databases with Gene Ontology Causal Activity ModelsBenjamin Good
 
Pathways2GO: Converting BioPax pathways to GO-CAMs
Pathways2GO: Converting BioPax pathways to GO-CAMsPathways2GO: Converting BioPax pathways to GO-CAMs
Pathways2GO: Converting BioPax pathways to GO-CAMsBenjamin Good
 
Gene Wiki and Wikimedia Foundation SPARQL workshop
Gene Wiki and Wikimedia Foundation SPARQL workshopGene Wiki and Wikimedia Foundation SPARQL workshop
Gene Wiki and Wikimedia Foundation SPARQL workshopBenjamin Good
 
Scripps bioinformatics seminar_day_2
Scripps bioinformatics seminar_day_2Scripps bioinformatics seminar_day_2
Scripps bioinformatics seminar_day_2Benjamin Good
 
Computing on the shoulders of giants
Computing on the shoulders of giantsComputing on the shoulders of giants
Computing on the shoulders of giantsBenjamin Good
 
Channeling Collaborative Spirit
Channeling Collaborative SpiritChanneling Collaborative Spirit
Channeling Collaborative SpiritBenjamin Good
 
(Poster) Knowledge.Bio: an Interactive Tool for Literature-based Discovery
(Poster) Knowledge.Bio: an Interactive Tool for Literature-based Discovery (Poster) Knowledge.Bio: an Interactive Tool for Literature-based Discovery
(Poster) Knowledge.Bio: an Interactive Tool for Literature-based Discovery Benjamin Good
 
2015 6 bd2k_biobranch_knowbio
2015 6 bd2k_biobranch_knowbio2015 6 bd2k_biobranch_knowbio
2015 6 bd2k_biobranch_knowbioBenjamin Good
 
Citizen sciencepanel2015 pdf
Citizen sciencepanel2015 pdfCitizen sciencepanel2015 pdf
Citizen sciencepanel2015 pdfBenjamin Good
 
Building a massive biomedical knowledge graph with citizen science
Building a massive biomedical knowledge graph with citizen scienceBuilding a massive biomedical knowledge graph with citizen science
Building a massive biomedical knowledge graph with citizen scienceBenjamin Good
 
Branch: An interactive, web-based tool for building decision tree classifiers
Branch: An interactive, web-based tool for building decision tree classifiersBranch: An interactive, web-based tool for building decision tree classifiers
Branch: An interactive, web-based tool for building decision tree classifiersBenjamin Good
 
Serious games for bioinformatics education. ISMB 2014 education workshop
Serious games for bioinformatics education.  ISMB 2014 education workshopSerious games for bioinformatics education.  ISMB 2014 education workshop
Serious games for bioinformatics education. ISMB 2014 education workshopBenjamin Good
 
The Cure: Making a game of gene selection for breast cancer survival prediction
The Cure: Making a game of gene selection for breast cancer survival predictionThe Cure: Making a game of gene selection for breast cancer survival prediction
The Cure: Making a game of gene selection for breast cancer survival predictionBenjamin Good
 
Poster: Microtask crowdsourcing for disease mention annotation in PubMed abst...
Poster: Microtask crowdsourcing for disease mention annotation in PubMed abst...Poster: Microtask crowdsourcing for disease mention annotation in PubMed abst...
Poster: Microtask crowdsourcing for disease mention annotation in PubMed abst...Benjamin Good
 
Microtask crowdsourcing for disease mention annotation in PubMed abstracts
Microtask crowdsourcing for disease mention annotation in PubMed abstractsMicrotask crowdsourcing for disease mention annotation in PubMed abstracts
Microtask crowdsourcing for disease mention annotation in PubMed abstractsBenjamin Good
 
Mark2Cure: a crowdsourcing platform for biomedical literature annotation
Mark2Cure: a crowdsourcing platform for biomedical literature annotationMark2Cure: a crowdsourcing platform for biomedical literature annotation
Mark2Cure: a crowdsourcing platform for biomedical literature annotationBenjamin Good
 

Plus de Benjamin Good (20)

Representing and reasoning with biological knowledge
Representing and reasoning with biological knowledgeRepresenting and reasoning with biological knowledge
Representing and reasoning with biological knowledge
 
Integrating Pathway Databases with Gene Ontology Causal Activity Models
Integrating Pathway Databases with Gene Ontology Causal Activity ModelsIntegrating Pathway Databases with Gene Ontology Causal Activity Models
Integrating Pathway Databases with Gene Ontology Causal Activity Models
 
Pathways2GO: Converting BioPax pathways to GO-CAMs
Pathways2GO: Converting BioPax pathways to GO-CAMsPathways2GO: Converting BioPax pathways to GO-CAMs
Pathways2GO: Converting BioPax pathways to GO-CAMs
 
Science Game Lab
Science Game LabScience Game Lab
Science Game Lab
 
Gene Wiki and Wikimedia Foundation SPARQL workshop
Gene Wiki and Wikimedia Foundation SPARQL workshopGene Wiki and Wikimedia Foundation SPARQL workshop
Gene Wiki and Wikimedia Foundation SPARQL workshop
 
Scripps bioinformatics seminar_day_2
Scripps bioinformatics seminar_day_2Scripps bioinformatics seminar_day_2
Scripps bioinformatics seminar_day_2
 
Computing on the shoulders of giants
Computing on the shoulders of giantsComputing on the shoulders of giants
Computing on the shoulders of giants
 
Channeling Collaborative Spirit
Channeling Collaborative SpiritChanneling Collaborative Spirit
Channeling Collaborative Spirit
 
2016 mem good
2016 mem good2016 mem good
2016 mem good
 
(Poster) Knowledge.Bio: an Interactive Tool for Literature-based Discovery
(Poster) Knowledge.Bio: an Interactive Tool for Literature-based Discovery (Poster) Knowledge.Bio: an Interactive Tool for Literature-based Discovery
(Poster) Knowledge.Bio: an Interactive Tool for Literature-based Discovery
 
2015 6 bd2k_biobranch_knowbio
2015 6 bd2k_biobranch_knowbio2015 6 bd2k_biobranch_knowbio
2015 6 bd2k_biobranch_knowbio
 
(Bio)Hackathons
(Bio)Hackathons(Bio)Hackathons
(Bio)Hackathons
 
Citizen sciencepanel2015 pdf
Citizen sciencepanel2015 pdfCitizen sciencepanel2015 pdf
Citizen sciencepanel2015 pdf
 
Building a massive biomedical knowledge graph with citizen science
Building a massive biomedical knowledge graph with citizen scienceBuilding a massive biomedical knowledge graph with citizen science
Building a massive biomedical knowledge graph with citizen science
 
Branch: An interactive, web-based tool for building decision tree classifiers
Branch: An interactive, web-based tool for building decision tree classifiersBranch: An interactive, web-based tool for building decision tree classifiers
Branch: An interactive, web-based tool for building decision tree classifiers
 
Serious games for bioinformatics education. ISMB 2014 education workshop
Serious games for bioinformatics education.  ISMB 2014 education workshopSerious games for bioinformatics education.  ISMB 2014 education workshop
Serious games for bioinformatics education. ISMB 2014 education workshop
 
The Cure: Making a game of gene selection for breast cancer survival prediction
The Cure: Making a game of gene selection for breast cancer survival predictionThe Cure: Making a game of gene selection for breast cancer survival prediction
The Cure: Making a game of gene selection for breast cancer survival prediction
 
Poster: Microtask crowdsourcing for disease mention annotation in PubMed abst...
Poster: Microtask crowdsourcing for disease mention annotation in PubMed abst...Poster: Microtask crowdsourcing for disease mention annotation in PubMed abst...
Poster: Microtask crowdsourcing for disease mention annotation in PubMed abst...
 
Microtask crowdsourcing for disease mention annotation in PubMed abstracts
Microtask crowdsourcing for disease mention annotation in PubMed abstractsMicrotask crowdsourcing for disease mention annotation in PubMed abstracts
Microtask crowdsourcing for disease mention annotation in PubMed abstracts
 
Mark2Cure: a crowdsourcing platform for biomedical literature annotation
Mark2Cure: a crowdsourcing platform for biomedical literature annotationMark2Cure: a crowdsourcing platform for biomedical literature annotation
Mark2Cure: a crowdsourcing platform for biomedical literature annotation
 

Dernier

Introduction to Viruses
Introduction to VirusesIntroduction to Viruses
Introduction to VirusesAreesha Ahmad
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and ClassificationsAreesha Ahmad
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICEayushi9330
 
Knowledge Beacons

  • 1. KNOWLEDGE BEACONS Semantic Web Services for distributed question answering
  • 2. QUESTIONS • Why do variants that cause sickle cell anemia protect against malaria? • In people with variants found in any of the 22 known FA genes, is there increased incidence of aplastic anemia (or other diseases)? • What venomous species have resulted in drugs approved by the FDA? • What cellular processes in which tissues are impacted in a patient-based EMR? • Why does ingestion of GlcNAc ameliorate symptoms of ngly1 deficiency?
  • 3. DISTRIBUTED KNOWLEDGE MANAGEMENT • Just for today, the answer is not “put it all in wikidata” (or Monarch) • Assume that we have to dynamically access knowledge sources (KS) that we don’t control. • Example, genetic data
  • 4. DISTRIBUTED GENETIC KNOWLEDGE • A beacon is a web service that any institution can implement to share genetic data. A beacon answers questions of the form "Do you have information about the following mutation?" and responds with one of "Yes" or "No", among potentially more information. • Experiment to test willingness to share genetic data in the simplest of all technical contexts • Input: chr16:28883241 A>G • Output: Yes, I have data about that SNP, or no, I don’t
  • 5. • 90 beacons online
  • 6. STAGE SET FOR DISTRIBUTED GENETIC KNOWLEDGE BASES chr16:28883241 A>G
  • 7. QUESTION •Why do variants that cause sickle cell anemia protect against malaria?
  • 8. THE TRANSLATOR chr 11:5227002 A>T causes Sickle cell anemia pmid:20552021 Malaria protects against pmid:17668374 ? • How are they related? • Why 1 cause and 1 protect?
  • 9. TRANSLATOR • “A key output of this effort is a “product” that represents a unified interface to the multiple groups, methods and approaches.” NCATS • We assume from the outset that the required information is going to come from different places. • The goal is to build a system to access that information smoothly • Step 1: where is it and who is willing to share ? (beacon)
  • 10. KNOWLEDGE BEACONS • A knowledge beacon is a web service that any institution can implement to share knowledge. A beacon answers questions of the form "Do you have information about the following concept?" and responds with semantic relationships. • Input: DOID:10923 (Sickle cell anemia) • Output: DOID:10923 isa autosomal recessive disease; DOID:10923 genetic association HBB; DOID:10923 has Phenotype cerebral hemorrhage
  • 11. IMPLEMENTATION • K BEACON NETWORK API: same as single BEACON, plus • filter by provider • list providers • merge responses • CLIENTS (human/machine): BLACKBOARD, KB4, Jupyter, etc. • Swagger (Smart) API • Auto-generated service stubs • Basic methods: • list concepts by keyword • get concept by CURIE • get exact matching concept identifiers • get statements (semantic relationships) by concept id • get evidence for statements by statement id • KNOWLEDGE BEACONS https://github.com/STARInformatics/translator-knowledge-beacon
  • 12. ANSWER HOW ARE THEY RELATED?: BY BEACON TRAVERSAL chr 11:5227002 A>T cerebral hemorrhage has Phenotype has Phenotype Monarch has Phenotype renal insufficiency Monarch has subclass HPO Cerebral Malaria has subclass DO causes Sickle cell anemia MyVariant Malaria protects against Wikidata Abnormal renal physiology has Phenotype Monarch • Enacted in any client; provenance and evidence tracked
  • 13. THAT’S STEP 1 • We have knowledge coming in from multiple sources • Integrated into a coherent framework • Allowing connections to be formed and some questions to be answered
  • 14. WAIT BUT WHY • Sickle cell shows up with 187 phenotypes, • cerebral malaria with 52, • malaria with 163 • There are likely thousands of paths that connect them. • Just from Monarch Why do variants that cause sickle cell anemia protect against malaria?
  • 15. 57 YEARS FROM OBSERVATION TO EXPLANATION (stops damage to brain, does not influence pathogen load)
  • 16. ANSWERING THE WHY QUESTION A Sickle cell anemia pathway Hb HBB gene variants HBB gene variant Heme Nrf2, HO-1 CO Sickle red blood cells Hb Malaria infect red blood cell Heme Cerebral Malaria cerebral hemorrhage Dying from Malaria makes too much makes too much too much causes causes causes (Dying from Sickle cell) Ferreira (2011) Blocks ability to release Heme Blocks ability to release Heme
  • 17. DATA CRUNCHING SERVICES • pathfinding, ranking: assuming those connections exist, how to find them? • set analysis: given groups of e.g. patients, infer characteristics like ‘decreased heme production’ • patient data: get to data needed to do this • relation inference: simple (ontological expansion), complex (analytical pipelines: homology, expression analysis etc.) • more..
  • 18. TRANSLATOR • The major consideration for Translator queries should be the extent to which they are integrative and translational. That is, we would like to work with queries that not only utilize multiple domains of knowledge, but do so in a way that goes beyond simply pulling from the first source, filtering by the second, filtering by the third, and so on. It should utilize unique relationships between multiple data sources.
  • 19. “THOSE WHO CANNOT REMEMBER THE PAST ARE CONDEMNED TO REPEAT IT.” George Santayana, The Life of Reason, 1905 • SADI • WSMO • OWL-S • BioMOBY • SAWSDL • SSWAP • caBIO • myGrid • TAMBIS
  • 20. 2001
  • 21. THANKS • Chris Mungall • Richard Bruskiewich • API specification • https://github.com/STARInformatics/translator-knowledge-beacon • Early implementation over semmedDB (wikidata version) • http://default-environment.kmmdmp4hsz.us-east-1.elasticbeanstalk.com/api/swagger-ui.html • Greg Stupp • Beginning implementation over wikidata in Garbanzo • http://52.15.200.208:5000/#/translator
  • 22. IMPORTANT DETAILS FOR DISTRIBUTED SYSTEMS • registries • concept identity • syntax • relation semantics • service stability • provenance • evidence • credit
  • 23. QUERY = SERVICE ORCHESTRATION • Specifying goals and inference strategies
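
The beacon traversal described in slides 10 to 12 can be sketched roughly as follows. This is a minimal illustration, not the Translator implementation: the `BEACONS` dictionary is a hypothetical in-memory stand-in for live beacon services, its statements are only loosely based on the slide 12 diagram, and the names `Statement`, `get_statements` and `find_path` are invented for this example. It merges statements from several providers, keeps the provider of each statement as provenance, and uses a breadth-first search to find a shortest chain of statements between two concepts.

```python
from collections import deque
from typing import NamedTuple, Optional


class Statement(NamedTuple):
    subject: str
    predicate: str
    obj: str
    source: str  # beacon that supplied the statement (provenance)


# Hypothetical stand-ins for beacon responses; illustrative data only.
BEACONS = {
    "MyVariant": [
        Statement("chr11:5227002 A>T", "causes", "sickle cell anemia", "MyVariant"),
    ],
    "Wikidata": [
        Statement("chr11:5227002 A>T", "protects against", "malaria", "Wikidata"),
    ],
    "Monarch": [
        Statement("sickle cell anemia", "has phenotype", "cerebral hemorrhage", "Monarch"),
        Statement("cerebral malaria", "has phenotype", "cerebral hemorrhage", "Monarch"),
    ],
    "DO": [
        Statement("malaria", "has subclass", "cerebral malaria", "DO"),
    ],
}


def get_statements(concept: str) -> list[Statement]:
    """Merge, across all beacons, the statements that mention `concept`."""
    return [
        s
        for statements in BEACONS.values()
        for s in statements
        if concept in (s.subject, s.obj)
    ]


def find_path(start: str, goal: str) -> Optional[list[Statement]]:
    """Breadth-first search over merged statements, treating them as undirected edges.

    Returns a shortest list of statements linking start to goal, or None.
    """
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        node, path = queue.popleft()
        if node == goal:
            return path
        for s in get_statements(node):
            neighbour = s.obj if s.subject == node else s.subject
            if neighbour not in seen:
                seen.add(neighbour)
                queue.append((neighbour, path + [s]))
    return None


if __name__ == "__main__":
    for step in find_path("sickle cell anemia", "cerebral malaria") or []:
        print(f"{step.subject} --{step.predicate}--> {step.obj} [{step.source}]")
```

Because every statement carries its source, a client can report which beacon contributed each hop of a path. With real beacons there may be thousands of such paths, so a ranking step like the one listed under data crunching services would still be needed.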

Editor's notes

  1. BRCA Exchange aggregates BRCA variants and gets experts to curate their clinical significance.
  2. At the end of the pilot, a key output of this effort is a “product” that represents a unified interface to the multiple groups, methods and approaches. One or more queries will be presented to this unified interface.
  3. Sickle cell shows up with 187 phenotypes, cerebral malaria with 52, malaria with 163
  4. Homozygous sickle cell anemia results in damage from the accumulation of high levels of cell-free Hb and heme in plasma. Heterozygous sickle cell patients also accumulate low (nonpathologic) levels of heme in plasma; this results in the production of CO (carbon monoxide), which binds cell-free Hb and inhibits its oxidation, thus preventing heme release.
  5. geographical