SlideShare une entreprise Scribd logo
1  sur  35
Télécharger pour lire hors ligne
Simon Jupp
Technical Coordinator / Ontology Project Lead
Samples, Phenotypes and Ontologies Team
EMBL-EBI European Bioinformatics Institute
Ontology services for connecting
biomedical data
Connected Data London,
October 4th, 2019
What is EMBL-EBI?
• Europe’s home for biological data services, research
and training
• A trusted data provider for the life sciences
• Part of the European Molecular Biology Laboratory,
an intergovernmental research organisation
• International: 650 members of staff from 66 nations
From molecules to medicine
We are always seeking new ways to read
and understand DNA
New technologies provide ways to collect,
compare and visualise molecular
information
Bioinformatics enables new applications:
• molecular medicine
• agriculture
• food
• environmental sciences
Data resources at EMBL-EBI
There‘s a lot of metadata...
tissues cell lines diseases
How many ways can you say “female”?
18-day pregnant females female (lactating) individual female worker caste (female)
2 yr old female female (pregnant) lgb*cc females sex: female
400 yr. old female female (outbred) mare female, other
adult female female parent female (worker) female child
asexual female female plant monosex female femal
castrate female female with eggs ovigerous female 3 female
cf.female female worker oviparous sexual females female (phenotype)
cystocarpic female female, 6-8 weeks old worker bee female mice
dikaryon female, virgin female enriched female, spayed
dioecious female female, worker pseudohermaprhoditic female femlale
diploid female female(gynoecious) remale metafemale
f femele semi-engorged female sterile female
famale female, pooled sexual oviparous female normal female
femail femalen sterile female worker sf
female females strictly female vitellogenic replete female
female - worker females only tetraploid female worker
female (alate sexual) gynoecious thelytoky hexaploid female
female (calf) healthy female female (gynoecious) female (f-o)
hen probably female (based on morphology)
female (note: this sample was originally provided as a "male" sample to us and therefore labeled this way in the brawand et al. paper
and original geo submission; however, detailed data analyses carried out in the meantime clearly show that this sample stems from a
female individual)",
Courtesy of N. Silvester, European Nucleotide Archive, EMBL-
EBI
Need for terminology standards
• Need to ensure we’re all talking about the same thing
• The biomedical science community has been busy
building ontologies and terminology standard
• Over 100 freely-available ontologies from the Open
Biological Ontology (OBO) community
• Most developed with formal semantics in OWL
• Many more terminology standards in use in biomedicine
Tibia?
EBI Ontologies Team
• Build services to make
ontologies accessible for
humans and machines
• Ensure a consistent set of
interoperable ontologies are
used across public datasets to
maximise interoperability
• Scale up the process to millions
of data points
• Work with software and
database developers to utilise
the ontologies
Data to knowledge
The end result is integrated data with
semantic search
Expression Atlas
GWAS catalog
Ontology driven search
• Semantic query across 20 integrated datasets to identify
potential new drug targets for disease
https://www.targetvalidation.org
Aligning data to our ontologies
Organism: Homo sapiens
cell type: Mast cell
Disease: Type II diabetes mellitus
Organism
part:
pancreas
CL:0000097
Cell type ontology
Where do you start?
Typical questions
• How do I access ontologies?
• How do I annotate data with ontologies?
• Which ontologies should I use?
• What about data that doesn’t map easily?
• How can I translate from one ontology to another?
• How can I extend an ontology?
• How do I build “ontology aware” applications?
The Ontology Toolkit
https://github.com/EBISPOT
Open Source Software
http://www.ebi.ac.uk/spot/ontology
Ontology Lookup Service
• Ontology search engine
• Ontology term history tracking
• Ontology visualisation
• RESTful API
Repository of over 200 pre-selected biomedical ontologies (5+ million terms)
http://www.ebi.ac.uk/ols
• Provides unified mechanism to access
multiple ontologies
• 6,000 users / 50 million hits per month
Visualisation tools
The problem with just an ontology lookup
…knowing what you’re looking for
Data annotation services
• Supporting data curation to map to the “right”
terms
• Based on what other databases are doing
• Collect mappings from 10 databases at EBI
and use as a training set to predict how new
unseen data should map to ontologies
http://www.ebi.ac.uk/spot/zooma
“Heart” UBERON:0000948
+ Context
(where, when?)
https://www.ebi.ac.uk/spot/zooma/
• Using previously curated data sources
https://www.ebi.ac.uk/spot/zooma/
• Using only ontologies
• Curators review output and feedback into Zooma
https://www.ebi.ac.uk/spot/zooma/
Reviewers
• We’re increasingly seeing data that is described using
ontologies
• But we don’t always agree on the ontologies to use
Datasource 1 Datasource 2
Human
Phenotype
Ontology
SNOMED-CTMappings
Ontology Mapping Service (OxO)
http://www.ebi.ac.uk/spot/oxo
Ontology Mapping Service (OxO)
• Graph database (Neo4j) of mappings from a number of public source
• Mappings are often semantically vague (exact, broader, narrower,
related)
• We use the graph to infer potential new mappings, and identify
conflicting sources of mappings
http://www.ebi.ac.uk/spot/oxo
Under the hood we use Neo4j
• We import OWL ontologies into Neo4j
• Simplify the OWL representation that is optimized for common queries
• Model for the application needs
• Scalable applications that are more developer friendly than triple stores
Powerful yet simple queries
• Get the full partonomy and classification of “heart” with
CYPHER
MATCH
(n)-[r:SUBCLASSOF|PARTOF*]->(parents)
WHERE n.label = “heart”
RETURN parents
Using ontologies in our search indexes
https://ebispot.github.io/BioSolr/
Enrich your search
index with ontology
goodness
• For text search we compute the closure of all
relationships into our text index
Semantic search and data integration with
ontologies
https://www.ebi.ac.uk/gwas
Publishing the data
• EBI RDF platform contains 7 EBI databases connected
by shared ontologies
• SPARQL access to a subset of EBI data
• But maintenance is hard as it’s not the source of truth for
the data
http://rdf.ebi.ac.uk
Aligning schemas to a single model is hard
Gene (via identifiers.
org/ensembl)
RNA transcript (via
identifiers.org/ensembl)
uniprot:Protein
rdfs:seeAlso (not currently linking
to identifiers.org but soon)
discretized differential
gene expression ratio
(sio: SIO_001078)
Gene Expression Atlas
Ensembl
sio:'is attribute of'
(sio:SIO_000011)
Uniprot
Gene Ontology
GO BP GO MF GO CC
uniprot:classifiedWith
bq:occursIn
Organisms
Organism/taxon
ChEMBL
Assay
(?)
chem
bl:hasTarget
?
bq:isVersionOf
uniprot:organism
rdfs:seeAlso
1
1
1
*
1
* * *
1
1
BioModels
SBMLModel
Reaction
Species
Compartment
bq:is
bq:isVersionOf
bq:isVersionOf
bq:is
bq:isVersionOf
bq:isHomologTo
bq:hasPart
ChEBI
Reactome
Pathway
bq:isVersionOf
bq:isVersionOf
SBO
bq:is
Relationships within
Biomodels can be found
at https://github.
com/sarala/ricordo-
rdfconverter/wiki/SBML-
RDF-Schema
rdfs:seeAlso
Structure
PDB
1
rdfs:seeAlso
Target (?)
uniprot:transcribedFrom
Protein (via identifiers.
org/ensembl)
uniprot:translatedTo
bq:isVersionOf
Genes
Drugs
Species
Protein
Protein Structure
Reactions
Gene function Systems
Disease
Is JSON-LD the answer?
e.g. Most services produce JSON via REST
API
Ensembl REST API
Slight tweak to make RDF compatible
"@context" : {
"@vocab" : "http://rdf.ebi.ac.uk/terms/ensembl/",
"obo" : "http://purl.obolibrary.org/obo/",
"dcterms" : "http://purl.org/dc/terms/",
"faldo" : "http://biohackathon.org/resource/faldo#",
"biotype" : {
"@id" : "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
"@type" : "@vocab"
},
"protein_coding" : "obo:SO_0001217",
"id" : "dcterms:identifier",
"homo_sapiens" : "http://identifiers.org/taxonomy/9606",
"species" : {
"@id" : "obo:OBO_0100026",
"@type" : "@vocab"
},
"description" : "dcterms:description",
"display_name" : "http://www.w3.org/2004/02/skos/core#prefLabel"
}
Using JSON-LD
to assign
ontology
semantics to
existing data
Ensembl JSON as RDF triples
"@context" : {
"@vocab" : "http://rdf.ebi.ac.uk/terms/ensembl/",
"obo" : "http://purl.obolibrary.org/obo/",
"dcterms" : "http://purl.org/dc/terms/",
"faldo" : "http://biohackathon.org/resource/faldo#",
"biotype" : {
"@id" : "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
"@type" : "@vocab"
},
"protein_coding" : "obo:SO_0001217",
"id" : "dcterms:identifier",
"homo_sapiens" : "http://identifiers.org/taxonomy/9606",
"species" : {
"@id" : "obo:OBO_0100026",
"@type" : "@vocab"
},
"description" : "dcterms:description",
"display_name" : "http://www.w3.org/2004/02/skos/core#prefLabel"
}
BioSchemas & Schema.org
• Low cost investment (markup in HTML)
• Community growing for Life science
• http://bioschemas.org
• JSON-LD emerging as popular microformat language
• EBI BioSamples database has over 10 million pages
marked up with semantic markup
• Great potential for datasets discovery (finding data
generated from the same samples)
• But not clear who will do the crawling and build the
indexes…
What we’ve learnt along the way
• The data we see is getting better as the ontologies have matured and
consensus has grown around which ontologies should be used
• Crowdsourcing through tools like Zooma and OxO has good economies of
scale with respect to data curation
• Retrofitting the semantics in this way has limits, there’s still a long tail of data
that we miss.
• OWL semantics are essential for building and maintaining our ontologies, but
we’ve had to devise custom ways to utilise the ontologies when building
applications and populating databases
• Developers want more conventional access to semantics (i.e. REST+JSON)
Ontology team
Helen Parkinson
Warren ReadOla Ajigboye
Funding
• EMBL and OpenTargets
• CORBEL This project receives funding from the
European Union’s Horizon 2020 research and
innovation programme under grant agreement No
654248.
• EJP cofund
• EOSC-Life
• EXCELERATE ELIXIR-EXCELERATE is funded by
the European Commission within the Research
Infrastructures programme of Horizon 2020, grant
agreement number 676559.
• Funding for Human Cell Atlas from Chan-Zuckerberg
Initiative
Paola Roncaglia Henriette Harmse
Simon Jupp
Zoe Pendlington
Nicolas Matentzoglu
David Osumi-Sutherland

Contenu connexe

Tendances

UniProt-GOA
UniProt-GOAUniProt-GOA
UniProt-GOA
EBI
 
Bioinformatics Final Report
Bioinformatics Final ReportBioinformatics Final Report
Bioinformatics Final Report
Shruthi Choudary
 
Using ontologies to do integrative systems biology
Using ontologies to do integrative systems biologyUsing ontologies to do integrative systems biology
Using ontologies to do integrative systems biology
Chris Evelo
 
ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog...
ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog...ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog...
ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog...
dolleyj
 
BioCuration 2019 - Evidence and Conclusion Ontology 2019 Update
BioCuration 2019 - Evidence and Conclusion Ontology 2019 UpdateBioCuration 2019 - Evidence and Conclusion Ontology 2019 Update
BioCuration 2019 - Evidence and Conclusion Ontology 2019 Update
dolleyj
 
Developing Frameworks and Tools for Animal Trait Ontology (ATO)
Developing Frameworks and Tools for Animal Trait Ontology (ATO) Developing Frameworks and Tools for Animal Trait Ontology (ATO)
Developing Frameworks and Tools for Animal Trait Ontology (ATO)
Jie Bao
 

Tendances (20)

UniProt-GOA
UniProt-GOAUniProt-GOA
UniProt-GOA
 
Building a Model Organism Metabolome Database
Building a  Model Organism Metabolome DatabaseBuilding a  Model Organism Metabolome Database
Building a Model Organism Metabolome Database
 
Ontology Development Kit: Bio-Ontologies 2019
Ontology Development Kit: Bio-Ontologies 2019Ontology Development Kit: Bio-Ontologies 2019
Ontology Development Kit: Bio-Ontologies 2019
 
All together now: piecing together the knowledge graph of life
All together now: piecing together the knowledge graph of lifeAll together now: piecing together the knowledge graph of life
All together now: piecing together the knowledge graph of life
 
WikiPathways: how open source and open data can make omics technology more us...
WikiPathways: how open source and open data can make omics technology more us...WikiPathways: how open source and open data can make omics technology more us...
WikiPathways: how open source and open data can make omics technology more us...
 
Using biological network approaches for dynamic extension of micronutrient re...
Using biological network approaches for dynamic extension of micronutrient re...Using biological network approaches for dynamic extension of micronutrient re...
Using biological network approaches for dynamic extension of micronutrient re...
 
Representation of kidney structures in Uberon
Representation of kidney structures in UberonRepresentation of kidney structures in Uberon
Representation of kidney structures in Uberon
 
Collaboratively Creating the Knowledge Graph of Life
Collaboratively Creating the Knowledge Graph of LifeCollaboratively Creating the Knowledge Graph of Life
Collaboratively Creating the Knowledge Graph of Life
 
Gene Ontology Project
Gene Ontology ProjectGene Ontology Project
Gene Ontology Project
 
Analysis with biological pathways:
Analysis with biological pathways: Analysis with biological pathways:
Analysis with biological pathways:
 
Bioinformatics Final Report
Bioinformatics Final ReportBioinformatics Final Report
Bioinformatics Final Report
 
Mungall keynote-biocurator-2017
Mungall keynote-biocurator-2017Mungall keynote-biocurator-2017
Mungall keynote-biocurator-2017
 
Opening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs apiOpening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs api
 
Using ontologies to do integrative systems biology
Using ontologies to do integrative systems biologyUsing ontologies to do integrative systems biology
Using ontologies to do integrative systems biology
 
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
 
ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog...
ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog...ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog...
ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog...
 
Data retrieval
Data retrievalData retrieval
Data retrieval
 
BioCuration 2019 - Evidence and Conclusion Ontology 2019 Update
BioCuration 2019 - Evidence and Conclusion Ontology 2019 UpdateBioCuration 2019 - Evidence and Conclusion Ontology 2019 Update
BioCuration 2019 - Evidence and Conclusion Ontology 2019 Update
 
Introduction to In silico engineering for biologics
Introduction to In silico engineering for biologicsIntroduction to In silico engineering for biologics
Introduction to In silico engineering for biologics
 
Developing Frameworks and Tools for Animal Trait Ontology (ATO)
Developing Frameworks and Tools for Animal Trait Ontology (ATO) Developing Frameworks and Tools for Animal Trait Ontology (ATO)
Developing Frameworks and Tools for Animal Trait Ontology (ATO)
 

Similaire à Ontology Services for the Biomedical Sciences

GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...
Neo4j
 

Similaire à Ontology Services for the Biomedical Sciences (20)

Open interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBIOpen interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBI
 
World-wide data exchange in metabolomics, Wageningen, October 2016
World-wide data exchange in metabolomics, Wageningen, October 2016World-wide data exchange in metabolomics, Wageningen, October 2016
World-wide data exchange in metabolomics, Wageningen, October 2016
 
Multi-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application DomainsMulti-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application Domains
 
Connecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics InstituteConnecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics Institute
 
Developments in Metabolomics leading to PhenoMeNal
Developments in Metabolomics leading to PhenoMeNalDevelopments in Metabolomics leading to PhenoMeNal
Developments in Metabolomics leading to PhenoMeNal
 
Ontologies neo4j-graph-workshop-berlin
Ontologies neo4j-graph-workshop-berlinOntologies neo4j-graph-workshop-berlin
Ontologies neo4j-graph-workshop-berlin
 
The Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in BiologyThe Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in Biology
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
Facilitating semantic alignment.-biohackathon-jupp
Facilitating semantic alignment.-biohackathon-juppFacilitating semantic alignment.-biohackathon-jupp
Facilitating semantic alignment.-biohackathon-jupp
 
Building and Using Ontologies to do biology
Building and Using Ontologies to do biologyBuilding and Using Ontologies to do biology
Building and Using Ontologies to do biology
 
Sharing data from clinical and medical research
Sharing data from clinical and medical researchSharing data from clinical and medical research
Sharing data from clinical and medical research
 
Building an efficient infrastructure, standards and data flow for metabolomics
Building an efficient infrastructure, standards and data flow for metabolomicsBuilding an efficient infrastructure, standards and data flow for metabolomics
Building an efficient infrastructure, standards and data flow for metabolomics
 
PhenoMeNal presentation at STFC-ELIXIR Meeting Hinxon
PhenoMeNal presentation at STFC-ELIXIR Meeting HinxonPhenoMeNal presentation at STFC-ELIXIR Meeting Hinxon
PhenoMeNal presentation at STFC-ELIXIR Meeting Hinxon
 
Biological database....pptx
Biological database....pptxBiological database....pptx
Biological database....pptx
 
A Semantic Web based Framework for Linking Healthcare Information with Comput...
A Semantic Web based Framework for Linking Healthcare Information with Comput...A Semantic Web based Framework for Linking Healthcare Information with Comput...
A Semantic Web based Framework for Linking Healthcare Information with Comput...
 
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...
 
2010 CASCON - Towards a integrated network of data and services for the life ...
2010 CASCON - Towards a integrated network of data and services for the life ...2010 CASCON - Towards a integrated network of data and services for the life ...
2010 CASCON - Towards a integrated network of data and services for the life ...
 
Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013
 
Web Apollo: Lessons learned from community-based biocuration efforts.
Web Apollo: Lessons learned from community-based biocuration efforts.Web Apollo: Lessons learned from community-based biocuration efforts.
Web Apollo: Lessons learned from community-based biocuration efforts.
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
 

Plus de Connected Data World

The years of the graph: The future of the future is here
The years of the graph: The future of the future is hereThe years of the graph: The future of the future is here
The years of the graph: The future of the future is here
Connected Data World
 
In Search of the Universal Data Model
In Search of the Universal Data ModelIn Search of the Universal Data Model
In Search of the Universal Data Model
Connected Data World
 
Graph Realities
Graph RealitiesGraph Realities
Graph Realities
Connected Data World
 
RAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needsRAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needs
Connected Data World
 
Elegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property GraphsElegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property Graphs
Connected Data World
 

Plus de Connected Data World (20)

Systems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van HarmelenSystems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van Harmelen
 
Graph Abstractions Matter by Ora Lassila
Graph Abstractions Matter by Ora LassilaGraph Abstractions Matter by Ora Lassila
Graph Abstractions Matter by Ora Lassila
 
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
 
How to get started with Graph Machine Learning
How to get started with Graph Machine LearningHow to get started with Graph Machine Learning
How to get started with Graph Machine Learning
 
Graphs in sustainable finance
Graphs in sustainable financeGraphs in sustainable finance
Graphs in sustainable finance
 
The years of the graph: The future of the future is here
The years of the graph: The future of the future is hereThe years of the graph: The future of the future is here
The years of the graph: The future of the future is here
 
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
 
From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3
 
In Search of the Universal Data Model
In Search of the Universal Data ModelIn Search of the Universal Data Model
In Search of the Universal Data Model
 
Graph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph DatabaseGraph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph Database
 
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
 
Graph Realities
Graph RealitiesGraph Realities
Graph Realities
 
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
 
Semantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scaleSemantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scale
 
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
 
Schema, Google & The Future of the Web
Schema, Google & The Future of the WebSchema, Google & The Future of the Web
Schema, Google & The Future of the Web
 
RAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needsRAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needs
 
Elegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property GraphsElegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property Graphs
 
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
 
Graph for Good: Empowering your NGO
Graph for Good: Empowering your NGOGraph for Good: Empowering your NGO
Graph for Good: Empowering your NGO
 

Dernier

➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
amitlee9823
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
only4webmaster01
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
MarinCaroMartnezBerg
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
AroojKhan71
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
amitlee9823
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
amitlee9823
 

Dernier (20)

➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 

Ontology Services for the Biomedical Sciences

  • 1. Simon Jupp Technical Coordinator / Ontology Project Lead Samples, Phenotypes and Ontologies Team EMBL-EBI European Bioinformatics Institute Ontology services for connecting biomedical data Connected Data London, October 4th, 2019
  • 2. What is EMBL-EBI? • Europe’s home for biological data services, research and training • A trusted data provider for the life sciences • Part of the European Molecular Biology Laboratory, an intergovernmental research organisation • International: 650 members of staff from 66 nations
  • 3. From molecules to medicine We are always seeking new ways to read and understand DNA New technologies provide ways to collect, compare and visualise molecular information Bioinformatics enables new applications: • molecular medicine • agriculture • food • environmental sciences
  • 4. Data resources at EMBL-EBI
  • 5. There‘s a lot of metadata... tissues cell lines diseases
  • 6. How many ways can you say “female”? 18-day pregnant females female (lactating) individual female worker caste (female) 2 yr old female female (pregnant) lgb*cc females sex: female 400 yr. old female female (outbred) mare female, other adult female female parent female (worker) female child asexual female female plant monosex female femal castrate female female with eggs ovigerous female 3 female cf.female female worker oviparous sexual females female (phenotype) cystocarpic female female, 6-8 weeks old worker bee female mice dikaryon female, virgin female enriched female, spayed dioecious female female, worker pseudohermaprhoditic female femlale diploid female female(gynoecious) remale metafemale f femele semi-engorged female sterile female famale female, pooled sexual oviparous female normal female femail femalen sterile female worker sf female females strictly female vitellogenic replete female female - worker females only tetraploid female worker female (alate sexual) gynoecious thelytoky hexaploid female female (calf) healthy female female (gynoecious) female (f-o) hen probably female (based on morphology) female (note: this sample was originally provided as a "male" sample to us and therefore labeled this way in the brawand et al. paper and original geo submission; however, detailed data analyses carried out in the meantime clearly show that this sample stems from a female individual)", Courtesy of N. Silvester, European Nucleotide Archive, EMBL- EBI
  • 7. Need for terminology standards • Need to ensure we’re all talking about the same thing • The biomedical science community has been busy building ontologies and terminology standard • Over 100 freely-available ontologies from the Open Biological Ontology (OBO) community • Most developed with formal semantics in OWL • Many more terminology standards in use in biomedicine Tibia?
  • 8. EBI Ontologies Team • Build services to make ontologies accessible for humans and machines • Ensure a consistent set of interoperable ontologies are used across public datasets to maximise interoperability • Scale up the process to millions of data points • Work with software and database developers to utilise the ontologies Data to knowledge
  • 9. The end result is integrated data with semantic search Expression Atlas GWAS catalog
  • 10. Ontology driven search • Semantic query across 20 integrated datasets to identify potential new drug targets for disease https://www.targetvalidation.org
  • 11. Aligning data to our ontologies Organism: Homo sapiens cell type: Mast cell Disease: Type II diabetes mellitus Organism part: pancreas CL:0000097 Cell type ontology Where do you start?
  • 12. Typical questions • How do I access ontologies? • How do I annotate data with ontologies? • Which ontologies should I use? • What about data that doesn’t map easily? • How can I translate from one ontology to another? • How can I extend an ontology? • How do I build “ontology aware” applications?
  • 13. The Ontology Toolkit https://github.com/EBISPOT Open Source Software http://www.ebi.ac.uk/spot/ontology
  • 14. Ontology Lookup Service • Ontology search engine • Ontology term history tracking • Ontology visualisation • RESTful API Repository of over 200 pre-selected biomedical ontologies (5+ million terms) http://www.ebi.ac.uk/ols • Provides unified mechanism to access multiple ontologies • 6,000 users / 50 million hits per month
  • 16. The problem with just an ontology lookup …knowing what you’re looking for
  • 17. Data annotation services • Supporting data curation to map to the “right” terms • Based on what other databases are doing • Collect mappings from 10 databases at EBI and use as a training set to predict how new unseen data should map to ontologies http://www.ebi.ac.uk/spot/zooma “Heart” UBERON:0000948 + Context (where, when?)
  • 19. • Using previously curated data sources https://www.ebi.ac.uk/spot/zooma/
  • 20. • Using only ontologies • Curators review output and feedback into Zooma https://www.ebi.ac.uk/spot/zooma/ Reviewers
  • 21. • We’re increasingly seeing data that is described using ontologies • But we don’t always agree on the ontologies to use Datasource 1 Datasource 2 Human Phenotype Ontology SNOMED-CTMappings Ontology Mapping Service (OxO) http://www.ebi.ac.uk/spot/oxo
  • 22. Ontology Mapping Service (OxO) • Graph database (Neo4j) of mappings from a number of public source • Mappings are often semantically vague (exact, broader, narrower, related) • We use the graph to infer potential new mappings, and identify conflicting sources of mappings http://www.ebi.ac.uk/spot/oxo
  • 23. Under the hood we use Neo4j • We import OWL ontologies into Neo4j • Simplify the OWL representation that is optimized for common queries • Model for the application needs • Scalable applications that are more developer friendly than triple stores
  • 24. Powerful yet simple queries • Get the full partonomy and classification of “heart” with CYPHER MATCH (n)-[r:SUBCLASSOF|PARTOF*]->(parents) WHERE n.label = “heart” RETURN parents
  • 25. Using ontologies in our search indexes https://ebispot.github.io/BioSolr/ Enrich your search index with ontology goodness • For text search we compute the closure of all relationships into our text index
  • 26. Semantic search and data integration with ontologies https://www.ebi.ac.uk/gwas
  • 27. Publishing the data • EBI RDF platform contains 7 EBI databases connected by shared ontologies • SPARQL access to a subset of EBI data • But maintenance is hard as it’s not the source of truth for the data http://rdf.ebi.ac.uk
  • 28. Aligning schemas to a single model is hard Gene (via identifiers. org/ensembl) RNA transcript (via identifiers.org/ensembl) uniprot:Protein rdfs:seeAlso (not currently linking to identifiers.org but soon) discretized differential gene expression ratio (sio: SIO_001078) Gene Expression Atlas Ensembl sio:'is attribute of' (sio:SIO_000011) Uniprot Gene Ontology GO BP GO MF GO CC uniprot:classifiedWith bq:occursIn Organisms Organism/taxon ChEMBL Assay (?) chem bl:hasTarget ? bq:isVersionOf uniprot:organism rdfs:seeAlso 1 1 1 * 1 * * * 1 1 BioModels SBMLModel Reaction Species Compartment bq:is bq:isVersionOf bq:isVersionOf bq:is bq:isVersionOf bq:isHomologTo bq:hasPart ChEBI Reactome Pathway bq:isVersionOf bq:isVersionOf SBO bq:is Relationships within Biomodels can be found at https://github. com/sarala/ricordo- rdfconverter/wiki/SBML- RDF-Schema rdfs:seeAlso Structure PDB 1 rdfs:seeAlso Target (?) uniprot:transcribedFrom Protein (via identifiers. org/ensembl) uniprot:translatedTo bq:isVersionOf Genes Drugs Species Protein Protein Structure Reactions Gene function Systems Disease
  • 29. Is JSON-LD the answer? e.g. Most services produce JSON via REST API
  • 31. Slight tweak to make RDF compatible "@context" : { "@vocab" : "http://rdf.ebi.ac.uk/terms/ensembl/", "obo" : "http://purl.obolibrary.org/obo/", "dcterms" : "http://purl.org/dc/terms/", "faldo" : "http://biohackathon.org/resource/faldo#", "biotype" : { "@id" : "http://www.w3.org/1999/02/22-rdf-syntax-ns#", "@type" : "@vocab" }, "protein_coding" : "obo:SO_0001217", "id" : "dcterms:identifier", "homo_sapiens" : "http://identifiers.org/taxonomy/9606", "species" : { "@id" : "obo:OBO_0100026", "@type" : "@vocab" }, "description" : "dcterms:description", "display_name" : "http://www.w3.org/2004/02/skos/core#prefLabel" } Using JSON-LD to assign ontology semantics to existing data
  • 32. Ensembl JSON as RDF triples "@context" : { "@vocab" : "http://rdf.ebi.ac.uk/terms/ensembl/", "obo" : "http://purl.obolibrary.org/obo/", "dcterms" : "http://purl.org/dc/terms/", "faldo" : "http://biohackathon.org/resource/faldo#", "biotype" : { "@id" : "http://www.w3.org/1999/02/22-rdf-syntax-ns#", "@type" : "@vocab" }, "protein_coding" : "obo:SO_0001217", "id" : "dcterms:identifier", "homo_sapiens" : "http://identifiers.org/taxonomy/9606", "species" : { "@id" : "obo:OBO_0100026", "@type" : "@vocab" }, "description" : "dcterms:description", "display_name" : "http://www.w3.org/2004/02/skos/core#prefLabel" }
  • 33. BioSchemas & Schema.org • Low cost investment (markup in HTML) • Community growing for Life science • http://bioschemas.org • JSON-LD emerging as popular microformat language • EBI BioSamples database has over 10 million pages marked up with semantic markup • Great potential for datasets discovery (finding data generated from the same samples) • But not clear who will do the crawling and build the indexes…
  • 34. What we’ve learnt along the way • The data we see is getting better as the ontologies have matured and consensus has grown around which ontologies should be used • Crowdsourcing through tools like Zooma and OxO has good economies of scale with respect to data curation • Retrofitting the semantics in this way has limits, there’s still a long tail of data that we miss. • OWL semantics are essential for building and maintaining our ontologies, but we’ve had to devise custom ways to utilise the ontologies when building applications and populating databases • Developers want more conventional access to semantics (i.e. REST+JSON)
  • 35. Ontology team Helen Parkinson Warren ReadOla Ajigboye Funding • EMBL and OpenTargets • CORBEL This project receives funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 654248. • EJP cofund • EOSC-Life • EXCELERATE ELIXIR-EXCELERATE is funded by the European Commission within the Research Infrastructures programme of Horizon 2020, grant agreement number 676559. • Funding for Human Cell Atlas from Chan-Zuckerberg Initiative Paola Roncaglia Henriette Harmse Simon Jupp Zoe Pendlington Nicolas Matentzoglu David Osumi-Sutherland