SlideShare a Scribd company logo
1 of 41
We’re all SMILES! Building Chemical Semantic Web Services with SADI, ChEBI, and CHEMINF 1 ChEBI User Group Meeting:June 24, 2010 Michel Dumontier, Ph.D. Associate Professor of Bioinformatics Carleton University Department of Biology School of Computer Science Institute of Biochemistry Ottawa Institute of Systems Biology Ottawa-Carleton Institute of Biomedical Engineering
Syntactic Web… It takes a lot of digging to get answers 3 ChEBI User Group Meeting:June 24, 2010
We need to get to the deep web  Surface web:167 terabytes Deep web:91,000 terabytes 545-to-one ChEBI User Group Meeting:June 24, 2010 4
ChEBI User Group Meeting:June 24, 2010 and  tap into the global web of structured knowledge 5
The Semantic Web  is the new global web of knowledge 6 ChEBI User Group Meeting:June 24, 2010 It is about standards for publishing, sharing and querying  knowledge drawn from diverse sources It makes possible the answering sophisticated questions using  background knowledge
Goals Provision chemical data on the Web Find cheminformatic services that will consume the data Answer questions about chemicals by reasoning over essential chemical knowledge ChEBI User Group Meeting:June 24, 2010 7
Is caffeine a drug-like molecule? ChEBI User Group Meeting:June 24, 2010 8
Lipinski Rule of Five Rule of thumb for druglikeness (orally active in humans) 	(4 rules with multiples of 5) Less than 500 Dalton Less than 5 hydrogen bond donors Less than 10 hydrogen bond acceptors A partition coefficient value between -5 and 5 We need a more formal (machine understandable) description ChEBI User Group Meeting:June 24, 2010 9
Formal Ontology as a Strategy 10 ChEBI User Group Meeting:June 24, 2010
The Web Ontology Language (OWL) Has Explicit Semantics Can therefore be used to capture knowledge in a machine understandable way 11 ChEBI User Group Meeting:June 24, 2010
Lipinski Rule of Five Empirically derived ruleset for druglikeness 	(4 rules with multiples of 5) Less than 500 Dalton Less than 5 hydrogen bond donors Less than 10 hydrogen bond acceptors A partition coefficient value between -5 and 5 A formal description using OWL: ChEBI User Group Meeting:June 24, 2010 12
To calculate these attributes, we need access to a computable representation of the molecular structure ChEBI User Group Meeting:June 24, 2010 13 ball & stick model for caffeine
The chemical graph specifies the type and connectivity of atoms in molecules. It describes a part of chemical structureSMILES strings are common representations of the chemical graph ChEBI User Group Meeting:June 24, 2010 14 Cn1cnc2n(C)c(=O)n(C)c(=O)c12 ball & stick model for caffeine SMILES string  for caffeine
Chemical descriptors Chemical descriptors are data (quantities or values) that provide information about substances, molecular entities, and their parts (rings, atoms, bonds, etc). Sometimes they enumerate material parts, they quantify or describe qualities, functions or dispositions Often used to build Quantitative Structure Activity Relationships (QSAR) models Example descriptors : Mass values Partition coefficients Heats of formation Aromaticity values Molecular formulas ChEBI User Group Meeting:June 24, 2010 15
The Chemical Information Ontology (CHEMINF) 100 chemical descriptors 50 chemical qualities Relates descriptors to their specifications, the software that generated them (along with the running parameters, and the algorithms that they implement) Contributors: Nico Adams, Leonid Chepelev, Michel Dumontier, Janna Hastings, EgonWillighagen, Peter Murray-Rust, CristophSteinbeck ChEBI User Group Meeting:June 24, 2010 16 http://semanticchemistry.googlecode.com
CHEMINF provides the vocabulary to define an input (SMILES-annotated molecule) and an output (molecule annotated with a descriptor) ChEBI User Group Meeting:June 24, 2010 17
Ultimately, the goal is to use an OWL reasoner to reason about the attributes to determine whether the compound is drug-like ChEBI User Group Meeting:June 24, 2010 18
Semantic Automated Discovery and Integration http://sadiframework.org SADI is a framework to create Semantic Web services using OWL classes as service inputs and outputs Mark Wilkinson, UBC Michel Dumontier, Carleton University Christopher Baker, UNB 19 ChEBI User Group Meeting:June 24, 2010
SADI OWL classes in SADI are local to individual services They should uniquely specify the service input and outputs (they exactly have the right restrictions) one service’s world-view can conflict with another,but a client can use any or all maximize interoperability by reusing types and relations ChEBI User Group Meeting:June 24, 2010 20
Create code stubs using the ontology Publish the ontology to a web-accessible location http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl  Make sure that the class names are resolvable (easy when using the hash notation) http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#smiles-molecule http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#logp-molecule http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#hbdc-molecule http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#hdba-molecule http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#lipinksi-druglike-molecule Download/checkout the code 	http://sadiframework.org Run the code generator  specify the URIs that correspond to input and output types ChEBI User Group Meeting:June 24, 2010 21
Implement the functionality Java version  Uses Jena to manipulate the RDF graph Uses Maven to build from command-line or Eclipse; Invokes Jetty for service testing Chemistry We used the Chemistry Development Kit (CDK) to implement 4 services ChEBI User Group Meeting:June 24, 2010 22
Working with the service (GET) Responds to a GET by providing the service description in RDF conforms to Feta (BioMoby, myGrid) ChEBI User Group Meeting:June 24, 2010 23 curl  http://cbrass.biordf.net/logpdc/logpc <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"     xmlns:j.0="http://www.mygrid.org.uk/mygrid-moby-service#" >    <rdf:Descriptionrdf:about="">     <j.0:hasServiceDescriptionText>no description</j.0:hasServiceDescriptionText>     <j.0:hasServiceNameText rdf:datatype="http://www.w3.org/2001/XMLSchema#string">logpc</j.0:hasServiceNameText>     <j.0:hasOperation rdf:resource="#operation"/>     <rdf:typerdf:resource="http://www.mygrid.org.uk/mygrid-moby-service#serviceDescription"/>   </rdf:Description>   <rdf:Descriptionrdf:about="#input">     <j.0:objectType rdf:resource="http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#smilesmolecule"/>     <rdf:typerdf:resource="http://www.mygrid.org.uk/mygrid-moby-service#parameter"/>   </rdf:Description>   <rdf:Descriptionrdf:about="#operation">     <j.0:outputParameter rdf:resource="#output"/>     <j.0:inputParameter rdf:resource="#input"/>     <rdf:typerdf:resource="http://www.mygrid.org.uk/mygrid-moby-service#operation"/>   </rdf:Description>   <rdf:Descriptionrdf:about="#output">     <j.0:objectType rdf:resource="http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#alogpsmilesmolecule"/>     <rdf:typerdf:resource="http://www.mygrid.org.uk/mygrid-moby-service#parameter"/>   </rdf:Description> </rdf:RDF>
Working with the service (POST) Responds to a POST with service output (process an input file) ChEBI User Group Meeting:June 24, 2010 24 <rdf:RDFxmlns="http://semanticscience.org/sadi/ontology/caffeine.rdf#" xmlns:so="http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#" xmlns:owl="http://www.w3.org/2002/07/owl#" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:sio="http://semanticscience.org/resource/" xmlns:xsd="http://www.w3.org/2001/XMLSchema#">     <so:smilesmoleculerdf:about="http://semanticscience.org/sadi/ontology/caffeine.rdf#m">         <sio:SIO_000008 rdf:resource = "http://semanticscience.org/sadi/ontology/caffeine.rdf#msmiles"/>     </so:smilesmolecule>     <sio:CHEMINF_000018 rdf:about = "http://semanticscience.org/sadi/ontology/caffeine.rdf#msmiles">           <sio:SIO_000300 rdf:datatype="xsd:string">Cn1cnc2n(C)c(=O)n(C)c(=O)c12</sio:SIO_000300>     </sio:CHEMINF_000018> </rdf:RDF>  curl --data @caffeine.rdf http://cbrass.biordf.net/logpdc/logpc  <rdf:Descriptionrdf:about="http://semanticscience.org/sadi/ontology/caffeine.rdf#mdalogp">     <rdf:typerdf:resource="http://semanticscience.org/resource/CHEMINF_000251"/>     <j.0:SIO_000300 rdf:datatype="http://www.w3.org/2001/XMLSchema#double">-0.4311000000000006</j.0:SIO_000300>   </rdf:Description>
Publish and Register the service ChEBI User Group Meeting:June 24, 2010 25 http://sadiframework.org/registry
Now what? ChEBI User Group Meeting:June 24, 2010 26
ChEBI User Group Meeting:June 24, 2010 27 Semantic Health and Research Environment SHARE is an application that execute (SPARQL) queries as workflows over SADI Services
“Reckoning”dynamic discovery of instances of OWL classes through synthesis and invocation of a Web Service workflow capable of generating data described by the OWL class restrictions, followed by reasoning to classify the data into that ontology 28 ChEBI User Group Meeting:June 24, 2010
29 SPARQL is the newcool kid on the query block SQLSPARQL ChEBI User Group Meeting:June 24, 2010
SHARE SPARQL engine triple patterns are matched against service descriptions knowledge base is dynamically populated queries can contain OWL classes, which are expanded to the required triple patterns query is optimized to minimize the number of service calls and the amount of data sent over the network ChEBI User Group Meeting:June 24, 2010 30
ChEBI has data! ChEBI User Group Meeting:June 24, 2010 31
Bio2RDF provides ChEBI in RDF ChEBI User Group Meeting:June 24, 2010 32
Bio2RDF now serving over 40 billion triples of linked biological data 33 ChEBI User Group Meeting:June 24, 2010
34 ChEBI User Group Meeting:June 24, 2010
An increasing amount of machine understandable chemical data ChEBI User Group Meeting:June 24, 2010 35
Query for log p ChEBI User Group Meeting:June 24, 2010 36
Query: Is caffeine a drug-like molecule? ChEBI User Group Meeting:June 24, 2010 37
SADI ,[object Object]
 Subject of input and output must be the same
 Web services indexed by predicates
Biocatalogue will list SADI-compliant services

More Related Content

Viewers also liked

Wicked Wiki
Wicked WikiWicked Wiki
Wicked Wikijactlc
 
Generell presentasjon
Generell presentasjonGenerell presentasjon
Generell presentasjonGlenn Melby
 
Design thinking in efl context
Design thinking in efl contextDesign thinking in efl context
Design thinking in efl contextDebopriyo Roy
 
English Translation Of Go Forward Plan, Harvard Bus Review
English Translation Of Go Forward Plan, Harvard Bus ReviewEnglish Translation Of Go Forward Plan, Harvard Bus Review
English Translation Of Go Forward Plan, Harvard Bus ReviewThomas Nastas
 
Line Upgrade Deferral Scenarios for Distributed Renewable Energy Resources
Line Upgrade Deferral Scenarios for Distributed Renewable Energy ResourcesLine Upgrade Deferral Scenarios for Distributed Renewable Energy Resources
Line Upgrade Deferral Scenarios for Distributed Renewable Energy ResourcesIain Sanders
 
Tema 5 1º bach tangencias y enlaces v4
Tema 5 1º bach tangencias y enlaces v4Tema 5 1º bach tangencias y enlaces v4
Tema 5 1º bach tangencias y enlaces v4qvrrafa
 
Johnstown Pa Defense Hub
Johnstown   Pa Defense HubJohnstown   Pa Defense Hub
Johnstown Pa Defense HubBob Shark
 
GeekMeet Intro - Filip C.T.E.
GeekMeet Intro - Filip C.T.E.GeekMeet Intro - Filip C.T.E.
GeekMeet Intro - Filip C.T.E.GeekMeet
 
Steria Recruitment Presentation
Steria Recruitment PresentationSteria Recruitment Presentation
Steria Recruitment PresentationTillyNorton
 
Design Thinking in EFL Context
Design Thinking in EFL ContextDesign Thinking in EFL Context
Design Thinking in EFL ContextDebopriyo Roy
 
Camp It, June 2012, How To Design Your Bi Architecture To Capitalize on New T...
Camp It, June 2012, How To Design Your Bi Architecture To Capitalize on New T...Camp It, June 2012, How To Design Your Bi Architecture To Capitalize on New T...
Camp It, June 2012, How To Design Your Bi Architecture To Capitalize on New T...Craig Jordan
 
Spiceworksintro 1219952728712087 9
Spiceworksintro 1219952728712087 9Spiceworksintro 1219952728712087 9
Spiceworksintro 1219952728712087 9fredjr
 

Viewers also liked (20)

Wicked Wiki
Wicked WikiWicked Wiki
Wicked Wiki
 
Generell presentasjon
Generell presentasjonGenerell presentasjon
Generell presentasjon
 
Howgirlsunderstand
HowgirlsunderstandHowgirlsunderstand
Howgirlsunderstand
 
Design thinking in efl context
Design thinking in efl contextDesign thinking in efl context
Design thinking in efl context
 
English Translation Of Go Forward Plan, Harvard Bus Review
English Translation Of Go Forward Plan, Harvard Bus ReviewEnglish Translation Of Go Forward Plan, Harvard Bus Review
English Translation Of Go Forward Plan, Harvard Bus Review
 
Line Upgrade Deferral Scenarios for Distributed Renewable Energy Resources
Line Upgrade Deferral Scenarios for Distributed Renewable Energy ResourcesLine Upgrade Deferral Scenarios for Distributed Renewable Energy Resources
Line Upgrade Deferral Scenarios for Distributed Renewable Energy Resources
 
Tema 5 1º bach tangencias y enlaces v4
Tema 5 1º bach tangencias y enlaces v4Tema 5 1º bach tangencias y enlaces v4
Tema 5 1º bach tangencias y enlaces v4
 
Johnstown Pa Defense Hub
Johnstown   Pa Defense HubJohnstown   Pa Defense Hub
Johnstown Pa Defense Hub
 
GeekMeet Intro - Filip C.T.E.
GeekMeet Intro - Filip C.T.E.GeekMeet Intro - Filip C.T.E.
GeekMeet Intro - Filip C.T.E.
 
HR head dilemma ideate Assignment
HR head dilemma ideate AssignmentHR head dilemma ideate Assignment
HR head dilemma ideate Assignment
 
Steria Recruitment Presentation
Steria Recruitment PresentationSteria Recruitment Presentation
Steria Recruitment Presentation
 
Detskaya Rabota2
Detskaya Rabota2Detskaya Rabota2
Detskaya Rabota2
 
Design Thinking in EFL Context
Design Thinking in EFL ContextDesign Thinking in EFL Context
Design Thinking in EFL Context
 
Vincentvan Gogh
Vincentvan GoghVincentvan Gogh
Vincentvan Gogh
 
Mining 012 riskope is it true that pi_gs
Mining 012 riskope  is it true that pi_gsMining 012 riskope  is it true that pi_gs
Mining 012 riskope is it true that pi_gs
 
Collaborate
CollaborateCollaborate
Collaborate
 
Camp It, June 2012, How To Design Your Bi Architecture To Capitalize on New T...
Camp It, June 2012, How To Design Your Bi Architecture To Capitalize on New T...Camp It, June 2012, How To Design Your Bi Architecture To Capitalize on New T...
Camp It, June 2012, How To Design Your Bi Architecture To Capitalize on New T...
 
Spiceworksintro 1219952728712087 9
Spiceworksintro 1219952728712087 9Spiceworksintro 1219952728712087 9
Spiceworksintro 1219952728712087 9
 
Isis Credentials May 2011
Isis Credentials May 2011Isis Credentials May 2011
Isis Credentials May 2011
 
Pdf Drawings New
Pdf Drawings NewPdf Drawings New
Pdf Drawings New
 

Similar to Building Chemical Semantic Web Services with SADI, ChEBI, and CHEMINF

2010 CASCON - Towards a integrated network of data and services for the life ...
2010 CASCON - Towards a integrated network of data and services for the life ...2010 CASCON - Towards a integrated network of data and services for the life ...
2010 CASCON - Towards a integrated network of data and services for the life ...Michel Dumontier
 
2011 ebi industry workshop
2011 ebi industry workshop2011 ebi industry workshop
2011 ebi industry workshopMichel Dumontier
 
BioIT Europe 2010 - BioCatalogue
BioIT Europe 2010 - BioCatalogueBioIT Europe 2010 - BioCatalogue
BioIT Europe 2010 - BioCatalogueBioCatalogue
 
Doing Clever Things with the Semantic Web
Doing Clever Things with the Semantic WebDoing Clever Things with the Semantic Web
Doing Clever Things with the Semantic WebMathieu d'Aquin
 
MADICES Mungall 2022.pptx
MADICES Mungall 2022.pptxMADICES Mungall 2022.pptx
MADICES Mungall 2022.pptxChris Mungall
 
ACS 248th Paper 71 ChAMP Project
ACS 248th Paper 71 ChAMP ProjectACS 248th Paper 71 ChAMP Project
ACS 248th Paper 71 ChAMP ProjectStuart Chalk
 
BioIT 2009 BioCatalogue slides by Carole Goble
BioIT 2009 BioCatalogue slides by Carole GobleBioIT 2009 BioCatalogue slides by Carole Goble
BioIT 2009 BioCatalogue slides by Carole GobleBioCatalogue
 
Getting Serious About A Community Bio Service Catalogue
Getting Serious About A Community Bio Service CatalogueGetting Serious About A Community Bio Service Catalogue
Getting Serious About A Community Bio Service CatalogueBioCatalogue
 
Linked Data: Opportunities for Entrepreneurs
Linked Data: Opportunities for EntrepreneursLinked Data: Opportunities for Entrepreneurs
Linked Data: Opportunities for Entrepreneurs3 Round Stones
 
Connecting the dots: drug information and Linked Data
Connecting the dots: drug information and Linked DataConnecting the dots: drug information and Linked Data
Connecting the dots: drug information and Linked DataTomasz Adamusiak
 
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTSImplementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTSValery Tkachenko
 
Better integrations through open interfaces
Better integrations through open interfacesBetter integrations through open interfaces
Better integrations through open interfacesSteve Speicher
 
LinkML Intro July 2022.pptx PLEASE VIEW THIS ON ZENODO
LinkML Intro July 2022.pptx PLEASE VIEW THIS ON ZENODOLinkML Intro July 2022.pptx PLEASE VIEW THIS ON ZENODO
LinkML Intro July 2022.pptx PLEASE VIEW THIS ON ZENODOChris Mungall
 
Report on the International Linked Open Data for Libraries, Archives and Muse...
Report on the International Linked Open Data for Libraries, Archives and Muse...Report on the International Linked Open Data for Libraries, Archives and Muse...
Report on the International Linked Open Data for Libraries, Archives and Muse...Adrian Stevenson
 
HKU Data Curation MLIM7350 Class 10
HKU Data Curation MLIM7350 Class 10HKU Data Curation MLIM7350 Class 10
HKU Data Curation MLIM7350 Class 10Scott Edmunds
 

Similar to Building Chemical Semantic Web Services with SADI, ChEBI, and CHEMINF (20)

2010 CASCON - Towards a integrated network of data and services for the life ...
2010 CASCON - Towards a integrated network of data and services for the life ...2010 CASCON - Towards a integrated network of data and services for the life ...
2010 CASCON - Towards a integrated network of data and services for the life ...
 
2011 ebi industry workshop
2011 ebi industry workshop2011 ebi industry workshop
2011 ebi industry workshop
 
BioIT Europe 2010 - BioCatalogue
BioIT Europe 2010 - BioCatalogueBioIT Europe 2010 - BioCatalogue
BioIT Europe 2010 - BioCatalogue
 
How do I aggregate oers
How do I aggregate oersHow do I aggregate oers
How do I aggregate oers
 
Doing Clever Things with the Semantic Web
Doing Clever Things with the Semantic WebDoing Clever Things with the Semantic Web
Doing Clever Things with the Semantic Web
 
OWL and OBO
OWL and OBOOWL and OBO
OWL and OBO
 
MADICES Mungall 2022.pptx
MADICES Mungall 2022.pptxMADICES Mungall 2022.pptx
MADICES Mungall 2022.pptx
 
ACS 248th Paper 71 ChAMP Project
ACS 248th Paper 71 ChAMP ProjectACS 248th Paper 71 ChAMP Project
ACS 248th Paper 71 ChAMP Project
 
Knetminer Backend Training, Nov 2018
Knetminer Backend Training, Nov 2018Knetminer Backend Training, Nov 2018
Knetminer Backend Training, Nov 2018
 
BioIT 2009 BioCatalogue slides by Carole Goble
BioIT 2009 BioCatalogue slides by Carole GobleBioIT 2009 BioCatalogue slides by Carole Goble
BioIT 2009 BioCatalogue slides by Carole Goble
 
Getting Serious About A Community Bio Service Catalogue
Getting Serious About A Community Bio Service CatalogueGetting Serious About A Community Bio Service Catalogue
Getting Serious About A Community Bio Service Catalogue
 
Facilitating Scientific Discovery through Crowdsourcing and Distributed Parti...
Facilitating Scientific Discovery through Crowdsourcing and Distributed Parti...Facilitating Scientific Discovery through Crowdsourcing and Distributed Parti...
Facilitating Scientific Discovery through Crowdsourcing and Distributed Parti...
 
ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...
ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...
ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...
 
Linked Data: Opportunities for Entrepreneurs
Linked Data: Opportunities for EntrepreneursLinked Data: Opportunities for Entrepreneurs
Linked Data: Opportunities for Entrepreneurs
 
Connecting the dots: drug information and Linked Data
Connecting the dots: drug information and Linked DataConnecting the dots: drug information and Linked Data
Connecting the dots: drug information and Linked Data
 
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTSImplementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
 
Better integrations through open interfaces
Better integrations through open interfacesBetter integrations through open interfaces
Better integrations through open interfaces
 
LinkML Intro July 2022.pptx PLEASE VIEW THIS ON ZENODO
LinkML Intro July 2022.pptx PLEASE VIEW THIS ON ZENODOLinkML Intro July 2022.pptx PLEASE VIEW THIS ON ZENODO
LinkML Intro July 2022.pptx PLEASE VIEW THIS ON ZENODO
 
Report on the International Linked Open Data for Libraries, Archives and Muse...
Report on the International Linked Open Data for Libraries, Archives and Muse...Report on the International Linked Open Data for Libraries, Archives and Muse...
Report on the International Linked Open Data for Libraries, Archives and Muse...
 
HKU Data Curation MLIM7350 Class 10
HKU Data Curation MLIM7350 Class 10HKU Data Curation MLIM7350 Class 10
HKU Data Curation MLIM7350 Class 10
 

More from Michel Dumontier

A metadata standard for Knowledge Graphs
A metadata standard for Knowledge GraphsA metadata standard for Knowledge Graphs
A metadata standard for Knowledge GraphsMichel Dumontier
 
Data-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge GraphsData-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge GraphsMichel Dumontier
 
The Role of the FAIR Guiding Principles for an effective Learning Health System
The Role of the FAIR Guiding Principles for an effective Learning Health SystemThe Role of the FAIR Guiding Principles for an effective Learning Health System
The Role of the FAIR Guiding Principles for an effective Learning Health SystemMichel Dumontier
 
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...Michel Dumontier
 
The role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health SystemThe role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health SystemMichel Dumontier
 
Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...Michel Dumontier
 
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...Michel Dumontier
 
Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?Michel Dumontier
 
The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...Michel Dumontier
 
Keynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University DinnerKeynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University DinnerMichel Dumontier
 
The future of science and business - a UM Star Lecture
The future of science and business - a UM Star LectureThe future of science and business - a UM Star Lecture
The future of science and business - a UM Star LectureMichel Dumontier
 
Developing and assessing FAIR digital resources
Developing and assessing FAIR digital resourcesDeveloping and assessing FAIR digital resources
Developing and assessing FAIR digital resourcesMichel Dumontier
 
Advancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRAdvancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRMichel Dumontier
 
A Framework to develop the FAIR Metrics
A Framework to develop the FAIR MetricsA Framework to develop the FAIR Metrics
A Framework to develop the FAIR MetricsMichel Dumontier
 
FAIR principles and metrics for evaluation
FAIR principles and metrics for evaluationFAIR principles and metrics for evaluation
FAIR principles and metrics for evaluationMichel Dumontier
 
Towards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRnessTowards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRnessMichel Dumontier
 

More from Michel Dumontier (20)

A metadata standard for Knowledge Graphs
A metadata standard for Knowledge GraphsA metadata standard for Knowledge Graphs
A metadata standard for Knowledge Graphs
 
Data-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge GraphsData-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge Graphs
 
Evaluating FAIRness
Evaluating FAIRnessEvaluating FAIRness
Evaluating FAIRness
 
The Role of the FAIR Guiding Principles for an effective Learning Health System
The Role of the FAIR Guiding Principles for an effective Learning Health SystemThe Role of the FAIR Guiding Principles for an effective Learning Health System
The Role of the FAIR Guiding Principles for an effective Learning Health System
 
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
 
The role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health SystemThe role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health System
 
Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...
 
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
 
Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?
 
The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...
 
Keynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University DinnerKeynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University Dinner
 
The future of science and business - a UM Star Lecture
The future of science and business - a UM Star LectureThe future of science and business - a UM Star Lecture
The future of science and business - a UM Star Lecture
 
Are we FAIR yet?
Are we FAIR yet?Are we FAIR yet?
Are we FAIR yet?
 
Developing and assessing FAIR digital resources
Developing and assessing FAIR digital resourcesDeveloping and assessing FAIR digital resources
Developing and assessing FAIR digital resources
 
Advancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRAdvancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIR
 
A Framework to develop the FAIR Metrics
A Framework to develop the FAIR MetricsA Framework to develop the FAIR Metrics
A Framework to develop the FAIR Metrics
 
FAIR principles and metrics for evaluation
FAIR principles and metrics for evaluationFAIR principles and metrics for evaluation
FAIR principles and metrics for evaluation
 
Towards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRnessTowards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRness
 
Data Science for the Win
Data Science for the WinData Science for the Win
Data Science for the Win
 
2016 bmdid-mappings
2016 bmdid-mappings2016 bmdid-mappings
2016 bmdid-mappings
 

Recently uploaded

SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 

Recently uploaded (20)

SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 

Building Chemical Semantic Web Services with SADI, ChEBI, and CHEMINF

  • 1. We’re all SMILES! Building Chemical Semantic Web Services with SADI, ChEBI, and CHEMINF 1 ChEBI User Group Meeting:June 24, 2010 Michel Dumontier, Ph.D. Associate Professor of Bioinformatics Carleton University Department of Biology School of Computer Science Institute of Biochemistry Ottawa Institute of Systems Biology Ottawa-Carleton Institute of Biomedical Engineering
  • 2.
  • 3. Syntactic Web… It takes a lot of digging to get answers 3 ChEBI User Group Meeting:June 24, 2010
  • 4. We need to get to the deep web Surface web:167 terabytes Deep web:91,000 terabytes 545-to-one ChEBI User Group Meeting:June 24, 2010 4
  • 5. ChEBI User Group Meeting:June 24, 2010 and tap into the global web of structured knowledge 5
  • 6. The Semantic Web is the new global web of knowledge 6 ChEBI User Group Meeting:June 24, 2010 It is about standards for publishing, sharing and querying knowledge drawn from diverse sources It makes possible the answering sophisticated questions using background knowledge
  • 7. Goals Provision chemical data on the Web Find cheminformatic services that will consume the data Answer questions about chemicals by reasoning over essential chemical knowledge ChEBI User Group Meeting:June 24, 2010 7
  • 8. Is caffeine a drug-like molecule? ChEBI User Group Meeting:June 24, 2010 8
  • 9. Lipinski Rule of Five Rule of thumb for druglikeness (orally active in humans) (4 rules with multiples of 5) Less than 500 Dalton Less than 5 hydrogen bond donors Less than 10 hydrogen bond acceptors A partition coefficient value between -5 and 5 We need a more formal (machine understandable) description ChEBI User Group Meeting:June 24, 2010 9
  • 10. Formal Ontology as a Strategy 10 ChEBI User Group Meeting:June 24, 2010
  • 11. The Web Ontology Language (OWL) Has Explicit Semantics Can therefore be used to capture knowledge in a machine understandable way 11 ChEBI User Group Meeting:June 24, 2010
  • 12. Lipinski Rule of Five Empirically derived ruleset for druglikeness (4 rules with multiples of 5) Less than 500 Dalton Less than 5 hydrogen bond donors Less than 10 hydrogen bond acceptors A partition coefficient value between -5 and 5 A formal description using OWL: ChEBI User Group Meeting:June 24, 2010 12
  • 13. To calculate these attributes, we need access to a computable representation of the molecular structure ChEBI User Group Meeting:June 24, 2010 13 ball & stick model for caffeine
  • 14. The chemical graph specifies the type and connectivity of atoms in molecules. It describes a part of chemical structureSMILES strings are common representations of the chemical graph ChEBI User Group Meeting:June 24, 2010 14 Cn1cnc2n(C)c(=O)n(C)c(=O)c12 ball & stick model for caffeine SMILES string for caffeine
  • 15. Chemical descriptors Chemical descriptors are data (quantities or values) that provide information about substances, molecular entities, and their parts (rings, atoms, bonds, etc). Sometimes they enumerate material parts, they quantify or describe qualities, functions or dispositions Often used to build Quantitative Structure Activity Relationships (QSAR) models Example descriptors : Mass values Partition coefficients Heats of formation Aromaticity values Molecular formulas ChEBI User Group Meeting:June 24, 2010 15
  • 16. The Chemical Information Ontology (CHEMINF) 100 chemical descriptors 50 chemical qualities Relates descriptors to their specifications, the software that generated them (along with the running parameters, and the algorithms that they implement) Contributors: Nico Adams, Leonid Chepelev, Michel Dumontier, Janna Hastings, EgonWillighagen, Peter Murray-Rust, CristophSteinbeck ChEBI User Group Meeting:June 24, 2010 16 http://semanticchemistry.googlecode.com
  • 17. CHEMINF provides the vocabulary to define an input (SMILES-annotated molecule) and an output (molecule annotated with a descriptor) ChEBI User Group Meeting:June 24, 2010 17
  • 18. Ultimately, the goal is to use an OWL reasoner to reason about the attributes to determine whether the compound is drug-like ChEBI User Group Meeting:June 24, 2010 18
  • 19. Semantic Automated Discovery and Integration http://sadiframework.org SADI is a framework to create Semantic Web services using OWL classes as service inputs and outputs Mark Wilkinson, UBC Michel Dumontier, Carleton University Christopher Baker, UNB 19 ChEBI User Group Meeting:June 24, 2010
  • 20. SADI OWL classes in SADI are local to individual services They should uniquely specify the service input and outputs (they exactly have the right restrictions) one service’s world-view can conflict with another,but a client can use any or all maximize interoperability by reusing types and relations ChEBI User Group Meeting:June 24, 2010 20
  • 21. Create code stubs using the ontology Publish the ontology to a web-accessible location http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl Make sure that the class names are resolvable (easy when using the hash notation) http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#smiles-molecule http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#logp-molecule http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#hbdc-molecule http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#hdba-molecule http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#lipinksi-druglike-molecule Download/checkout the code http://sadiframework.org Run the code generator specify the URIs that correspond to input and output types ChEBI User Group Meeting:June 24, 2010 21
  • 22. Implement the functionality Java version Uses Jena to manipulate the RDF graph Uses Maven to build from command-line or Eclipse; Invokes Jetty for service testing Chemistry We used the Chemistry Development Kit (CDK) to implement 4 services ChEBI User Group Meeting:June 24, 2010 22
  • 23. Working with the service (GET) Responds to a GET by providing the service description in RDF conforms to Feta (BioMoby, myGrid) ChEBI User Group Meeting:June 24, 2010 23 curl http://cbrass.biordf.net/logpdc/logpc <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:j.0="http://www.mygrid.org.uk/mygrid-moby-service#" > <rdf:Descriptionrdf:about=""> <j.0:hasServiceDescriptionText>no description</j.0:hasServiceDescriptionText> <j.0:hasServiceNameText rdf:datatype="http://www.w3.org/2001/XMLSchema#string">logpc</j.0:hasServiceNameText> <j.0:hasOperation rdf:resource="#operation"/> <rdf:typerdf:resource="http://www.mygrid.org.uk/mygrid-moby-service#serviceDescription"/> </rdf:Description> <rdf:Descriptionrdf:about="#input"> <j.0:objectType rdf:resource="http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#smilesmolecule"/> <rdf:typerdf:resource="http://www.mygrid.org.uk/mygrid-moby-service#parameter"/> </rdf:Description> <rdf:Descriptionrdf:about="#operation"> <j.0:outputParameter rdf:resource="#output"/> <j.0:inputParameter rdf:resource="#input"/> <rdf:typerdf:resource="http://www.mygrid.org.uk/mygrid-moby-service#operation"/> </rdf:Description> <rdf:Descriptionrdf:about="#output"> <j.0:objectType rdf:resource="http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#alogpsmilesmolecule"/> <rdf:typerdf:resource="http://www.mygrid.org.uk/mygrid-moby-service#parameter"/> </rdf:Description> </rdf:RDF>
  • 24. Working with the service (POST) Responds to a POST with service output (process an input file) ChEBI User Group Meeting:June 24, 2010 24 <rdf:RDFxmlns="http://semanticscience.org/sadi/ontology/caffeine.rdf#" xmlns:so="http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#" xmlns:owl="http://www.w3.org/2002/07/owl#" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:sio="http://semanticscience.org/resource/" xmlns:xsd="http://www.w3.org/2001/XMLSchema#"> <so:smilesmoleculerdf:about="http://semanticscience.org/sadi/ontology/caffeine.rdf#m"> <sio:SIO_000008 rdf:resource = "http://semanticscience.org/sadi/ontology/caffeine.rdf#msmiles"/> </so:smilesmolecule> <sio:CHEMINF_000018 rdf:about = "http://semanticscience.org/sadi/ontology/caffeine.rdf#msmiles"> <sio:SIO_000300 rdf:datatype="xsd:string">Cn1cnc2n(C)c(=O)n(C)c(=O)c12</sio:SIO_000300> </sio:CHEMINF_000018> </rdf:RDF> curl --data @caffeine.rdf http://cbrass.biordf.net/logpdc/logpc <rdf:Descriptionrdf:about="http://semanticscience.org/sadi/ontology/caffeine.rdf#mdalogp"> <rdf:typerdf:resource="http://semanticscience.org/resource/CHEMINF_000251"/> <j.0:SIO_000300 rdf:datatype="http://www.w3.org/2001/XMLSchema#double">-0.4311000000000006</j.0:SIO_000300> </rdf:Description>
  • 25. Publish and Register the service ChEBI User Group Meeting:June 24, 2010 25 http://sadiframework.org/registry
  • 26. Now what? ChEBI User Group Meeting:June 24, 2010 26
  • 27. ChEBI User Group Meeting:June 24, 2010 27 Semantic Health and Research Environment SHARE is an application that execute (SPARQL) queries as workflows over SADI Services
  • 28. “Reckoning”dynamic discovery of instances of OWL classes through synthesis and invocation of a Web Service workflow capable of generating data described by the OWL class restrictions, followed by reasoning to classify the data into that ontology 28 ChEBI User Group Meeting:June 24, 2010
  • 29. 29 SPARQL is the newcool kid on the query block SQLSPARQL ChEBI User Group Meeting:June 24, 2010
  • 30. SHARE SPARQL engine triple patterns are matched against service descriptions knowledge base is dynamically populated queries can contain OWL classes, which are expanded to the required triple patterns query is optimized to minimize the number of service calls and the amount of data sent over the network ChEBI User Group Meeting:June 24, 2010 30
  • 31. ChEBI has data! ChEBI User Group Meeting:June 24, 2010 31
  • 32. Bio2RDF provides ChEBI in RDF ChEBI User Group Meeting:June 24, 2010 32
  • 33. Bio2RDF now serving over 40 billion triples of linked biological data 33 ChEBI User Group Meeting:June 24, 2010
  • 34. 34 ChEBI User Group Meeting:June 24, 2010
  • 35. An increasing amount of machine understandable chemical data ChEBI User Group Meeting:June 24, 2010 35
  • 36. Query for log p ChEBI User Group Meeting:June 24, 2010 36
  • 37. Query: Is caffeine a drug-like molecule? ChEBI User Group Meeting:June 24, 2010 37
  • 38.
  • 39. Subject of input and output must be the same
  • 40. Web services indexed by predicates
  • 41. Biocatalogue will list SADI-compliant services
  • 42. Tavernaplugin to work with SADI services
  • 43. Protégé 4.1 plugin to create SADI services
  • 44. Simplified migration path for existing web services (java, perl)38 ChEBI User Group Meeting:June 24, 2010
  • 45. Benefits Data remains distributed – no warehouse! Data is not “exposed” as a SPARQL endpoint greater provider-control over computational resources Yet data appears to be a SPARQL endpoint… no modification of SPARQL or reasoner required. ChEBI User Group Meeting:June 24, 2010 39
  • 46. Join Us! SADI and CardioSHARE are Open Source Come join us – we’re having a lot of fun!! http://sadiframework.org ChEBI User Group Meeting:June 24, 2010 40
  • 47. ChEBI User Group Meeting:June 24, 2010 41 Acknowledgements Leonid Chepelev (implementing the services) Luke McCarthy (technical support) Mark Wilkinson (vision and leadership) CHEMINF Group Janna Hastings Nico Adams EgonWillighagen This research is supported by The Heart + Stroke Foundation of BC and Yukon, Microsoft Research, The Canadian Institutes of Health Research, The Natural Sciences and Engineering Research Council of Canada and CANARIE.

Editor's Notes

  1. Can’t answer questions that require background knowledge
  2. Research – that’s what brought you hereSkils – marketable in whatever you choose to do thereafterKnowledeable – where the field has been and where it is goingImprove oral and written scientific communication skillsResearch – tell people what you’ve been doingTrack progress – develop a sense of progress