SlideShare une entreprise Scribd logo
1  sur  2
Télécharger pour lire hors ligne
Linked Data, an opportunity to mitigate complexity
              pharmaceutical research and development
                        Bo Andersson                                                        Kerstin Forsberg
                        AstraZeneca R&D                                                      AstraZeneca R&D
                          221 87 LUND                                                        431 83 MÖLNDAL
                         +46 46 337621                                                        +46 31 7065318
             bo.h.andersson@astrazeneca.com                                        kerstin.l.forsberg@astrazeneca.com

ABSTRACT                                                                                 Table 1. SemanticWeb standards
In the paper we present opportunities and challenges in applying         Resource Description Framework (RDF) to model
linked data principles to improve the research utility of data to        statements as so called RDF-triples of subject/predicate/object,
mitigate complexity in pharmaceutical research and development.          e.g. clinical study/active ingredient/omeprazole, often encoded
Association and interpretation of the data needed for research and       using the RDF/XML format.
development in a pharmaceutical company has become a too                 Universal Resource Identifiers (URI:s) to globally identify
complex task for individual scientists, or even teams to handle.         entities, such as proteins, organ systems, clinical studies, e.g.
Scientists need data to be organized for associations, prepared for      http//data.pharma.com/id/clinicalstudy/D8180C00011, and also
not yet defined use and ready for automation where computers can         data content such as clinical observation results.
work alongside the scientists.                                           Web Ontology Language (OWL) to express vocabularies of
                                                                         terms representing identified entities and relationships in
Keywords                                                                 portions of reality, e.g. the terms ‘active ingredient’ and
linked data, semantic web, pharmaceutical research                       ‘pharmaceutical product’ for two of the entities identified in
                                                                         the Translational Medicine Ontology (TMO).
1. INTRODUCTION                                                          Simple Knowledge Organization System (SKOS) to express
During the WWW2007 conference a breakthrough of the Linked               taxonomies and controlled vocabularies, e.g. the terms
Data idea happened in a session where web experts demonstrated           ‘tauopathies’ and ‘dementia’ as broader terms than ‘alzheimer
the power of a new generation of the web, a web of data. For us          disease’ in the Medical Subject Headings (MeSH).
attending the session it was hard to imagine the full potential on
what this idea would mean for individual scientists and for a            Furthermore, access to data in a standard model is not enough, but
pharmaceutical company.                                                  relationships among data must also be made available as RDF
In the Large Knowledge Collider (LarKC) project scientists               triples to create a web of data. This collection of associated
working in early clinical development have identified [1] how            datasets can also be referred to as Linked Data.
successful research and development rely on scientists having            A typical case of a large linked dataset is DBpedia, which makes
access to data and ability to utilize it, to handle uncertainty and to   the structured content of Wikipedia available as RDF triples. The
prepare for the unexpected. To meet these challenges we have             beauty of this is not only that Wikipedia data becomes available
identified three key enablers: linked data principles, semantic web      as linked data, but also that it connects to other datasets publicly
standards and open ontologies.                                           available such as Drug Bank. For example, the entity identified
One activity in the W3C interest group for semantic web in Health        with the URI - http://dbpedia.org/resource/Omeprazole, is linked
Care and Life Science (HCLS) has been to link publicly available         to the descriptions of the same entity in Drug Bank, also made
datasets [2] such as Drug Bank and ClinicalTrial.gov. These              available as linked data.
datasets are now part of the growing Linked Open Data cloud. An          This means that facts available in the two different datasets can be
important learning from the collaborative work across health care        combined. For example the name Losec, and the 70+ other brand
organizations, pharmaceutical companies and academia groups is           names for Omeprazole, from Drug Bank can be integrated by
that we all need the same datasets, however for different decisions      different applications and can also be directly queried, e.g. when
and different types of applications.                                     searching for brand names for drugs categorized in Wikipedia as
                                                                         essential medicines by WHO.
2. LINKED DATA FOR PHARMA
A federated knowledge repository of data, built with Semantic            3. OPPORTUNTIES FOR PHARMA
Web standards, is a new generation of knowledge base beyond              Scientists routinely utilize publicly available documents as a
separated document repositories. Gaining value from the                  complement to data in internal and licensed sources. Most of the
federation of knowledge requires data to be available using a            scientists have preferred sources they utilize to gain knowledge
standard model, of so called RDF triples.                                from and to associate with the data relevant for their decisions and
                                                                         questions. The knowledge mining is often done for individual
                                                                         tasks, and in many cases done manually by scientists together
 Copyright is held by the author/owner(s).                               with informatics and library experts.
 LWDM 2011, March 25, 2011, Uppsala, Sweden.                             Imagine an environment where internal data easily connects with
 Copyright 2011 ACM 978-1-4503-0608-9/11/03 ...$10.00.
                                                                         external datasets from different sources. The associations made,
do make use of relationships developed by experts, in terms of so      should be balanced with utilization of more mundane and relaxed
called ontologies, and thereby help scientists find answerers not      relationships as for example from DBpedia, a source humans are
obvious today. While broaden the scientific spectrum where             comfortable to use.
individual scientists find their answers, this would also stimulate    The research utility of shared datasets will also require explicit
more cross scientific sharing and establish a foundation for           provenanceiii information to make it possible for scientists and
improved computational support in research and development.            computers to make trust judgments about the datasets.
We therefore propose the prospective use of linked data principles     Linked datasets also needs to be described with a vocabulary that
as a component of the information infrastructure for                   allow for automation of discovery, selection and association. The
pharmaceutical research and development, to make sure internal         Vocabulary of Interlinked Datasets (void)iv standard allows
data connects to external data. This is a step towards an              formally describing shared datasets and thereby improving the
information infrastructure where data is prepared for association,     research utility.
for not yet defined use, and for facilitating computers to function
alongside scientists to mitigate the complexity.                       A pragmatic approach for linked data management is needed that
                                                                       motivates people to constructively face the tension between
4. CHALLENGES FOR PHARMA                                               different needs. This must be an iterative process where we
From internal semantic web projects, by participation in the           continuously improve the research utility of shared datasets by
LarKCi and in W3C HCLSii since 2006 we have built experience           applying three key enablers:
in linking data. Although integration of disparate datasets today is           Linked Data principles, including establishing global
easily done by experts, there are still more to do. We need to                  namespaces of URI:s
reach a maturity level where datasets is routinely published in a              Semantic web standards, also for provenance data and
way that automatically associates them with other datasets.                     dataset descriptions
The Linking Open Drug Data (LODD) team in W3C HCLS                             Open ontologies, with the required level of precision
revealed various challenges during the work to connect public                   and consistency
available datasets such as DrugBank and ClinicalTrial.gov. Many        And, also a process where computers are used to work alongside
of them relate to technology but a key aspect is peoples’              scientists to automatically:
willingness to constructively face the tension of opposing needs to
                                                                                 Identify entities and assign URI:s
improve the research utility of shared datasets.
                                                                                 Structure data and capture provenance data
Relationships are a significant challenge for shared datasets in                 Mine for causal relationships and inconsistency
pharmaceutical research and development. We often have a strong
prevalence of terminology conflicts, synonyms, and homonyms.
For instance, the word ‘drug’ can refer to the whole                   REFERENCES
pharmaceutical product, or just the active ingredient.                 [1] B. Andersson, V. Momtchev, Requirements summary and
Biomedical ontologies represent what entities exist in portions of         data repository, Large Knowledge Collider (LarKC), 2008.
the biological and clinical reality, and how they relate to each           http://www.larkc.eu/wp-content/uploads/2008/01/larkc_d7a-
other. One such ontology for translational medicine (TMO) [3] is           11_requirements-summary-and-data-repository_m6.pdf
developed by a W3C HCLS team with representatives from                 [2] A. Jentzsch, et.al.. Enabling Tailored Therapeutics with
pharmaceutical companies, health care providers, National Centre           Linked Data. In Proceedings of the WWW2009 workshop on
for Biomedical Ontology (NCBO) and academia groups. TMO                    Linked Data on the Web (LDOW2009), 2009.
structure the different meanings of ‘drug’, by identifying different       http://esw.w3.org/images/5/57/HCLSIG%24Final_ldow2009
types of entities and making relationships between them explicit:          .pdf
’Molecular entity’ for single molecule kinds. ‘Active ingredient’
                                                                       [3] M. Dumontier, et.al.. The Translational Medicine Ontology:
as a role played by a chemical substance which has the disposition
                                                                           Driving personalized medicine by bridging the gap from
to treat a certain disease and is part of a ‘pharmaceutical
                                                                           bedside to bench. In Proceedings of the 13th Annual Bio-
formulation’ for biologically active chemicals. A ‘Pharmaceutical
                                                                           Ontologies Meeting, Boston, USA (Bio-Ontologies 2010),
product’ is a formulated pharmaceutical that has been approved.
                                                                           2010. http://dumontierlab.com/pdf/2010_BIOONT_TMO.pdf
The difficulty in using precise and consistent defined relationships
in ontologies, such as the TMO, for improved research utility




i
     http://www.larkc.eu/
ii
      http://www.w3.org/2001/sw/hcls/
iii
      http://www.w3.org/2005/Incubator/prov/XGR-prov/
iv
      http://www.w3.org/TR/2011/NOTE-void-20110303/

Contenu connexe

Tendances

DataTags, The Tags Toolset, and Dataverse Integration
DataTags, The Tags Toolset, and Dataverse IntegrationDataTags, The Tags Toolset, and Dataverse Integration
DataTags, The Tags Toolset, and Dataverse IntegrationMichael Bar-Sinai
 
Joint data citation principles slide set v2
Joint data citation principles slide set v2Joint data citation principles slide set v2
Joint data citation principles slide set v2force11
 
Big Data Repository for Structural Biology: Challenges and Opportunities by P...
Big Data Repository for Structural Biology: Challenges and Opportunities by P...Big Data Repository for Structural Biology: Challenges and Opportunities by P...
Big Data Repository for Structural Biology: Challenges and Opportunities by P...datascienceiqss
 
Where is the opportunity for libraries in the collaborative data infrastructure?
Where is the opportunity for libraries in the collaborative data infrastructure?Where is the opportunity for libraries in the collaborative data infrastructure?
Where is the opportunity for libraries in the collaborative data infrastructure?LIBER Europe
 
dkNET Annual Meeting - June 2017
dkNET Annual Meeting - June 2017dkNET Annual Meeting - June 2017
dkNET Annual Meeting - June 2017dkNET
 
A Data Citation Roadmap for Scholarly Data Repositories
A Data Citation Roadmap for Scholarly Data RepositoriesA Data Citation Roadmap for Scholarly Data Repositories
A Data Citation Roadmap for Scholarly Data RepositoriesLIBER Europe
 
Preparing your data for sharing and publishing
Preparing your data for sharing and publishingPreparing your data for sharing and publishing
Preparing your data for sharing and publishingVarsha Khodiyar
 
Summary of data citation synthesis activity & Review
Summary of data citation synthesis activity & ReviewSummary of data citation synthesis activity & Review
Summary of data citation synthesis activity & ReviewMicah Altman
 
Metadata lecture 3, metadata schemes
Metadata lecture 3, metadata schemesMetadata lecture 3, metadata schemes
Metadata lecture 3, metadata schemesRichard.Sapon-White
 
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinai
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinaiDataTags: Sharing Privacy Sensitive Data by Michael Bar-sinai
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinaidatascienceiqss
 
Blue Canopy Semantic Web Approach v25 brief
Blue Canopy Semantic Web Approach v25 briefBlue Canopy Semantic Web Approach v25 brief
Blue Canopy Semantic Web Approach v25 briefNick Savage
 
dkNET-NURSA Challenge Kick-Off Webinar 04/27/2017
dkNET-NURSA Challenge Kick-Off Webinar 04/27/2017dkNET-NURSA Challenge Kick-Off Webinar 04/27/2017
dkNET-NURSA Challenge Kick-Off Webinar 04/27/2017dkNET
 
Data Publishing at Harvard's Research Data Access Symposium
Data Publishing at Harvard's Research Data Access SymposiumData Publishing at Harvard's Research Data Access Symposium
Data Publishing at Harvard's Research Data Access SymposiumMerce Crosas
 
Data Repositories Impact
Data Repositories ImpactData Repositories Impact
Data Repositories ImpactMerce Crosas
 
Komatsoulis internet2 executive track
Komatsoulis internet2 executive trackKomatsoulis internet2 executive track
Komatsoulis internet2 executive trackGeorge Komatsoulis
 
ChemConnect: Poster for European Combustion Meeting 2017
ChemConnect: Poster for European Combustion Meeting 2017ChemConnect: Poster for European Combustion Meeting 2017
ChemConnect: Poster for European Combustion Meeting 2017Edward Blurock
 
Publishing in SCI journals - Chinese translated
Publishing in SCI journals - Chinese translatedPublishing in SCI journals - Chinese translated
Publishing in SCI journals - Chinese translatedRoger Watson
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesASIS&T
 

Tendances (20)

DataTags, The Tags Toolset, and Dataverse Integration
DataTags, The Tags Toolset, and Dataverse IntegrationDataTags, The Tags Toolset, and Dataverse Integration
DataTags, The Tags Toolset, and Dataverse Integration
 
Joint data citation principles slide set v2
Joint data citation principles slide set v2Joint data citation principles slide set v2
Joint data citation principles slide set v2
 
Brennan "The National Library of Medicine – Anticipating our Third Century"
Brennan "The National Library of Medicine – Anticipating our Third Century"Brennan "The National Library of Medicine – Anticipating our Third Century"
Brennan "The National Library of Medicine – Anticipating our Third Century"
 
Or 2013-abrams-sharing-data-rich-research
Or 2013-abrams-sharing-data-rich-researchOr 2013-abrams-sharing-data-rich-research
Or 2013-abrams-sharing-data-rich-research
 
Big Data Repository for Structural Biology: Challenges and Opportunities by P...
Big Data Repository for Structural Biology: Challenges and Opportunities by P...Big Data Repository for Structural Biology: Challenges and Opportunities by P...
Big Data Repository for Structural Biology: Challenges and Opportunities by P...
 
Where is the opportunity for libraries in the collaborative data infrastructure?
Where is the opportunity for libraries in the collaborative data infrastructure?Where is the opportunity for libraries in the collaborative data infrastructure?
Where is the opportunity for libraries in the collaborative data infrastructure?
 
dkNET Annual Meeting - June 2017
dkNET Annual Meeting - June 2017dkNET Annual Meeting - June 2017
dkNET Annual Meeting - June 2017
 
A Data Citation Roadmap for Scholarly Data Repositories
A Data Citation Roadmap for Scholarly Data RepositoriesA Data Citation Roadmap for Scholarly Data Repositories
A Data Citation Roadmap for Scholarly Data Repositories
 
Preparing your data for sharing and publishing
Preparing your data for sharing and publishingPreparing your data for sharing and publishing
Preparing your data for sharing and publishing
 
Summary of data citation synthesis activity & Review
Summary of data citation synthesis activity & ReviewSummary of data citation synthesis activity & Review
Summary of data citation synthesis activity & Review
 
Metadata lecture 3, metadata schemes
Metadata lecture 3, metadata schemesMetadata lecture 3, metadata schemes
Metadata lecture 3, metadata schemes
 
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinai
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinaiDataTags: Sharing Privacy Sensitive Data by Michael Bar-sinai
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinai
 
Blue Canopy Semantic Web Approach v25 brief
Blue Canopy Semantic Web Approach v25 briefBlue Canopy Semantic Web Approach v25 brief
Blue Canopy Semantic Web Approach v25 brief
 
dkNET-NURSA Challenge Kick-Off Webinar 04/27/2017
dkNET-NURSA Challenge Kick-Off Webinar 04/27/2017dkNET-NURSA Challenge Kick-Off Webinar 04/27/2017
dkNET-NURSA Challenge Kick-Off Webinar 04/27/2017
 
Data Publishing at Harvard's Research Data Access Symposium
Data Publishing at Harvard's Research Data Access SymposiumData Publishing at Harvard's Research Data Access Symposium
Data Publishing at Harvard's Research Data Access Symposium
 
Data Repositories Impact
Data Repositories ImpactData Repositories Impact
Data Repositories Impact
 
Komatsoulis internet2 executive track
Komatsoulis internet2 executive trackKomatsoulis internet2 executive track
Komatsoulis internet2 executive track
 
ChemConnect: Poster for European Combustion Meeting 2017
ChemConnect: Poster for European Combustion Meeting 2017ChemConnect: Poster for European Combustion Meeting 2017
ChemConnect: Poster for European Combustion Meeting 2017
 
Publishing in SCI journals - Chinese translated
Publishing in SCI journals - Chinese translatedPublishing in SCI journals - Chinese translated
Publishing in SCI journals - Chinese translated
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication Repositories
 

Similaire à Linked data in pharma R&D

Neuroinformatics_Databses_Ontologies_Federated Database.pptx
Neuroinformatics_Databses_Ontologies_Federated Database.pptxNeuroinformatics_Databses_Ontologies_Federated Database.pptx
Neuroinformatics_Databses_Ontologies_Federated Database.pptxJagannath University
 
Neuroinformatics Databases Ontologies Federated Database.pptx
Neuroinformatics Databases Ontologies Federated Database.pptxNeuroinformatics Databases Ontologies Federated Database.pptx
Neuroinformatics Databases Ontologies Federated Database.pptxJagannath University
 
Lankade data Vinnova webbinarium
Lankade data Vinnova webbinarium Lankade data Vinnova webbinarium
Lankade data Vinnova webbinarium Kerstin Forsberg
 
Breaking Down Information Silos
Breaking Down Information SilosBreaking Down Information Silos
Breaking Down Information SilosChris Waller
 
Ontology Based Information Extraction for Disease Intelligence
Ontology Based Information Extraction for Disease Intelligence Ontology Based Information Extraction for Disease Intelligence
Ontology Based Information Extraction for Disease Intelligence IJORCS
 
David Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published recordDavid Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published recordJisc
 
Force11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscapeForce11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscapemhaendel
 
Semantics and linked data at astra zeneca
Semantics and linked data at astra zenecaSemantics and linked data at astra zeneca
Semantics and linked data at astra zenecaKerstin Forsberg
 
A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...
A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...
A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...Eric Stephan
 
Pitch for Open Drug Discovery teams
Pitch for Open Drug Discovery teamsPitch for Open Drug Discovery teams
Pitch for Open Drug Discovery teamsSean Ekins
 
Empowering Search Through 3RDi Semantic Enrichment
Empowering Search Through 3RDi Semantic EnrichmentEmpowering Search Through 3RDi Semantic Enrichment
Empowering Search Through 3RDi Semantic EnrichmentThe Digital Group
 
An Open Annotation Ontology For Science On Web 3.0
An Open Annotation Ontology For Science On Web 3.0An Open Annotation Ontology For Science On Web 3.0
An Open Annotation Ontology For Science On Web 3.0Natasha Grant
 
Open science curriculum for students, June 2019
Open science curriculum for students, June 2019Open science curriculum for students, June 2019
Open science curriculum for students, June 2019Dag Endresen
 
Linked Data: Why Bother?
Linked Data:  Why Bother?Linked Data:  Why Bother?
Linked Data: Why Bother?Jennifer Bowen
 
Faster R & D Analysis Tool - TRG
Faster R & D Analysis Tool - TRG Faster R & D Analysis Tool - TRG
Faster R & D Analysis Tool - TRG TRG
 

Similaire à Linked data in pharma R&D (20)

Neuroinformatics_Databses_Ontologies_Federated Database.pptx
Neuroinformatics_Databses_Ontologies_Federated Database.pptxNeuroinformatics_Databses_Ontologies_Federated Database.pptx
Neuroinformatics_Databses_Ontologies_Federated Database.pptx
 
Neuroinformatics Databases Ontologies Federated Database.pptx
Neuroinformatics Databases Ontologies Federated Database.pptxNeuroinformatics Databases Ontologies Federated Database.pptx
Neuroinformatics Databases Ontologies Federated Database.pptx
 
Lankade data Vinnova webbinarium
Lankade data Vinnova webbinarium Lankade data Vinnova webbinarium
Lankade data Vinnova webbinarium
 
Breaking Down Information Silos
Breaking Down Information SilosBreaking Down Information Silos
Breaking Down Information Silos
 
Ontology Based Information Extraction for Disease Intelligence
Ontology Based Information Extraction for Disease Intelligence Ontology Based Information Extraction for Disease Intelligence
Ontology Based Information Extraction for Disease Intelligence
 
Why open drug discovery needs four simple rules for licensing data and models
Why open drug discovery needs four simple rules for licensing data and modelsWhy open drug discovery needs four simple rules for licensing data and models
Why open drug discovery needs four simple rules for licensing data and models
 
David Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published recordDavid Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published record
 
Force11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscapeForce11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscape
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Semantics and linked data at astra zeneca
Semantics and linked data at astra zenecaSemantics and linked data at astra zeneca
Semantics and linked data at astra zeneca
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
 
A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...
A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...
A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...
 
Pitch for Open Drug Discovery teams
Pitch for Open Drug Discovery teamsPitch for Open Drug Discovery teams
Pitch for Open Drug Discovery teams
 
Empowering Search Through 3RDi Semantic Enrichment
Empowering Search Through 3RDi Semantic EnrichmentEmpowering Search Through 3RDi Semantic Enrichment
Empowering Search Through 3RDi Semantic Enrichment
 
An Open Annotation Ontology For Science On Web 3.0
An Open Annotation Ontology For Science On Web 3.0An Open Annotation Ontology For Science On Web 3.0
An Open Annotation Ontology For Science On Web 3.0
 
Current opinions in drug discovery public compound databases
Current opinions in drug discovery public compound databasesCurrent opinions in drug discovery public compound databases
Current opinions in drug discovery public compound databases
 
Open science curriculum for students, June 2019
Open science curriculum for students, June 2019Open science curriculum for students, June 2019
Open science curriculum for students, June 2019
 
Linked Data: Why Bother?
Linked Data:  Why Bother?Linked Data:  Why Bother?
Linked Data: Why Bother?
 
Faster R & D Analysis Tool - TRG
Faster R & D Analysis Tool - TRG Faster R & D Analysis Tool - TRG
Faster R & D Analysis Tool - TRG
 
Public Compound Databases
Public Compound DatabasesPublic Compound Databases
Public Compound Databases
 

Plus de Kerstin Forsberg

Linked Data efforts for data standards in biopharma and healthcare
Linked Data efforts for data standards in biopharma and healthcareLinked Data efforts for data standards in biopharma and healthcare
Linked Data efforts for data standards in biopharma and healthcareKerstin Forsberg
 
Linked data presentation for who umc 21 jan 2015
Linked data presentation for who umc 21 jan 2015Linked data presentation for who umc 21 jan 2015
Linked data presentation for who umc 21 jan 2015Kerstin Forsberg
 
A Justification-based Semantic Framework for Representing, Evaluating and Uti...
A Justification-based Semantic Framework for Representing, Evaluating and Uti...A Justification-based Semantic Framework for Representing, Evaluating and Uti...
A Justification-based Semantic Framework for Representing, Evaluating and Uti...Kerstin Forsberg
 
MIE2014: A Framework for Evaluating and Utilizing Medical Terminology Mappings
MIE2014: A Framework for Evaluating and Utilizing Medical Terminology Mappings MIE2014: A Framework for Evaluating and Utilizing Medical Terminology Mappings
MIE2014: A Framework for Evaluating and Utilizing Medical Terminology Mappings Kerstin Forsberg
 
Pushing back, standards and standard organizations in a Semantic Web enabled ...
Pushing back, standards and standard organizations in a Semantic Web enabled ...Pushing back, standards and standard organizations in a Semantic Web enabled ...
Pushing back, standards and standard organizations in a Semantic Web enabled ...Kerstin Forsberg
 
CDISC2RDF overview with examples
CDISC2RDF overview with examplesCDISC2RDF overview with examples
CDISC2RDF overview with examplesKerstin Forsberg
 
CDISC2RDF poster for Conference on Data Integration in the Life Sciences 2013
CDISC2RDF poster for Conference on Data Integration in the Life Sciences 2013CDISC2RDF poster for Conference on Data Integration in the Life Sciences 2013
CDISC2RDF poster for Conference on Data Integration in the Life Sciences 2013Kerstin Forsberg
 
Linked open data it univ 22 nov 2012
Linked open data it univ 22 nov 2012Linked open data it univ 22 nov 2012
Linked open data it univ 22 nov 2012Kerstin Forsberg
 
Linked open data example uk spending
Linked open data example uk spendingLinked open data example uk spending
Linked open data example uk spendingKerstin Forsberg
 
Semantic models for cdisc based standards and metadata management (1)
Semantic models for cdisc based standards and metadata management (1)Semantic models for cdisc based standards and metadata management (1)
Semantic models for cdisc based standards and metadata management (1)Kerstin Forsberg
 
Semantic models for cdisc based standards and metadata management
Semantic models for cdisc based standards and metadata managementSemantic models for cdisc based standards and metadata management
Semantic models for cdisc based standards and metadata managementKerstin Forsberg
 
Linked data in pharma it univ 2 april 2012
Linked data in pharma it univ 2 april 2012Linked data in pharma it univ 2 april 2012
Linked data in pharma it univ 2 april 2012Kerstin Forsberg
 
Linked data introduction w exempel
Linked data introduction w exempelLinked data introduction w exempel
Linked data introduction w exempelKerstin Forsberg
 
Designing and launching the Clinical Reference Library
Designing and launching the Clinical Reference LibraryDesigning and launching the Clinical Reference Library
Designing and launching the Clinical Reference LibraryKerstin Forsberg
 
Linking clinical data standards
Linking clinical data standardsLinking clinical data standards
Linking clinical data standardsKerstin Forsberg
 
Metadata in general and Dublin Core in specific; some experiences
Metadata in general and Dublin Core in specific; some experiencesMetadata in general and Dublin Core in specific; some experiences
Metadata in general and Dublin Core in specific; some experiencesKerstin Forsberg
 

Plus de Kerstin Forsberg (20)

Linked Data efforts for data standards in biopharma and healthcare
Linked Data efforts for data standards in biopharma and healthcareLinked Data efforts for data standards in biopharma and healthcare
Linked Data efforts for data standards in biopharma and healthcare
 
Linked data presentation for who umc 21 jan 2015
Linked data presentation for who umc 21 jan 2015Linked data presentation for who umc 21 jan 2015
Linked data presentation for who umc 21 jan 2015
 
A Justification-based Semantic Framework for Representing, Evaluating and Uti...
A Justification-based Semantic Framework for Representing, Evaluating and Uti...A Justification-based Semantic Framework for Representing, Evaluating and Uti...
A Justification-based Semantic Framework for Representing, Evaluating and Uti...
 
MIE2014: A Framework for Evaluating and Utilizing Medical Terminology Mappings
MIE2014: A Framework for Evaluating and Utilizing Medical Terminology Mappings MIE2014: A Framework for Evaluating and Utilizing Medical Terminology Mappings
MIE2014: A Framework for Evaluating and Utilizing Medical Terminology Mappings
 
Pushing back, standards and standard organizations in a Semantic Web enabled ...
Pushing back, standards and standard organizations in a Semantic Web enabled ...Pushing back, standards and standard organizations in a Semantic Web enabled ...
Pushing back, standards and standard organizations in a Semantic Web enabled ...
 
CDISC2RDF overview with examples
CDISC2RDF overview with examplesCDISC2RDF overview with examples
CDISC2RDF overview with examples
 
CDISC2RDF poster for Conference on Data Integration in the Life Sciences 2013
CDISC2RDF poster for Conference on Data Integration in the Life Sciences 2013CDISC2RDF poster for Conference on Data Integration in the Life Sciences 2013
CDISC2RDF poster for Conference on Data Integration in the Life Sciences 2013
 
Cdisc2 rdf overveiw
Cdisc2 rdf overveiwCdisc2 rdf overveiw
Cdisc2 rdf overveiw
 
Linked open data it univ 22 nov 2012
Linked open data it univ 22 nov 2012Linked open data it univ 22 nov 2012
Linked open data it univ 22 nov 2012
 
Linked open data example uk spending
Linked open data example uk spendingLinked open data example uk spending
Linked open data example uk spending
 
Semantic models for cdisc based standards and metadata management (1)
Semantic models for cdisc based standards and metadata management (1)Semantic models for cdisc based standards and metadata management (1)
Semantic models for cdisc based standards and metadata management (1)
 
Semantic models for cdisc based standards and metadata management
Semantic models for cdisc based standards and metadata managementSemantic models for cdisc based standards and metadata management
Semantic models for cdisc based standards and metadata management
 
Linked data in pharma it univ 2 april 2012
Linked data in pharma it univ 2 april 2012Linked data in pharma it univ 2 april 2012
Linked data in pharma it univ 2 april 2012
 
Linked data introduction w exempel
Linked data introduction w exempelLinked data introduction w exempel
Linked data introduction w exempel
 
Designing and launching the Clinical Reference Library
Designing and launching the Clinical Reference LibraryDesigning and launching the Clinical Reference Library
Designing and launching the Clinical Reference Library
 
Linking clinical data standards
Linking clinical data standardsLinking clinical data standards
Linking clinical data standards
 
Linked data in pharma
Linked data in pharmaLinked data in pharma
Linked data in pharma
 
Mobile Newsmaking
Mobile NewsmakingMobile Newsmaking
Mobile Newsmaking
 
Metadata in general and Dublin Core in specific; some experiences
Metadata in general and Dublin Core in specific; some experiencesMetadata in general and Dublin Core in specific; some experiences
Metadata in general and Dublin Core in specific; some experiences
 
Extensible use of RDF
Extensible use of RDFExtensible use of RDF
Extensible use of RDF
 

Dernier

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 

Dernier (20)

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 

Linked data in pharma R&D

  • 1. Linked Data, an opportunity to mitigate complexity pharmaceutical research and development Bo Andersson Kerstin Forsberg AstraZeneca R&D AstraZeneca R&D 221 87 LUND 431 83 MÖLNDAL +46 46 337621 +46 31 7065318 bo.h.andersson@astrazeneca.com kerstin.l.forsberg@astrazeneca.com ABSTRACT Table 1. SemanticWeb standards In the paper we present opportunities and challenges in applying Resource Description Framework (RDF) to model linked data principles to improve the research utility of data to statements as so called RDF-triples of subject/predicate/object, mitigate complexity in pharmaceutical research and development. e.g. clinical study/active ingredient/omeprazole, often encoded Association and interpretation of the data needed for research and using the RDF/XML format. development in a pharmaceutical company has become a too Universal Resource Identifiers (URI:s) to globally identify complex task for individual scientists, or even teams to handle. entities, such as proteins, organ systems, clinical studies, e.g. Scientists need data to be organized for associations, prepared for http//data.pharma.com/id/clinicalstudy/D8180C00011, and also not yet defined use and ready for automation where computers can data content such as clinical observation results. work alongside the scientists. Web Ontology Language (OWL) to express vocabularies of terms representing identified entities and relationships in Keywords portions of reality, e.g. the terms ‘active ingredient’ and linked data, semantic web, pharmaceutical research ‘pharmaceutical product’ for two of the entities identified in the Translational Medicine Ontology (TMO). 1. INTRODUCTION Simple Knowledge Organization System (SKOS) to express During the WWW2007 conference a breakthrough of the Linked taxonomies and controlled vocabularies, e.g. the terms Data idea happened in a session where web experts demonstrated ‘tauopathies’ and ‘dementia’ as broader terms than ‘alzheimer the power of a new generation of the web, a web of data. For us disease’ in the Medical Subject Headings (MeSH). attending the session it was hard to imagine the full potential on what this idea would mean for individual scientists and for a Furthermore, access to data in a standard model is not enough, but pharmaceutical company. relationships among data must also be made available as RDF In the Large Knowledge Collider (LarKC) project scientists triples to create a web of data. This collection of associated working in early clinical development have identified [1] how datasets can also be referred to as Linked Data. successful research and development rely on scientists having A typical case of a large linked dataset is DBpedia, which makes access to data and ability to utilize it, to handle uncertainty and to the structured content of Wikipedia available as RDF triples. The prepare for the unexpected. To meet these challenges we have beauty of this is not only that Wikipedia data becomes available identified three key enablers: linked data principles, semantic web as linked data, but also that it connects to other datasets publicly standards and open ontologies. available such as Drug Bank. For example, the entity identified One activity in the W3C interest group for semantic web in Health with the URI - http://dbpedia.org/resource/Omeprazole, is linked Care and Life Science (HCLS) has been to link publicly available to the descriptions of the same entity in Drug Bank, also made datasets [2] such as Drug Bank and ClinicalTrial.gov. These available as linked data. datasets are now part of the growing Linked Open Data cloud. An This means that facts available in the two different datasets can be important learning from the collaborative work across health care combined. For example the name Losec, and the 70+ other brand organizations, pharmaceutical companies and academia groups is names for Omeprazole, from Drug Bank can be integrated by that we all need the same datasets, however for different decisions different applications and can also be directly queried, e.g. when and different types of applications. searching for brand names for drugs categorized in Wikipedia as essential medicines by WHO. 2. LINKED DATA FOR PHARMA A federated knowledge repository of data, built with Semantic 3. OPPORTUNTIES FOR PHARMA Web standards, is a new generation of knowledge base beyond Scientists routinely utilize publicly available documents as a separated document repositories. Gaining value from the complement to data in internal and licensed sources. Most of the federation of knowledge requires data to be available using a scientists have preferred sources they utilize to gain knowledge standard model, of so called RDF triples. from and to associate with the data relevant for their decisions and questions. The knowledge mining is often done for individual tasks, and in many cases done manually by scientists together Copyright is held by the author/owner(s). with informatics and library experts. LWDM 2011, March 25, 2011, Uppsala, Sweden. Imagine an environment where internal data easily connects with Copyright 2011 ACM 978-1-4503-0608-9/11/03 ...$10.00. external datasets from different sources. The associations made,
  • 2. do make use of relationships developed by experts, in terms of so should be balanced with utilization of more mundane and relaxed called ontologies, and thereby help scientists find answerers not relationships as for example from DBpedia, a source humans are obvious today. While broaden the scientific spectrum where comfortable to use. individual scientists find their answers, this would also stimulate The research utility of shared datasets will also require explicit more cross scientific sharing and establish a foundation for provenanceiii information to make it possible for scientists and improved computational support in research and development. computers to make trust judgments about the datasets. We therefore propose the prospective use of linked data principles Linked datasets also needs to be described with a vocabulary that as a component of the information infrastructure for allow for automation of discovery, selection and association. The pharmaceutical research and development, to make sure internal Vocabulary of Interlinked Datasets (void)iv standard allows data connects to external data. This is a step towards an formally describing shared datasets and thereby improving the information infrastructure where data is prepared for association, research utility. for not yet defined use, and for facilitating computers to function alongside scientists to mitigate the complexity. A pragmatic approach for linked data management is needed that motivates people to constructively face the tension between 4. CHALLENGES FOR PHARMA different needs. This must be an iterative process where we From internal semantic web projects, by participation in the continuously improve the research utility of shared datasets by LarKCi and in W3C HCLSii since 2006 we have built experience applying three key enablers: in linking data. Although integration of disparate datasets today is  Linked Data principles, including establishing global easily done by experts, there are still more to do. We need to namespaces of URI:s reach a maturity level where datasets is routinely published in a  Semantic web standards, also for provenance data and way that automatically associates them with other datasets. dataset descriptions The Linking Open Drug Data (LODD) team in W3C HCLS  Open ontologies, with the required level of precision revealed various challenges during the work to connect public and consistency available datasets such as DrugBank and ClinicalTrial.gov. Many And, also a process where computers are used to work alongside of them relate to technology but a key aspect is peoples’ scientists to automatically: willingness to constructively face the tension of opposing needs to  Identify entities and assign URI:s improve the research utility of shared datasets.  Structure data and capture provenance data Relationships are a significant challenge for shared datasets in  Mine for causal relationships and inconsistency pharmaceutical research and development. We often have a strong prevalence of terminology conflicts, synonyms, and homonyms. For instance, the word ‘drug’ can refer to the whole REFERENCES pharmaceutical product, or just the active ingredient. [1] B. Andersson, V. Momtchev, Requirements summary and Biomedical ontologies represent what entities exist in portions of data repository, Large Knowledge Collider (LarKC), 2008. the biological and clinical reality, and how they relate to each http://www.larkc.eu/wp-content/uploads/2008/01/larkc_d7a- other. One such ontology for translational medicine (TMO) [3] is 11_requirements-summary-and-data-repository_m6.pdf developed by a W3C HCLS team with representatives from [2] A. Jentzsch, et.al.. Enabling Tailored Therapeutics with pharmaceutical companies, health care providers, National Centre Linked Data. In Proceedings of the WWW2009 workshop on for Biomedical Ontology (NCBO) and academia groups. TMO Linked Data on the Web (LDOW2009), 2009. structure the different meanings of ‘drug’, by identifying different http://esw.w3.org/images/5/57/HCLSIG%24Final_ldow2009 types of entities and making relationships between them explicit: .pdf ’Molecular entity’ for single molecule kinds. ‘Active ingredient’ [3] M. Dumontier, et.al.. The Translational Medicine Ontology: as a role played by a chemical substance which has the disposition Driving personalized medicine by bridging the gap from to treat a certain disease and is part of a ‘pharmaceutical bedside to bench. In Proceedings of the 13th Annual Bio- formulation’ for biologically active chemicals. A ‘Pharmaceutical Ontologies Meeting, Boston, USA (Bio-Ontologies 2010), product’ is a formulated pharmaceutical that has been approved. 2010. http://dumontierlab.com/pdf/2010_BIOONT_TMO.pdf The difficulty in using precise and consistent defined relationships in ontologies, such as the TMO, for improved research utility i http://www.larkc.eu/ ii http://www.w3.org/2001/sw/hcls/ iii http://www.w3.org/2005/Incubator/prov/XGR-prov/ iv http://www.w3.org/TR/2011/NOTE-void-20110303/