SlideShare a Scribd company logo
1 of 24
Download to read offline
Broadening the Scope of Nanopublications
Tobias Kuhn,1,2 Paolo Emilio Barbano,3 Mate Levente Nagy,4
Michael Krauthammer4,1
1Department of Pathology, Yale University
2Chair of Sociology, in particular of Modeling and Simulation, ETH Zurich
3Department of Mathematics, Yale University
4Program for Computational Biology and Bioinformatics, Yale University
ESWC 2013, Montpellier (France)
29 May 2013
Motivation
The key problem of the current system of scholarly communication is
that it is centered around narrative articles:
• They are good for individual consumption by human beings but
very bad for aggregation or automatic processing
• They are very ineffective for sharing scientific information,
especially in data-intensive sciences
• There are no rewards for providing, sharing, and maintaining
datasets, software, and other digital artifacts
The current system is slow and inefficient.
Nanopublications have been proposed to solve these problems: they
are minimal portions of scientific contributions in RDF.
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 2 / 24
Vision: Changing Scholarly Communication
Now
Narrative articles at the center
Future
Nanopublications at the center
Images from Mons et al. The value of data. Nature genetics, 43(4):281–283, 2011
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 3 / 24
Structure of Nanopublications
Nanopub0001
Assertion:
opm:wasDerivedFrom d:DataSourceX
cito:cites n:nanopub0042
dc:created “2013-01-01”
pav:createdBy p:Isabelle_Dubois
dc:isPartOf c:NanoPubCollection1
Provenance:
ns1:mosquito ns2:malaria
ns3:transmission
Assertion:
• Formalized scientific claim (or hypothesis)
Provenance:
• Reference to article, experimental methods, etc.
• Who published it when and how
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 4 / 24
nanobrowser: Classical Nanopublication
http://nanobrowser.inn.ac
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 5 / 24
Ocean of Nanopublications
authors
users
curators
structured
data sources
unstructured
data sources
ocean of
nanopublications
nanopublication
portals
in silico
experiments
network
analyses
aggregated
views
hypothesis
generation
...bots
1
2
3
4
5
6
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 6 / 24
Proposed Extension 1:
Informal Assertions
Nanopub0012
Assertion:
opm:wasDerivedFrom d:DataSourceX
cito:cites n:nanopub0042
dc:created “2013-01-01”
pav:createdBy p:Isabelle_Dubois
dc:isPartOf c:NanoPubCollection1
Provenance:
Malaria is transmitted by mosquitoes.
Assertion:
• Informal English sentence
• Sentences are independent entities and represented by URIs:
http://purl.org/aida/Malaria+is+transmitted+by+mosquitoes.
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 7 / 24
Levels of Formalization
• Informal (only an AIDA sentence)
• Underspecified (formal representation for part of the sentence)
• Fully formal (formal representation for the complete sentence)
Malaria is transmitted by mosquitoes.
Malaria is transmitted by mosquitoes.
ns1:mosquito
ns3:transmission
ns1:mosquito
Malaria is transmitted by mosquitoes.
ns2:malaria
ns3:transmission
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 8 / 24
Proposed Extension 2:
Non-Scientific Assertions
Nanopub0042
Assertion:
dc:created “2013-05-01”
pav:createdBy p:Giuseppe
Provenance:
p:Giuseppe npx:disagrees
a:Malaria+is+transmitted+by+...
Assertion:
• Meta-statement or other non-scientific assertion
• Can include opinions, social relations, introduction of new
entities, ...
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 9 / 24
nanobrowser: Nanopublication with Sentence
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 10 / 24
Approach
Related existing approaches: SWAN, EXPO, GeneRIF, BEL
Our approach differs from these approaches in the following respects:
• Very broad application area (science as a whole and beyond)
• Sentences exist independently from authors (no ownership of
sentences)
• Use of a controlled natural language
• Continuum from informal over underspecified to fully formal
statements
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 11 / 24
AIDA Sentences
To fit into the nanopublication concept, the sentences of our
approach should be AIDA:
• Atomic: a sentence describing one thought that cannot be
further broken down in a practical way
• Independent: a sentence that can stand on its own, without
external references like “this effect” or “we”
• Declarative: a complete sentence ending with a full stop that
could in theory be either true or false
• Absolute: a sentence describing the core of a claim ignoring the
uncertainty about its truth and how it was discovered (no
“probably” or “evaluation showed”); typically in present tense
Example
The majority of patients with idiopathic REM sleep behavior disorder who develop
a neurodegenerative disease develop Parkinson disease and Lewy body dementia.
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 12 / 24
Linking Scientific Claims
ns1:mosquito
Malaria is transmitted by mosquitoes.
ns2:malaria
ns3:transmission
Possible relations:
• [CLAIM] is equivalent to / contradicts / is similar to [CLAIM]
• [PERSON] agrees with / disagrees with / challenges [CLAIM]
• [STUDY] provides (counter-)evidence for [CLAIM]
These relations can be published as nanopublications too!
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 13 / 24
nanobrowser: Opinions and Sentence Relations
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 14 / 24
Publishing AIDA-Nanopubs
http://nanobrowser.inn.ac/publish
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 15 / 24
Evaluation
authors
users
curators
structured
data sources
unstructured
data sources
ocean of
nanopublications
nanopublication
portals
in silico
experiments
network
analyses
aggregated
views
hypothesis
generation
...bots
1
2
3
4
5
6
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 16 / 24
Evaluation of AIDA-Nanopublications
So far, the following aspects have been evaluated:
• How well can authors or curators express scientific findings as
AIDA sentences? (channels 1 and 3)
• How well can AIDA sentences be automatically extracted from
existing text sources? (channel 4)
• How well can similar AIDA sentences be automatically clustered?
(channel 6)
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 17 / 24
User Study Design
Questionnaire-style online study:
• 16 participants: Scientists with strong background in biology
and/or medicine
• Five short texts (one or two sentences) from conclusion sections
of structured abstracts of PubMed articles
• Task: Rewrite each short text as one or more AIDA sentences
• Brief explanation of the task and the AIDA concept
Example
Original text: The results of this study showed that the hepatic reticuloendothelial
function is impaired in cirrhotic patients, but the degree of impairment does not
differ between patients with and without previous history of SBP.
AIDA 1: The hepatic reticuloendothelial function is impaired in cirrhotic patients.
AIDA 2: The degree of hepatic reticuloendothelial function impairment does not
differ between cirrhotic patients with and without previous history of SBP.
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 18 / 24
User Study Results
Quality of the sentences created within the user study:
163total 100%
114perfect 70%
7typo etc. 4%
10inaccurate 6%
32not AIDA 20%
4not atomic 2%
5not independent 3%
3not declarative 2%
25not absolute 15%
0 20 40 60 80 100 120 140 160
sentences
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 19 / 24
Automatic Extraction
Evaluation of automatic extraction of AIDA-nanopubs:
• We used the GeneRIF dataset, which contains sentences
describing the functions of genes and proteins
• Simple regular expressions to filter out sentences that are not
AIDA-compliant
• Simple transformations on the sentences, such as dropping
certain initial phrases
Example
Original GeneRIF sentence: We have established that LadC plays an important
role in L. pneumophila infection.
Extracted AIDA sentence: LadC plays an important role in L. pneumophila
infection.
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 20 / 24
Automatic Extraction Results
Quality of the sentences extracted from the GeneRIF dataset:
250total 100%
177perfect 71%
8typo etc. 3%
65not AIDA 26%
34not atomic 14%
1not independent 0%
15not declarative 6%
30not absolute 12%
0 25 50 75 100 125 150 175 200 225 250
sentences
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 21 / 24
Automatic Clustering
Evaluation of automatic clustering:
• Sentences extracted from GeneRIF: 119 088 unique sentences
• Sentences from user study: 94 unique sentences from five tasks
• Results: Sentences from a user study task were clustered almost
exclusively (99.2%) with other sentences from the same task
Example
Hepatic reticuloendothelial function is impaired to the same degree in cirrhotic
patients with or without a previous history of SBP.
History of spontaneous bacterial peritonitis does not affect impairment of hepatic
reticuloendothelial function in cirrhotic patients.
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 22 / 24
Conclusions
As AIDA sentences:
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 23 / 24
Thank you for your Attention!
Questions?
Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 24 / 24

More Related Content

Viewers also liked

Digital Marketing Analytics talk given at Southern New Hampshire University
Digital Marketing Analytics talk given at Southern New Hampshire UniversityDigital Marketing Analytics talk given at Southern New Hampshire University
Digital Marketing Analytics talk given at Southern New Hampshire UniversityLoren Foxx
 
Boosting Productivity through Successful Engaging SharePoint Portals by Joel ...
Boosting Productivity through Successful Engaging SharePoint Portals by Joel ...Boosting Productivity through Successful Engaging SharePoint Portals by Joel ...
Boosting Productivity through Successful Engaging SharePoint Portals by Joel ...Joel Oleson
 
Platforming: Innovating Information Systems in the Google Age
Platforming: Innovating Information Systems in the Google AgePlatforming: Innovating Information Systems in the Google Age
Platforming: Innovating Information Systems in the Google AgeThoughtworks
 
A Multidimensional Classification of 55 Enterprise Architecture Frameworks
A Multidimensional Classification of 55 Enterprise Architecture FrameworksA Multidimensional Classification of 55 Enterprise Architecture Frameworks
A Multidimensional Classification of 55 Enterprise Architecture FrameworksCapgemini
 
Community Food Assessment: A Piece by Piece Approach
Community Food Assessment: A Piece by Piece ApproachCommunity Food Assessment: A Piece by Piece Approach
Community Food Assessment: A Piece by Piece Approachesheehancastro
 
Guia de estudio - sist. de igualacion , sustitucion y reduccion
Guia de estudio - sist. de igualacion , sustitucion y reduccionGuia de estudio - sist. de igualacion , sustitucion y reduccion
Guia de estudio - sist. de igualacion , sustitucion y reduccioncristianacuna
 
I silenzi di de magistris de magistris tace sul fratello -non me ne occupo io-
I silenzi di de magistris   de magistris tace sul fratello -non me ne occupo io-I silenzi di de magistris   de magistris tace sul fratello -non me ne occupo io-
I silenzi di de magistris de magistris tace sul fratello -non me ne occupo io-Daniela Petrecca
 
Ipsos MORI Scotland: Public Opinion Monitor June 2012
Ipsos MORI Scotland: Public Opinion Monitor June 2012Ipsos MORI Scotland: Public Opinion Monitor June 2012
Ipsos MORI Scotland: Public Opinion Monitor June 2012Ipsos UK
 
3 d pie chart circular with hole in center 11 stages powerpoint diagrams and ...
3 d pie chart circular with hole in center 11 stages powerpoint diagrams and ...3 d pie chart circular with hole in center 11 stages powerpoint diagrams and ...
3 d pie chart circular with hole in center 11 stages powerpoint diagrams and ...SlideTeam.net
 
PubCon 2016 Jordan Koene
PubCon 2016 Jordan KoenePubCon 2016 Jordan Koene
PubCon 2016 Jordan Koenejtkoene
 
Introducción a la compilación en ambientes Unix.
Introducción a la compilación en ambientes Unix.Introducción a la compilación en ambientes Unix.
Introducción a la compilación en ambientes Unix.Luis Claudio Pérez Tato
 

Viewers also liked (19)

Europa sangría
Europa sangríaEuropa sangría
Europa sangría
 
Indicadores tercer periodo
Indicadores tercer periodoIndicadores tercer periodo
Indicadores tercer periodo
 
Digital Marketing Analytics talk given at Southern New Hampshire University
Digital Marketing Analytics talk given at Southern New Hampshire UniversityDigital Marketing Analytics talk given at Southern New Hampshire University
Digital Marketing Analytics talk given at Southern New Hampshire University
 
Boosting Productivity through Successful Engaging SharePoint Portals by Joel ...
Boosting Productivity through Successful Engaging SharePoint Portals by Joel ...Boosting Productivity through Successful Engaging SharePoint Portals by Joel ...
Boosting Productivity through Successful Engaging SharePoint Portals by Joel ...
 
战略沙盘
战略沙盘战略沙盘
战略沙盘
 
Platforming: Innovating Information Systems in the Google Age
Platforming: Innovating Information Systems in the Google AgePlatforming: Innovating Information Systems in the Google Age
Platforming: Innovating Information Systems in the Google Age
 
A Multidimensional Classification of 55 Enterprise Architecture Frameworks
A Multidimensional Classification of 55 Enterprise Architecture FrameworksA Multidimensional Classification of 55 Enterprise Architecture Frameworks
A Multidimensional Classification of 55 Enterprise Architecture Frameworks
 
Community Food Assessment: A Piece by Piece Approach
Community Food Assessment: A Piece by Piece ApproachCommunity Food Assessment: A Piece by Piece Approach
Community Food Assessment: A Piece by Piece Approach
 
thank"s
thank"sthank"s
thank"s
 
Caitlin Woltje Resume
Caitlin Woltje ResumeCaitlin Woltje Resume
Caitlin Woltje Resume
 
Guia de estudio - sist. de igualacion , sustitucion y reduccion
Guia de estudio - sist. de igualacion , sustitucion y reduccionGuia de estudio - sist. de igualacion , sustitucion y reduccion
Guia de estudio - sist. de igualacion , sustitucion y reduccion
 
resume
resumeresume
resume
 
I silenzi di de magistris de magistris tace sul fratello -non me ne occupo io-
I silenzi di de magistris   de magistris tace sul fratello -non me ne occupo io-I silenzi di de magistris   de magistris tace sul fratello -non me ne occupo io-
I silenzi di de magistris de magistris tace sul fratello -non me ne occupo io-
 
Ipsos MORI Scotland: Public Opinion Monitor June 2012
Ipsos MORI Scotland: Public Opinion Monitor June 2012Ipsos MORI Scotland: Public Opinion Monitor June 2012
Ipsos MORI Scotland: Public Opinion Monitor June 2012
 
3 d pie chart circular with hole in center 11 stages powerpoint diagrams and ...
3 d pie chart circular with hole in center 11 stages powerpoint diagrams and ...3 d pie chart circular with hole in center 11 stages powerpoint diagrams and ...
3 d pie chart circular with hole in center 11 stages powerpoint diagrams and ...
 
PubCon 2016 Jordan Koene
PubCon 2016 Jordan KoenePubCon 2016 Jordan Koene
PubCon 2016 Jordan Koene
 
Proceso de decisión markoviano
Proceso de decisión markovianoProceso de decisión markoviano
Proceso de decisión markoviano
 
Introducción a la compilación en ambientes Unix.
Introducción a la compilación en ambientes Unix.Introducción a la compilación en ambientes Unix.
Introducción a la compilación en ambientes Unix.
 
Indicadores tercer periodo tecnologia
Indicadores tercer periodo tecnologiaIndicadores tercer periodo tecnologia
Indicadores tercer periodo tecnologia
 

Similar to Broadening the Scope of Nanopublications

Convergence of Occupational and Environmental Exposure Science: the Whole Pic...
Convergence of Occupational and Environmental Exposure Science: the Whole Pic...Convergence of Occupational and Environmental Exposure Science: the Whole Pic...
Convergence of Occupational and Environmental Exposure Science: the Whole Pic...Retired
 
Scott Edmunds ICIS talk at UC Davis: Open Publishing for the Big Data era
Scott Edmunds ICIS talk at UC Davis: Open Publishing for the Big Data eraScott Edmunds ICIS talk at UC Davis: Open Publishing for the Big Data era
Scott Edmunds ICIS talk at UC Davis: Open Publishing for the Big Data eraGigaScience, BGI Hong Kong
 
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...GigaScience, BGI Hong Kong
 
January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...
January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...
January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...DeVonne Parks, CEM
 
Hamburg_Declaration_22_02_2022.pdf
Hamburg_Declaration_22_02_2022.pdfHamburg_Declaration_22_02_2022.pdf
Hamburg_Declaration_22_02_2022.pdfMartin Hoffmann
 
OA Week 2012 Miami U: How Open Scholarship is Changing Research
OA Week 2012 Miami U: How Open Scholarship is Changing ResearchOA Week 2012 Miami U: How Open Scholarship is Changing Research
OA Week 2012 Miami U: How Open Scholarship is Changing ResearchWilliam Gunn
 
Scott Edmunds, HKU Open Access Week: Experiences from the front-line of Open ...
Scott Edmunds, HKU Open Access Week: Experiences from the front-line of Open ...Scott Edmunds, HKU Open Access Week: Experiences from the front-line of Open ...
Scott Edmunds, HKU Open Access Week: Experiences from the front-line of Open ...GigaScience, BGI Hong Kong
 
Grand round whsiao_may2015
Grand round whsiao_may2015Grand round whsiao_may2015
Grand round whsiao_may2015IRIDA_community
 
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality?  - William HsiaoHow Can We Make Genomic Epidemiology a Widespread Reality?  - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality? - William HsiaoWilliam Hsiao
 
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...David Talby
 
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...GigaScience, BGI Hong Kong
 
Tracing the paths between concepts in large bio medical corpora
Tracing the paths between concepts in large bio medical corporaTracing the paths between concepts in large bio medical corpora
Tracing the paths between concepts in large bio medical corporaUniversity Politehnica Bucharest
 
Open Notebook Science and Malaria
Open Notebook Science and MalariaOpen Notebook Science and Malaria
Open Notebook Science and MalariaJean-Claude Bradley
 
Open Data HK: open science meets open data. A primer from Scott Edmunds
Open Data HK: open science meets open data. A primer from Scott EdmundsOpen Data HK: open science meets open data. A primer from Scott Edmunds
Open Data HK: open science meets open data. A primer from Scott EdmundsScott Edmunds
 
Cómo distinguir una investigación seria de una fraudulenta
Cómo distinguir una investigación seria de una fraudulentaCómo distinguir una investigación seria de una fraudulenta
Cómo distinguir una investigación seria de una fraudulentaantenasysalud
 
Why ContentMining is useful
Why ContentMining is usefulWhy ContentMining is useful
Why ContentMining is usefulTheContentMine
 
Why ContentMining is useful
Why ContentMining is usefulWhy ContentMining is useful
Why ContentMining is usefulpetermurrayrust
 
WGS in public health microbiology - MDU/VIDRL Seminar - wed 17 jun 2015
WGS in public health microbiology - MDU/VIDRL Seminar - wed 17 jun 2015WGS in public health microbiology - MDU/VIDRL Seminar - wed 17 jun 2015
WGS in public health microbiology - MDU/VIDRL Seminar - wed 17 jun 2015Torsten Seemann
 

Similar to Broadening the Scope of Nanopublications (20)

Convergence of Occupational and Environmental Exposure Science: the Whole Pic...
Convergence of Occupational and Environmental Exposure Science: the Whole Pic...Convergence of Occupational and Environmental Exposure Science: the Whole Pic...
Convergence of Occupational and Environmental Exposure Science: the Whole Pic...
 
Scott Edmunds ICIS talk at UC Davis: Open Publishing for the Big Data era
Scott Edmunds ICIS talk at UC Davis: Open Publishing for the Big Data eraScott Edmunds ICIS talk at UC Davis: Open Publishing for the Big Data era
Scott Edmunds ICIS talk at UC Davis: Open Publishing for the Big Data era
 
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...
 
January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...
January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...
January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...
 
StudyDesign.ppt
StudyDesign.pptStudyDesign.ppt
StudyDesign.ppt
 
Hamburg_Declaration_22_02_2022.pdf
Hamburg_Declaration_22_02_2022.pdfHamburg_Declaration_22_02_2022.pdf
Hamburg_Declaration_22_02_2022.pdf
 
OA Week 2012 Miami U: How Open Scholarship is Changing Research
OA Week 2012 Miami U: How Open Scholarship is Changing ResearchOA Week 2012 Miami U: How Open Scholarship is Changing Research
OA Week 2012 Miami U: How Open Scholarship is Changing Research
 
Scott Edmunds, HKU Open Access Week: Experiences from the front-line of Open ...
Scott Edmunds, HKU Open Access Week: Experiences from the front-line of Open ...Scott Edmunds, HKU Open Access Week: Experiences from the front-line of Open ...
Scott Edmunds, HKU Open Access Week: Experiences from the front-line of Open ...
 
Abstracts
AbstractsAbstracts
Abstracts
 
Grand round whsiao_may2015
Grand round whsiao_may2015Grand round whsiao_may2015
Grand round whsiao_may2015
 
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality?  - William HsiaoHow Can We Make Genomic Epidemiology a Widespread Reality?  - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
 
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
 
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...
 
Tracing the paths between concepts in large bio medical corpora
Tracing the paths between concepts in large bio medical corporaTracing the paths between concepts in large bio medical corpora
Tracing the paths between concepts in large bio medical corpora
 
Open Notebook Science and Malaria
Open Notebook Science and MalariaOpen Notebook Science and Malaria
Open Notebook Science and Malaria
 
Open Data HK: open science meets open data. A primer from Scott Edmunds
Open Data HK: open science meets open data. A primer from Scott EdmundsOpen Data HK: open science meets open data. A primer from Scott Edmunds
Open Data HK: open science meets open data. A primer from Scott Edmunds
 
Cómo distinguir una investigación seria de una fraudulenta
Cómo distinguir una investigación seria de una fraudulentaCómo distinguir una investigación seria de una fraudulenta
Cómo distinguir una investigación seria de una fraudulenta
 
Why ContentMining is useful
Why ContentMining is usefulWhy ContentMining is useful
Why ContentMining is useful
 
Why ContentMining is useful
Why ContentMining is usefulWhy ContentMining is useful
Why ContentMining is useful
 
WGS in public health microbiology - MDU/VIDRL Seminar - wed 17 jun 2015
WGS in public health microbiology - MDU/VIDRL Seminar - wed 17 jun 2015WGS in public health microbiology - MDU/VIDRL Seminar - wed 17 jun 2015
WGS in public health microbiology - MDU/VIDRL Seminar - wed 17 jun 2015
 

More from Tobias Kuhn

Nanopublications and Decentralized Publishing
Nanopublications and Decentralized PublishingNanopublications and Decentralized Publishing
Nanopublications and Decentralized PublishingTobias Kuhn
 
Linked Data Publishing with Nanopublications
Linked Data Publishing with NanopublicationsLinked Data Publishing with Nanopublications
Linked Data Publishing with NanopublicationsTobias Kuhn
 
Genuine semantic publishing
Genuine semantic publishingGenuine semantic publishing
Genuine semantic publishingTobias Kuhn
 
A Decentralized Approach to Dissemination, Retrieval, and Archiving of Data
A Decentralized Approach to Dissemination, Retrieval, and Archiving of DataA Decentralized Approach to Dissemination, Retrieval, and Archiving of Data
A Decentralized Approach to Dissemination, Retrieval, and Archiving of DataTobias Kuhn
 
The Controlled Natural Language of Randall Munroe’s Thing Explainer
The Controlled Natural Language of Randall Munroe’s Thing Explainer The Controlled Natural Language of Randall Munroe’s Thing Explainer
The Controlled Natural Language of Randall Munroe’s Thing Explainer Tobias Kuhn
 
Publishing without Publishers: a Decentralized Approach to Dissemination, Ret...
Publishing without Publishers: a Decentralized Approach to Dissemination, Ret...Publishing without Publishers: a Decentralized Approach to Dissemination, Ret...
Publishing without Publishers: a Decentralized Approach to Dissemination, Ret...Tobias Kuhn
 
nanopub-java: A Java Library for Nanopublications
nanopub-java: A Java Library for Nanopublicationsnanopub-java: A Java Library for Nanopublications
nanopub-java: A Java Library for NanopublicationsTobias Kuhn
 
Semantic Publishing and Nanopublications
Semantic Publishing and NanopublicationsSemantic Publishing and Nanopublications
Semantic Publishing and NanopublicationsTobias Kuhn
 
Scientific Data Publishing
Scientific Data PublishingScientific Data Publishing
Scientific Data PublishingTobias Kuhn
 
A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...
A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...
A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...Tobias Kuhn
 
Science Bots: A Model for the Future of Scientific Computation?
Science Bots: A Model for the Future of Scientific Computation?Science Bots: A Model for the Future of Scientific Computation?
Science Bots: A Model for the Future of Scientific Computation?Tobias Kuhn
 
Data Publishing and Post-Publication Reviews
Data Publishing and Post-Publication ReviewsData Publishing and Post-Publication Reviews
Data Publishing and Post-Publication ReviewsTobias Kuhn
 
Semantic Publishing with Nanopublications
Semantic Publishing with Nanopublications Semantic Publishing with Nanopublications
Semantic Publishing with Nanopublications Tobias Kuhn
 
Meme Extraction from Corpora of Scientific Literature using Citation Networks
Meme Extraction from Corpora of Scientific Literature using Citation NetworksMeme Extraction from Corpora of Scientific Literature using Citation Networks
Meme Extraction from Corpora of Scientific Literature using Citation NetworksTobias Kuhn
 
A Multilingual Semantic Wiki Based on Controlled Natural Language
A Multilingual Semantic Wiki Based on Controlled Natural LanguageA Multilingual Semantic Wiki Based on Controlled Natural Language
A Multilingual Semantic Wiki Based on Controlled Natural LanguageTobias Kuhn
 
Citation Graph Analysis to Identify Memes in Scientific Literature
Citation Graph Analysis to Identify Memes in Scientific LiteratureCitation Graph Analysis to Identify Memes in Scientific Literature
Citation Graph Analysis to Identify Memes in Scientific LiteratureTobias Kuhn
 
Citation Graph Analysis to Identify Memes in Scientific Literature
Citation Graph Analysis to Identify Memes in Scientific LiteratureCitation Graph Analysis to Identify Memes in Scientific Literature
Citation Graph Analysis to Identify Memes in Scientific LiteratureTobias Kuhn
 
Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linke...
Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linke...Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linke...
Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linke...Tobias Kuhn
 
Automatische Übersetzung in einem multilingualen, semantischen Wiki
Automatische Übersetzung in einem multilingualen, semantischen WikiAutomatische Übersetzung in einem multilingualen, semantischen Wiki
Automatische Übersetzung in einem multilingualen, semantischen WikiTobias Kuhn
 

More from Tobias Kuhn (20)

Nanopublications and Decentralized Publishing
Nanopublications and Decentralized PublishingNanopublications and Decentralized Publishing
Nanopublications and Decentralized Publishing
 
Linked Data Publishing with Nanopublications
Linked Data Publishing with NanopublicationsLinked Data Publishing with Nanopublications
Linked Data Publishing with Nanopublications
 
Genuine semantic publishing
Genuine semantic publishingGenuine semantic publishing
Genuine semantic publishing
 
A Decentralized Approach to Dissemination, Retrieval, and Archiving of Data
A Decentralized Approach to Dissemination, Retrieval, and Archiving of DataA Decentralized Approach to Dissemination, Retrieval, and Archiving of Data
A Decentralized Approach to Dissemination, Retrieval, and Archiving of Data
 
The Controlled Natural Language of Randall Munroe’s Thing Explainer
The Controlled Natural Language of Randall Munroe’s Thing Explainer The Controlled Natural Language of Randall Munroe’s Thing Explainer
The Controlled Natural Language of Randall Munroe’s Thing Explainer
 
Publishing without Publishers: a Decentralized Approach to Dissemination, Ret...
Publishing without Publishers: a Decentralized Approach to Dissemination, Ret...Publishing without Publishers: a Decentralized Approach to Dissemination, Ret...
Publishing without Publishers: a Decentralized Approach to Dissemination, Ret...
 
nanopub-java: A Java Library for Nanopublications
nanopub-java: A Java Library for Nanopublicationsnanopub-java: A Java Library for Nanopublications
nanopub-java: A Java Library for Nanopublications
 
Semantic Publishing and Nanopublications
Semantic Publishing and NanopublicationsSemantic Publishing and Nanopublications
Semantic Publishing and Nanopublications
 
Scientific Data Publishing
Scientific Data PublishingScientific Data Publishing
Scientific Data Publishing
 
A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...
A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...
A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...
 
Science Bots: A Model for the Future of Scientific Computation?
Science Bots: A Model for the Future of Scientific Computation?Science Bots: A Model for the Future of Scientific Computation?
Science Bots: A Model for the Future of Scientific Computation?
 
Data Publishing and Post-Publication Reviews
Data Publishing and Post-Publication ReviewsData Publishing and Post-Publication Reviews
Data Publishing and Post-Publication Reviews
 
Semantic Publishing with Nanopublications
Semantic Publishing with Nanopublications Semantic Publishing with Nanopublications
Semantic Publishing with Nanopublications
 
Nanopubs
NanopubsNanopubs
Nanopubs
 
Meme Extraction from Corpora of Scientific Literature using Citation Networks
Meme Extraction from Corpora of Scientific Literature using Citation NetworksMeme Extraction from Corpora of Scientific Literature using Citation Networks
Meme Extraction from Corpora of Scientific Literature using Citation Networks
 
A Multilingual Semantic Wiki Based on Controlled Natural Language
A Multilingual Semantic Wiki Based on Controlled Natural LanguageA Multilingual Semantic Wiki Based on Controlled Natural Language
A Multilingual Semantic Wiki Based on Controlled Natural Language
 
Citation Graph Analysis to Identify Memes in Scientific Literature
Citation Graph Analysis to Identify Memes in Scientific LiteratureCitation Graph Analysis to Identify Memes in Scientific Literature
Citation Graph Analysis to Identify Memes in Scientific Literature
 
Citation Graph Analysis to Identify Memes in Scientific Literature
Citation Graph Analysis to Identify Memes in Scientific LiteratureCitation Graph Analysis to Identify Memes in Scientific Literature
Citation Graph Analysis to Identify Memes in Scientific Literature
 
Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linke...
Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linke...Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linke...
Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linke...
 
Automatische Übersetzung in einem multilingualen, semantischen Wiki
Automatische Übersetzung in einem multilingualen, semantischen WikiAutomatische Übersetzung in einem multilingualen, semantischen Wiki
Automatische Übersetzung in einem multilingualen, semantischen Wiki
 

Recently uploaded

AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 

Recently uploaded (20)

AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 

Broadening the Scope of Nanopublications

  • 1. Broadening the Scope of Nanopublications Tobias Kuhn,1,2 Paolo Emilio Barbano,3 Mate Levente Nagy,4 Michael Krauthammer4,1 1Department of Pathology, Yale University 2Chair of Sociology, in particular of Modeling and Simulation, ETH Zurich 3Department of Mathematics, Yale University 4Program for Computational Biology and Bioinformatics, Yale University ESWC 2013, Montpellier (France) 29 May 2013
  • 2. Motivation The key problem of the current system of scholarly communication is that it is centered around narrative articles: • They are good for individual consumption by human beings but very bad for aggregation or automatic processing • They are very ineffective for sharing scientific information, especially in data-intensive sciences • There are no rewards for providing, sharing, and maintaining datasets, software, and other digital artifacts The current system is slow and inefficient. Nanopublications have been proposed to solve these problems: they are minimal portions of scientific contributions in RDF. Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 2 / 24
  • 3. Vision: Changing Scholarly Communication Now Narrative articles at the center Future Nanopublications at the center Images from Mons et al. The value of data. Nature genetics, 43(4):281–283, 2011 Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 3 / 24
  • 4. Structure of Nanopublications Nanopub0001 Assertion: opm:wasDerivedFrom d:DataSourceX cito:cites n:nanopub0042 dc:created “2013-01-01” pav:createdBy p:Isabelle_Dubois dc:isPartOf c:NanoPubCollection1 Provenance: ns1:mosquito ns2:malaria ns3:transmission Assertion: • Formalized scientific claim (or hypothesis) Provenance: • Reference to article, experimental methods, etc. • Who published it when and how Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 4 / 24
  • 5. nanobrowser: Classical Nanopublication http://nanobrowser.inn.ac Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 5 / 24
  • 6. Ocean of Nanopublications authors users curators structured data sources unstructured data sources ocean of nanopublications nanopublication portals in silico experiments network analyses aggregated views hypothesis generation ...bots 1 2 3 4 5 6 Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 6 / 24
  • 7. Proposed Extension 1: Informal Assertions Nanopub0012 Assertion: opm:wasDerivedFrom d:DataSourceX cito:cites n:nanopub0042 dc:created “2013-01-01” pav:createdBy p:Isabelle_Dubois dc:isPartOf c:NanoPubCollection1 Provenance: Malaria is transmitted by mosquitoes. Assertion: • Informal English sentence • Sentences are independent entities and represented by URIs: http://purl.org/aida/Malaria+is+transmitted+by+mosquitoes. Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 7 / 24
  • 8. Levels of Formalization • Informal (only an AIDA sentence) • Underspecified (formal representation for part of the sentence) • Fully formal (formal representation for the complete sentence) Malaria is transmitted by mosquitoes. Malaria is transmitted by mosquitoes. ns1:mosquito ns3:transmission ns1:mosquito Malaria is transmitted by mosquitoes. ns2:malaria ns3:transmission Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 8 / 24
  • 9. Proposed Extension 2: Non-Scientific Assertions Nanopub0042 Assertion: dc:created “2013-05-01” pav:createdBy p:Giuseppe Provenance: p:Giuseppe npx:disagrees a:Malaria+is+transmitted+by+... Assertion: • Meta-statement or other non-scientific assertion • Can include opinions, social relations, introduction of new entities, ... Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 9 / 24
  • 10. nanobrowser: Nanopublication with Sentence Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 10 / 24
  • 11. Approach Related existing approaches: SWAN, EXPO, GeneRIF, BEL Our approach differs from these approaches in the following respects: • Very broad application area (science as a whole and beyond) • Sentences exist independently from authors (no ownership of sentences) • Use of a controlled natural language • Continuum from informal over underspecified to fully formal statements Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 11 / 24
  • 12. AIDA Sentences To fit into the nanopublication concept, the sentences of our approach should be AIDA: • Atomic: a sentence describing one thought that cannot be further broken down in a practical way • Independent: a sentence that can stand on its own, without external references like “this effect” or “we” • Declarative: a complete sentence ending with a full stop that could in theory be either true or false • Absolute: a sentence describing the core of a claim ignoring the uncertainty about its truth and how it was discovered (no “probably” or “evaluation showed”); typically in present tense Example The majority of patients with idiopathic REM sleep behavior disorder who develop a neurodegenerative disease develop Parkinson disease and Lewy body dementia. Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 12 / 24
  • 13. Linking Scientific Claims ns1:mosquito Malaria is transmitted by mosquitoes. ns2:malaria ns3:transmission Possible relations: • [CLAIM] is equivalent to / contradicts / is similar to [CLAIM] • [PERSON] agrees with / disagrees with / challenges [CLAIM] • [STUDY] provides (counter-)evidence for [CLAIM] These relations can be published as nanopublications too! Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 13 / 24
  • 14. nanobrowser: Opinions and Sentence Relations Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 14 / 24
  • 15. Publishing AIDA-Nanopubs http://nanobrowser.inn.ac/publish Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 15 / 24
  • 16. Evaluation authors users curators structured data sources unstructured data sources ocean of nanopublications nanopublication portals in silico experiments network analyses aggregated views hypothesis generation ...bots 1 2 3 4 5 6 Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 16 / 24
  • 17. Evaluation of AIDA-Nanopublications So far, the following aspects have been evaluated: • How well can authors or curators express scientific findings as AIDA sentences? (channels 1 and 3) • How well can AIDA sentences be automatically extracted from existing text sources? (channel 4) • How well can similar AIDA sentences be automatically clustered? (channel 6) Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 17 / 24
  • 18. User Study Design Questionnaire-style online study: • 16 participants: Scientists with strong background in biology and/or medicine • Five short texts (one or two sentences) from conclusion sections of structured abstracts of PubMed articles • Task: Rewrite each short text as one or more AIDA sentences • Brief explanation of the task and the AIDA concept Example Original text: The results of this study showed that the hepatic reticuloendothelial function is impaired in cirrhotic patients, but the degree of impairment does not differ between patients with and without previous history of SBP. AIDA 1: The hepatic reticuloendothelial function is impaired in cirrhotic patients. AIDA 2: The degree of hepatic reticuloendothelial function impairment does not differ between cirrhotic patients with and without previous history of SBP. Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 18 / 24
  • 19. User Study Results Quality of the sentences created within the user study: 163total 100% 114perfect 70% 7typo etc. 4% 10inaccurate 6% 32not AIDA 20% 4not atomic 2% 5not independent 3% 3not declarative 2% 25not absolute 15% 0 20 40 60 80 100 120 140 160 sentences Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 19 / 24
  • 20. Automatic Extraction Evaluation of automatic extraction of AIDA-nanopubs: • We used the GeneRIF dataset, which contains sentences describing the functions of genes and proteins • Simple regular expressions to filter out sentences that are not AIDA-compliant • Simple transformations on the sentences, such as dropping certain initial phrases Example Original GeneRIF sentence: We have established that LadC plays an important role in L. pneumophila infection. Extracted AIDA sentence: LadC plays an important role in L. pneumophila infection. Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 20 / 24
  • 21. Automatic Extraction Results Quality of the sentences extracted from the GeneRIF dataset: 250total 100% 177perfect 71% 8typo etc. 3% 65not AIDA 26% 34not atomic 14% 1not independent 0% 15not declarative 6% 30not absolute 12% 0 25 50 75 100 125 150 175 200 225 250 sentences Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 21 / 24
  • 22. Automatic Clustering Evaluation of automatic clustering: • Sentences extracted from GeneRIF: 119 088 unique sentences • Sentences from user study: 94 unique sentences from five tasks • Results: Sentences from a user study task were clustered almost exclusively (99.2%) with other sentences from the same task Example Hepatic reticuloendothelial function is impaired to the same degree in cirrhotic patients with or without a previous history of SBP. History of spontaneous bacterial peritonitis does not affect impairment of hepatic reticuloendothelial function in cirrhotic patients. Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 22 / 24
  • 23. Conclusions As AIDA sentences: Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 23 / 24
  • 24. Thank you for your Attention! Questions? Tobias Kuhn, Yale / ETH Zurich Broadening the Scope of Nanopublications 24 / 24