SlideShare une entreprise Scribd logo
1  sur  1
PubMed now indexes roughly 25 million articles and is growing by
more than a million per year. The scale of this “Big Knowledge”
repository renders traditional, article-based modes of user interaction
unsatisfactory, demanding new interfaces for integrating and
summarizing widely distributed knowledge. Natural language
processing (NLP) techniques coupled with rich user interfaces can
help meet this demand, providing end-users with enhanced views into
public knowledge, stimulating their ability to form new hypotheses.
Knowledge.Bio provides a Web interface for exploring the results from
text-mining PubMed. It works with subject, predicate, object
assertions (triples) extracted from individual abstracts and with
predicted statistical associations between pairs of concepts. While
agnostic to the NLP technology employed, the current implementation
is loaded with triples from the SemRep-generated SemmedDB
database and putative gene-disease pairs obtained using Leiden
University Medical Center’s ‘Implicitome’ technology.
Users of Knowledge.Bio begin by identifying a concept of interest using
text search. Once a concept is identified, associated triples and
concept-pairs are displayed in tables. These tables have text-based
and semantic filters to help refine the list of triples to relations of
interest. The user then selects relations for insertion into a personal
knowledge graph implemented using cytoscape.js. The graph is used
as a note-taking or ‘mind-mapping’ structure that can be saved offline
and then later reloaded into the application. Clicking on edges within
a graph or on the ‘evidence’ element of a triple displays the abstracts
where that relation was detected, thus allowing the user to judge the
veracity of the statement and to read the underlying articles.
Knowledge.Bio is a free, open-source application that can provide, deep,
personal, concise, shareable views into the “Big Knowledge”
scattered across the biomedical literature.
Application: http://knowledge.bio
Source code: https://bitbucket.org/sulab/kb1/
Abstract
References
[1] Ono, Keiichiro, Barry Demchak, and Trey Ideker. "Cytoscape tools for the
web age: D3. js and Cytoscape. js exporters." F1000Research 3 (2014).
[2] Kilicoglu, Halil, et al. "SemMedDB: a PubMed-scale repository of biomedical
semantic predications." Bioinformatics 28.23 (2012): 3158-3160.
[3] Rindflesch, Thomas C., and Marcelo Fiszman. "The interaction of domain
knowledge and linguistic structure in natural language processing: interpreting
hypernymic propositions in biomedical text." Journal of biomedical informatics
36.6 (2003): 462-477.
[4] Bodenreider, Olivier. "The unified medical language system (UMLS):
integrating biomedical terminology." Nucleic acids research 32.suppl 1 (2004):
D267-D270.
[5] Hettne KM, Thompson M, Van Haagen H, Van der Horst E, Kaliyaperumal R,
Mina E, Tatum Z, Laros JFJ, Van Mulligen EM, Schuemie M, Aten E, Shu Li
T, Bruskiewich R, Good BM, Su AI, Kors JA, Den Dunnen J, Van Ommen G,
Roos M, ìt Hoen PAC, Mons B, Schultes EA. The implicitome: a resource for
inferring gene-disease associations. Under review.
[6] https://github.com/BiosemanticsDotOrg/GeneDiseasePaper
[7] Swanson, Don R. "Medical literature as a potential source of new knowledge."
Bulletin of the Medical Library Association 78.1 (1990): 29.
Acknowledgements
NIGMS
GM089820
Benjamin M. Good, Ph.D.1; Richard M. Bruskiewich, Ph.D.2; Kenneth C. Huellas-Bruskiewicz2; Farzin Ahmed2; Andrew I. Su, Ph.D.1
1The Scripps Research Institute, La Jolla, CA, USA. 2STAR Informatics / Delphinai Corporation, Port Moody, BC, Canada
Knowledge.Bio: an Interactive Tool for Literature-based Discovery
Big Knowledge
Cytoscape.js mindmap [1] for charting semantic
relationships mined from the literature. User’s
create their own maps as they interact with the tool.
The maps are interactive, with each edge linked to
the evidence underlying it. Maps can be saved as
local json files, shared and reloaded into the
application.
NHGRI HG008015
Contact
@bgood bgood@scripps.edu
Evidence view. Shows original sentence contexts for
explicit triples along with links to view the associated
abstract and to view other triples mined from that
abstract. For implicit relations, clicking on ‘’show
evidence’ opens the ‘co-occurrence’ view so that the
user can examine the A-B and B-C connections.
Selecting edges in the map allows the
user to “show evidence” or to
remove the edge.
Table views. Concept relations are
presented in tables that may be
filtered for text or semantic types.
Concept search. The user begins with
a text search to identify a concept.
Once selected, the table views
provide access to related concepts
spread across the literature
• Knowledge.bio supports a concept-centric rather than document-centric
mode of interacting with the scientific literature
• It provides a way of using both statistically predicted, implicit associations
and explicit predications linked to specific sentences in the same application
• User-created concept maps can be saved, shared with others, and reloaded
into the application.
Key features
SemMedDB [2]
• Uses SemRep [3] to extract semantic predications (‘triples’) from
PubMed abstracts.
• The complete database contains more than 70 million predications.
• Utilizes properties such as ‘treats’, ‘causes’, ‘disrupts’, and ‘augments’
from the UMLS semantic network. [4]
Implicitome [5,6]
• Uses ‘concept profiles’ to identify implicit relations in the literature.
• Concept profiles are defined by weighted vectors of co-occurring concepts
• Concept profiles enable inference of relationships through Swanson’s ABC model [7]
• LUMC generated 204,072,376 ranked, implicit genes-diseases relations.
“Convulsive seizure was suppressed by
physostigmine (p less than 0.01) or 5-HTP (p less than
0.20). “
PMID 2893633
SemRep
Current sources of concept relations
Implicit connection. The edge linking Smith
Lemli Opitz Syndrome to CYP2R1 is
computationally inferred by the Implicitome.
Those concepts never co-occur in any abstract.
The explicit relations emanating from them provide
many hypothetical explanations for their
relationship.
Work in progress
• Conversion from current implementation as a MySQL-backed
Python/DJANGO application to a Java server implementation based on
NEO4J.
• Supporting “closed discovery” workflows by suggesting relations that
connect multiple input nodes.
• Integration with http://www.ndexbio.org for storing and sharing user-created
concept maps.
• Improving the facilitation of collaborative concept-map construction
• On-demand import of new concept-network sources such as
http://www.wikipathways.org
• Capturing user feedback for improving text-mining algorithms
0
200000
400000
600000
800000
1000000
1200000
1914 1934 1954 1974 1994 2014
Number of
biomedical
research
articles created
(as listed in
PubMed)
Year

Contenu connexe

Tendances

A semantic framework for biomedical image discovery
A semantic framework for biomedical image discoveryA semantic framework for biomedical image discovery
A semantic framework for biomedical image discovery
Syed Ahmad Chan Bukhari, PhD
 
Extracting Key Terms From Noisy and Multi-theme Documents
Extracting Key Terms From Noisy and Multi-theme DocumentsExtracting Key Terms From Noisy and Multi-theme Documents
Extracting Key Terms From Noisy and Multi-theme Documents
maria.grineva
 
Semantic enrichment and similarity approximation for biomedical sequence images
Semantic enrichment and similarity approximation for biomedical sequence imagesSemantic enrichment and similarity approximation for biomedical sequence images
Semantic enrichment and similarity approximation for biomedical sequence images
Syed Ahmad Chan Bukhari, PhD
 

Tendances (9)

NRNB Annual Report 2012
NRNB Annual Report 2012NRNB Annual Report 2012
NRNB Annual Report 2012
 
A Survey on Bioinformatics Tools
A Survey on Bioinformatics ToolsA Survey on Bioinformatics Tools
A Survey on Bioinformatics Tools
 
A semantic framework for biomedical image discovery
A semantic framework for biomedical image discoveryA semantic framework for biomedical image discovery
A semantic framework for biomedical image discovery
 
Extracting Key Terms From Noisy and Multi-theme Documents
Extracting Key Terms From Noisy and Multi-theme DocumentsExtracting Key Terms From Noisy and Multi-theme Documents
Extracting Key Terms From Noisy and Multi-theme Documents
 
Effective Extraction of Thematically Grouped Key Terms From Text
Effective Extraction of Thematically Grouped Key Terms From TextEffective Extraction of Thematically Grouped Key Terms From Text
Effective Extraction of Thematically Grouped Key Terms From Text
 
NRNB EAC Meeting 2012
NRNB EAC Meeting 2012NRNB EAC Meeting 2012
NRNB EAC Meeting 2012
 
Semantic enrichment and similarity approximation for biomedical sequence images
Semantic enrichment and similarity approximation for biomedical sequence imagesSemantic enrichment and similarity approximation for biomedical sequence images
Semantic enrichment and similarity approximation for biomedical sequence images
 
Overall Vision for NRNB: 2015-2020
Overall Vision for NRNB: 2015-2020Overall Vision for NRNB: 2015-2020
Overall Vision for NRNB: 2015-2020
 
Odsc 2018 detection_classification_of_fake_news_using_cnn_venkatraman
Odsc 2018 detection_classification_of_fake_news_using_cnn_venkatramanOdsc 2018 detection_classification_of_fake_news_using_cnn_venkatraman
Odsc 2018 detection_classification_of_fake_news_using_cnn_venkatraman
 

En vedette

From peer-reviewed to peer-reproduced: a role for research objects in scholar...
From peer-reviewed to peer-reproduced: a role for research objects in scholar...From peer-reviewed to peer-reproduced: a role for research objects in scholar...
From peer-reviewed to peer-reproduced: a role for research objects in scholar...
Alejandra Gonzalez-Beltran
 
Near field Technology
Near field TechnologyNear field Technology
Near field Technology
shrien_sahi
 

En vedette (18)

From peer-reviewed to peer-reproduced: a role for research objects in scholar...
From peer-reviewed to peer-reproduced: a role for research objects in scholar...From peer-reviewed to peer-reproduced: a role for research objects in scholar...
From peer-reviewed to peer-reproduced: a role for research objects in scholar...
 
ISMB Workshop 2014
ISMB Workshop 2014ISMB Workshop 2014
ISMB Workshop 2014
 
1 radar basic - part ii
1 radar basic - part ii1 radar basic - part ii
1 radar basic - part ii
 
NXP MIFARE Webinar: Innovation Road Map: Present Improved- Future Inside
NXP MIFARE Webinar: Innovation Road Map: Present Improved- Future Inside NXP MIFARE Webinar: Innovation Road Map: Present Improved- Future Inside
NXP MIFARE Webinar: Innovation Road Map: Present Improved- Future Inside
 
Wireless Patents for Standards & Applications 1Q 2015
Wireless Patents for Standards & Applications 1Q 2015Wireless Patents for Standards & Applications 1Q 2015
Wireless Patents for Standards & Applications 1Q 2015
 
NFC and consumers - Success factors and limitations in retail business - Flor...
NFC and consumers - Success factors and limitations in retail business - Flor...NFC and consumers - Success factors and limitations in retail business - Flor...
NFC and consumers - Success factors and limitations in retail business - Flor...
 
Near field Technology
Near field TechnologyNear field Technology
Near field Technology
 
3e jaars
3e jaars3e jaars
3e jaars
 
Near field communication (nfc) technology
Near field communication (nfc) technologyNear field communication (nfc) technology
Near field communication (nfc) technology
 
IMCI 2008 Edition by WHO
IMCI 2008 Edition by WHOIMCI 2008 Edition by WHO
IMCI 2008 Edition by WHO
 
Automação com clp (ladder)
Automação com clp (ladder)Automação com clp (ladder)
Automação com clp (ladder)
 
Principles of corrosion
Principles of corrosionPrinciples of corrosion
Principles of corrosion
 
Reprap
ReprapReprap
Reprap
 
Organiser son Doctorat
Organiser son DoctoratOrganiser son Doctorat
Organiser son Doctorat
 
Seminar Report on NFC
Seminar Report on NFCSeminar Report on NFC
Seminar Report on NFC
 
Pfsmet amazing rise of solid state lighting
Pfsmet   amazing rise of solid state lightingPfsmet   amazing rise of solid state lighting
Pfsmet amazing rise of solid state lighting
 
Dear NSA, let me take care of your slides.
Dear NSA, let me take care of your slides.Dear NSA, let me take care of your slides.
Dear NSA, let me take care of your slides.
 
STOP! VIEW THIS! 10-Step Checklist When Uploading to Slideshare
STOP! VIEW THIS! 10-Step Checklist When Uploading to SlideshareSTOP! VIEW THIS! 10-Step Checklist When Uploading to Slideshare
STOP! VIEW THIS! 10-Step Checklist When Uploading to Slideshare
 

Similaire à (Poster) Knowledge.Bio: an Interactive Tool for Literature-based Discovery

Statistical Analysis based Hypothesis Testing Method in Biological Knowledge ...
Statistical Analysis based Hypothesis Testing Method in Biological Knowledge ...Statistical Analysis based Hypothesis Testing Method in Biological Knowledge ...
Statistical Analysis based Hypothesis Testing Method in Biological Knowledge ...
ijcsa
 
Nlp based retrieval of medical information for diagnosis of human diseases
Nlp based retrieval of medical information for diagnosis of human diseasesNlp based retrieval of medical information for diagnosis of human diseases
Nlp based retrieval of medical information for diagnosis of human diseases
eSAT Publishing House
 
Branch: An interactive, web-based tool for building decision tree classifiers
Branch: An interactive, web-based tool for building decision tree classifiersBranch: An interactive, web-based tool for building decision tree classifiers
Branch: An interactive, web-based tool for building decision tree classifiers
Benjamin Good
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
ijceronline
 
Mark2Cure: a crowdsourcing platform for biomedical literature annotation
Mark2Cure: a crowdsourcing platform for biomedical literature annotationMark2Cure: a crowdsourcing platform for biomedical literature annotation
Mark2Cure: a crowdsourcing platform for biomedical literature annotation
Benjamin Good
 

Similaire à (Poster) Knowledge.Bio: an Interactive Tool for Literature-based Discovery (20)

Statistical Analysis based Hypothesis Testing Method in Biological Knowledge ...
Statistical Analysis based Hypothesis Testing Method in Biological Knowledge ...Statistical Analysis based Hypothesis Testing Method in Biological Knowledge ...
Statistical Analysis based Hypothesis Testing Method in Biological Knowledge ...
 
A Critical Survey On Current Literature-Based Discovery Models
A Critical Survey On Current Literature-Based Discovery ModelsA Critical Survey On Current Literature-Based Discovery Models
A Critical Survey On Current Literature-Based Discovery Models
 
TWO LEVEL SELF-SUPERVISED RELATION EXTRACTION FROM MEDLINE USING UMLS
TWO LEVEL SELF-SUPERVISED RELATION EXTRACTION FROM MEDLINE USING UMLSTWO LEVEL SELF-SUPERVISED RELATION EXTRACTION FROM MEDLINE USING UMLS
TWO LEVEL SELF-SUPERVISED RELATION EXTRACTION FROM MEDLINE USING UMLS
 
Charleston Conference 2016
Charleston Conference 2016Charleston Conference 2016
Charleston Conference 2016
 
Web based servers and softwares for genome analysis
Web based servers and softwares for genome analysisWeb based servers and softwares for genome analysis
Web based servers and softwares for genome analysis
 
Nlp based retrieval of medical information for diagnosis of human diseases
Nlp based retrieval of medical information for diagnosis of human diseasesNlp based retrieval of medical information for diagnosis of human diseases
Nlp based retrieval of medical information for diagnosis of human diseases
 
Nlp based retrieval of medical information for diagnosis of human diseases
Nlp based retrieval of medical information for diagnosis of human diseasesNlp based retrieval of medical information for diagnosis of human diseases
Nlp based retrieval of medical information for diagnosis of human diseases
 
Branch: An interactive, web-based tool for building decision tree classifiers
Branch: An interactive, web-based tool for building decision tree classifiersBranch: An interactive, web-based tool for building decision tree classifiers
Branch: An interactive, web-based tool for building decision tree classifiers
 
Technology R&D Theme 3: Multi-scale Network Representations
Technology R&D Theme 3: Multi-scale Network RepresentationsTechnology R&D Theme 3: Multi-scale Network Representations
Technology R&D Theme 3: Multi-scale Network Representations
 
Semantic Similarity Measures between Terms in the Biomedical Domain within f...
 Semantic Similarity Measures between Terms in the Biomedical Domain within f... Semantic Similarity Measures between Terms in the Biomedical Domain within f...
Semantic Similarity Measures between Terms in the Biomedical Domain within f...
 
Current trends of opinion mining and sentiment analysis in social networks
Current trends of opinion mining and sentiment analysis in social networksCurrent trends of opinion mining and sentiment analysis in social networks
Current trends of opinion mining and sentiment analysis in social networks
 
gky1131.pdf
gky1131.pdfgky1131.pdf
gky1131.pdf
 
Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
www.ijerd.com
www.ijerd.comwww.ijerd.com
www.ijerd.com
 
A Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological CorpusA Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological Corpus
 
A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS
A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS
A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS
 
A Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological CorpusA Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological Corpus
 
Mark2Cure: a crowdsourcing platform for biomedical literature annotation
Mark2Cure: a crowdsourcing platform for biomedical literature annotationMark2Cure: a crowdsourcing platform for biomedical literature annotation
Mark2Cure: a crowdsourcing platform for biomedical literature annotation
 
Edu.03 assignment
Edu.03 assignment Edu.03 assignment
Edu.03 assignment
 

Plus de Benjamin Good

Integrating Pathway Databases with Gene Ontology Causal Activity Models
Integrating Pathway Databases with Gene Ontology Causal Activity ModelsIntegrating Pathway Databases with Gene Ontology Causal Activity Models
Integrating Pathway Databases with Gene Ontology Causal Activity Models
Benjamin Good
 
2016 bd2k bgood_wikidata
2016 bd2k bgood_wikidata2016 bd2k bgood_wikidata
2016 bd2k bgood_wikidata
Benjamin Good
 
Building a massive biomedical knowledge graph with citizen science
Building a massive biomedical knowledge graph with citizen scienceBuilding a massive biomedical knowledge graph with citizen science
Building a massive biomedical knowledge graph with citizen science
Benjamin Good
 

Plus de Benjamin Good (20)

Representing and reasoning with biological knowledge
Representing and reasoning with biological knowledgeRepresenting and reasoning with biological knowledge
Representing and reasoning with biological knowledge
 
Integrating Pathway Databases with Gene Ontology Causal Activity Models
Integrating Pathway Databases with Gene Ontology Causal Activity ModelsIntegrating Pathway Databases with Gene Ontology Causal Activity Models
Integrating Pathway Databases with Gene Ontology Causal Activity Models
 
Pathways2GO: Converting BioPax pathways to GO-CAMs
Pathways2GO: Converting BioPax pathways to GO-CAMsPathways2GO: Converting BioPax pathways to GO-CAMs
Pathways2GO: Converting BioPax pathways to GO-CAMs
 
Knowledge Beacons
Knowledge BeaconsKnowledge Beacons
Knowledge Beacons
 
Building a Biomedical Knowledge Garden
Building a Biomedical Knowledge Garden Building a Biomedical Knowledge Garden
Building a Biomedical Knowledge Garden
 
Science Game Lab
Science Game LabScience Game Lab
Science Game Lab
 
Wikidata and the Semantic Web of Food
Wikidata and the  Semantic Web of FoodWikidata and the  Semantic Web of Food
Wikidata and the Semantic Web of Food
 
Gene Wiki and Wikimedia Foundation SPARQL workshop
Gene Wiki and Wikimedia Foundation SPARQL workshopGene Wiki and Wikimedia Foundation SPARQL workshop
Gene Wiki and Wikimedia Foundation SPARQL workshop
 
Opportunities and challenges presented by Wikidata in the context of biocuration
Opportunities and challenges presented by Wikidata in the context of biocurationOpportunities and challenges presented by Wikidata in the context of biocuration
Opportunities and challenges presented by Wikidata in the context of biocuration
 
Scripps bioinformatics seminar_day_2
Scripps bioinformatics seminar_day_2Scripps bioinformatics seminar_day_2
Scripps bioinformatics seminar_day_2
 
Computing on the shoulders of giants
Computing on the shoulders of giantsComputing on the shoulders of giants
Computing on the shoulders of giants
 
Wikidata workshop for ISB Biocuration 2016
Wikidata workshop for ISB Biocuration 2016Wikidata workshop for ISB Biocuration 2016
Wikidata workshop for ISB Biocuration 2016
 
Channeling Collaborative Spirit
Channeling Collaborative SpiritChanneling Collaborative Spirit
Channeling Collaborative Spirit
 
2016 bd2k bgood_wikidata
2016 bd2k bgood_wikidata2016 bd2k bgood_wikidata
2016 bd2k bgood_wikidata
 
2016 mem good
2016 mem good2016 mem good
2016 mem good
 
Gene Wiki and Mark2Cure update for BD2K
Gene Wiki and Mark2Cure update for BD2KGene Wiki and Mark2Cure update for BD2K
Gene Wiki and Mark2Cure update for BD2K
 
2015 6 bd2k_biobranch_knowbio
2015 6 bd2k_biobranch_knowbio2015 6 bd2k_biobranch_knowbio
2015 6 bd2k_biobranch_knowbio
 
(Bio)Hackathons
(Bio)Hackathons(Bio)Hackathons
(Bio)Hackathons
 
Citizen sciencepanel2015 pdf
Citizen sciencepanel2015 pdfCitizen sciencepanel2015 pdf
Citizen sciencepanel2015 pdf
 
Building a massive biomedical knowledge graph with citizen science
Building a massive biomedical knowledge graph with citizen scienceBuilding a massive biomedical knowledge graph with citizen science
Building a massive biomedical knowledge graph with citizen science
 

Dernier

Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
PirithiRaju
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
gindu3009
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
Areesha Ahmad
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
AlMamun560346
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
ssuser79fe74
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
RizalinePalanog2
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Lokesh Kothari
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
PirithiRaju
 

Dernier (20)

Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 

(Poster) Knowledge.Bio: an Interactive Tool for Literature-based Discovery

  • 1. PubMed now indexes roughly 25 million articles and is growing by more than a million per year. The scale of this “Big Knowledge” repository renders traditional, article-based modes of user interaction unsatisfactory, demanding new interfaces for integrating and summarizing widely distributed knowledge. Natural language processing (NLP) techniques coupled with rich user interfaces can help meet this demand, providing end-users with enhanced views into public knowledge, stimulating their ability to form new hypotheses. Knowledge.Bio provides a Web interface for exploring the results from text-mining PubMed. It works with subject, predicate, object assertions (triples) extracted from individual abstracts and with predicted statistical associations between pairs of concepts. While agnostic to the NLP technology employed, the current implementation is loaded with triples from the SemRep-generated SemmedDB database and putative gene-disease pairs obtained using Leiden University Medical Center’s ‘Implicitome’ technology. Users of Knowledge.Bio begin by identifying a concept of interest using text search. Once a concept is identified, associated triples and concept-pairs are displayed in tables. These tables have text-based and semantic filters to help refine the list of triples to relations of interest. The user then selects relations for insertion into a personal knowledge graph implemented using cytoscape.js. The graph is used as a note-taking or ‘mind-mapping’ structure that can be saved offline and then later reloaded into the application. Clicking on edges within a graph or on the ‘evidence’ element of a triple displays the abstracts where that relation was detected, thus allowing the user to judge the veracity of the statement and to read the underlying articles. Knowledge.Bio is a free, open-source application that can provide, deep, personal, concise, shareable views into the “Big Knowledge” scattered across the biomedical literature. Application: http://knowledge.bio Source code: https://bitbucket.org/sulab/kb1/ Abstract References [1] Ono, Keiichiro, Barry Demchak, and Trey Ideker. "Cytoscape tools for the web age: D3. js and Cytoscape. js exporters." F1000Research 3 (2014). [2] Kilicoglu, Halil, et al. "SemMedDB: a PubMed-scale repository of biomedical semantic predications." Bioinformatics 28.23 (2012): 3158-3160. [3] Rindflesch, Thomas C., and Marcelo Fiszman. "The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text." Journal of biomedical informatics 36.6 (2003): 462-477. [4] Bodenreider, Olivier. "The unified medical language system (UMLS): integrating biomedical terminology." Nucleic acids research 32.suppl 1 (2004): D267-D270. [5] Hettne KM, Thompson M, Van Haagen H, Van der Horst E, Kaliyaperumal R, Mina E, Tatum Z, Laros JFJ, Van Mulligen EM, Schuemie M, Aten E, Shu Li T, Bruskiewich R, Good BM, Su AI, Kors JA, Den Dunnen J, Van Ommen G, Roos M, ìt Hoen PAC, Mons B, Schultes EA. The implicitome: a resource for inferring gene-disease associations. Under review. [6] https://github.com/BiosemanticsDotOrg/GeneDiseasePaper [7] Swanson, Don R. "Medical literature as a potential source of new knowledge." Bulletin of the Medical Library Association 78.1 (1990): 29. Acknowledgements NIGMS GM089820 Benjamin M. Good, Ph.D.1; Richard M. Bruskiewich, Ph.D.2; Kenneth C. Huellas-Bruskiewicz2; Farzin Ahmed2; Andrew I. Su, Ph.D.1 1The Scripps Research Institute, La Jolla, CA, USA. 2STAR Informatics / Delphinai Corporation, Port Moody, BC, Canada Knowledge.Bio: an Interactive Tool for Literature-based Discovery Big Knowledge Cytoscape.js mindmap [1] for charting semantic relationships mined from the literature. User’s create their own maps as they interact with the tool. The maps are interactive, with each edge linked to the evidence underlying it. Maps can be saved as local json files, shared and reloaded into the application. NHGRI HG008015 Contact @bgood bgood@scripps.edu Evidence view. Shows original sentence contexts for explicit triples along with links to view the associated abstract and to view other triples mined from that abstract. For implicit relations, clicking on ‘’show evidence’ opens the ‘co-occurrence’ view so that the user can examine the A-B and B-C connections. Selecting edges in the map allows the user to “show evidence” or to remove the edge. Table views. Concept relations are presented in tables that may be filtered for text or semantic types. Concept search. The user begins with a text search to identify a concept. Once selected, the table views provide access to related concepts spread across the literature • Knowledge.bio supports a concept-centric rather than document-centric mode of interacting with the scientific literature • It provides a way of using both statistically predicted, implicit associations and explicit predications linked to specific sentences in the same application • User-created concept maps can be saved, shared with others, and reloaded into the application. Key features SemMedDB [2] • Uses SemRep [3] to extract semantic predications (‘triples’) from PubMed abstracts. • The complete database contains more than 70 million predications. • Utilizes properties such as ‘treats’, ‘causes’, ‘disrupts’, and ‘augments’ from the UMLS semantic network. [4] Implicitome [5,6] • Uses ‘concept profiles’ to identify implicit relations in the literature. • Concept profiles are defined by weighted vectors of co-occurring concepts • Concept profiles enable inference of relationships through Swanson’s ABC model [7] • LUMC generated 204,072,376 ranked, implicit genes-diseases relations. “Convulsive seizure was suppressed by physostigmine (p less than 0.01) or 5-HTP (p less than 0.20). “ PMID 2893633 SemRep Current sources of concept relations Implicit connection. The edge linking Smith Lemli Opitz Syndrome to CYP2R1 is computationally inferred by the Implicitome. Those concepts never co-occur in any abstract. The explicit relations emanating from them provide many hypothetical explanations for their relationship. Work in progress • Conversion from current implementation as a MySQL-backed Python/DJANGO application to a Java server implementation based on NEO4J. • Supporting “closed discovery” workflows by suggesting relations that connect multiple input nodes. • Integration with http://www.ndexbio.org for storing and sharing user-created concept maps. • Improving the facilitation of collaborative concept-map construction • On-demand import of new concept-network sources such as http://www.wikipathways.org • Capturing user feedback for improving text-mining algorithms 0 200000 400000 600000 800000 1000000 1200000 1914 1934 1954 1974 1994 2014 Number of biomedical research articles created (as listed in PubMed) Year