KUPKB: Sharing, Connecting and
 Exposing Kidney and Urinary
Knowledge using RDF and OWL




              www.kupkb.org

        Julie Klein & Simon Jupp
      Bio-health informatics group
        University of Manchester
The problem domain

Thousands of studies have been conducted by the kidney research community

         • On different species: human, mouse

         • On different materials: urine, tissue, cell

         • On different biological levels: gene, protein

      Large diversity → Integration of the knowledge is complex
Where does the data go?
      Bespoke kidney laboratory databases
                                             Research Papers




            Generalist databases




Scattered, hidden in figures, coming in different formats
                Most of the data is lost!
The Kidney and Urinary Pathway Knowledge Base:

                                        SHARE AND CONNECT

The iKUP Browser:

                                        EXPOSE


                        www.kupkb.org
Structure

 Populous
                           Experimental data


KUP Ontology
  (schema)
                                  RightField




               RDF triple store


                                               iKUP Browser
                KUP Knowledge Base
Ontologies provide the schema
                  What has been observed, where and when?



      Mouse anatomy                                             Experimental factors
         ontology

                                     Gene Ontology


                                                                  Animal model
      Cell type ontology
                                                                Disease ontology




             We needed to connect these reference ontologies.
Creation of a specialized Kidney and Urinary Pathway Ontology (KUPO)
                           http://www.e-lico.org/public/kupo/
Ontologies by stealth
                 The domain experts are the experts, so get them to build it
                 Cells (CTO)     Anatomy (MAO)     Biological processes (GO)

Spreadsheet
  OPPL Scripts




 Ontology

                 Populous generates simple Excel-based templates
                          http://www.e-lico.eu/populous.html
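
As a rough illustration of the spreadsheet-to-ontology step, the Python sketch below turns the rows of a filled-in template into simple axiom strings. The column names, the `located_in` and `participates_in` properties, and the example rows are all assumptions made for illustration. Populous itself applies OPPL scripts to the spreadsheet, which this sketch does not reproduce.

```python
import csv
import io

# Hypothetical template content: one row per cell type, with columns that
# would be restricted to terms from the cell, anatomy and process ontologies.
TEMPLATE = """cell,anatomy,process
podocyte,glomerulus,filtration
mesangial cell,glomerulus,phagocytosis
"""


def rows_to_axioms(text):
    """Expand each spreadsheet row into Manchester-style axiom strings."""
    axioms = []
    for row in csv.DictReader(io.StringIO(text)):
        cell = row["cell"].strip()
        anatomy = row["anatomy"].strip()
        process = row["process"].strip()
        axioms.append(f"Class: '{cell}' SubClassOf: located_in some '{anatomy}'")
        axioms.append(f"Class: '{cell}' SubClassOf: participates_in some '{process}'")
    return axioms


axioms = rows_to_axioms(TEMPLATE)
for axiom in axioms:
    print(axiom)
```

The point of the design is that the domain expert only ever edits the table; the pattern that expands a row into axioms is written once by the ontology engineer.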
Describing/Collecting experimental data
Gathering good metadata AND data, again by stealth, using RightField




                      Content of the metadata cells is constrained to
                      the relevant set of KUPO terms




                  http://www.sysmo-db.org/rightfield
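
The idea of constraining metadata cells to a fixed set of terms can be sketched in a few lines of Python. The field names and the `invalid_cells` helper are hypothetical, and the allowed values are simply the species, materials and biological levels named on the problem-domain slide rather than real KUPO terms. RightField enforces such constraints inside the spreadsheet; this sketch only mimics the check.

```python
# Hypothetical allowed terms per metadata field; the real tool embeds
# term lists taken from an ontology into the spreadsheet itself.
ALLOWED_TERMS = {
    "species": {"human", "mouse"},
    "material": {"urine", "tissue", "cell"},
    "level": {"gene", "protein"},
}


def invalid_cells(record):
    """Return the (field, value) pairs whose value is not an allowed term."""
    return [
        (field, value)
        for field, value in record.items()
        if value not in ALLOWED_TERMS.get(field, set())
    ]


good = {"species": "mouse", "material": "urine", "level": "protein"}
bad = {"species": "mice", "material": "urine", "level": "protein"}

print(invalid_cells(good))
print(invalid_cells(bad))
```

Rejecting free text such as "mice" at entry time is what keeps the annotations homogeneous enough to integrate later.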
Mashing it all together



 Kidney and Urinary Pathway Ontology                          Experimental data
~1800 classes (~40,000 after imports closure)         220 KUP experiments integrated


                                        OWL reasoning




              RDF triple store
               ~35M triples
                                         KUP Knowledge Base
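
A minimal sketch of what "mashing together" means at the triple level, assuming a toy in-memory store. All identifiers (`kupo:Podocyte`, `kupkb:observedIn`, `exp:1` and so on) are made up for illustration and are not real KUPKB or KUPO names, and the `match` helper stands in for a real triple store query.

```python
# Triples are (subject, predicate, object) tuples; every identifier below
# is invented for illustration.
ontology = {
    ("kupo:Podocyte", "rdfs:subClassOf", "kupo:KidneyCell"),
}
experiments = {
    ("exp:1", "kupkb:observedIn", "kupo:Podocyte"),
    ("exp:1", "kupkb:measures", "gene:CALR"),
}

# In this toy model, integration is just the union of the two triple sets.
store = ontology | experiments


def match(store, s=None, p=None, o=None):
    """Return the triples matching the pattern; None acts as a wildcard."""
    return sorted(
        t for t in store
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    )


print(match(store, s="exp:1"))
```

Because schema and data share one representation, a single pattern can touch both the ontology and the experimental annotations.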
SPARQLing results
Make it all RDF/OWL and expose a SPARQL endpoint…
                                       …then we are done, right?

   • We can now ask queries that span several databases
   • We can exploit OWL semantics for intelligent answers
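
To illustrate how class semantics can give more complete answers, the sketch below finds experiments annotated with a class or with any of its subclasses. The class hierarchy, the experiment identifiers and both helper functions are assumptions made for illustration. A full OWL reasoner does far more than this single-parent subclass walk.

```python
# Made-up class hierarchy and annotations, for illustration only.
SUBCLASS_OF = {
    "Podocyte": "GlomerularCell",
    "GlomerularCell": "KidneyCell",
    "ProximalTubuleCell": "KidneyCell",
}
OBSERVED_IN = {
    "exp:1": "Podocyte",
    "exp:2": "ProximalTubuleCell",
    "exp:3": "Hepatocyte",
}


def ancestors(cls):
    """Follow subclass links upward and collect every superclass."""
    found = set()
    while cls in SUBCLASS_OF:
        cls = SUBCLASS_OF[cls]
        found.add(cls)
    return found


def experiments_on(cls):
    """Experiments annotated with cls or with any of its subclasses."""
    return sorted(
        exp for exp, observed in OBSERVED_IN.items()
        if observed == cls or cls in ancestors(observed)
    )


print(experiments_on("KidneyCell"))
```

A plain string match on "KidneyCell" would return nothing here, since no experiment carries that exact annotation; the hierarchy is what connects the question to the data.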


     BUT!
 • Easy-to-use application…
                                       …this is what biologists really want
The iKUP browser




Built as an easy-to-use, lightweight Google Web Toolkit application
to expose data from the KUPKB
Doing some biology
1. A biological question
   Can calreticulin be associated with the development of human kidney disease?

2. No answer with classical tools
   Searches in PubMed and Google do not return any relevant result!

3. Querying the KUPKB

4. Validation in the wet lab
   KUPKB in silico result confirmed.

5. Publish an innovative result
   Accepted for publication in the FASEB J!
Reusing and Building


Ontologies provide the schema                        Experimental data


                                OWL reasoning




       RDF triple store
                                KUP Knowledge Base
Reusing and Building


    Ontologies provide the schema                          Experimental data
Kidney and Urinary Pathway Ontology                 Annotations, homogenization
   Tool to facilitate ontology building            Tool to facilitate data annotation
                                        OWL reasoning




            RDF triple store                                     iKUP Browser
                                     KUP Knowledge Base
What next


 • User study and evaluation experiments ongoing with
   Manchester Web Ergonomics Lab

 • Application to other biological domains
    • Change the domain model in the ontologies and we can construct any
      organ knowledge base in this way
    • Interest already expressed in gut, liver, heart and metabolic diseases
Acknowledgments
•   Simon Jupp

•   Stuart Owen, Matthew Horridge, Katy Wolstencroft and Carole Goble @
    University of Manchester for RightField

•   Joost Schanstra, Panagiotis Moulos, Jean-Loup Bascands @ Renal Fibrosis
    Lab, Toulouse, France

•   Aristidis Charonis, Bénédicte Buffin-Meyer, Myriem Fernandez for the CALR
    example

•   e-LICO FP7 project and EuroKUP

•   Robert Stevens, ontology development, University of Manchester

    Open Source License: GNU Lesser General Public License
    Code: http://code.google.com/p/kupkb-dev/
Thank you for listening…




www.kupkb.org
Some rough stats…
• 195 KUP experiments integrated
• KUPKB RDF store ~35M triples
• KUP Ontology: ~1800 classes (~40,000 after imports closure)



Architecture
• Sesame and BigOWLIM for the RDF store
• Web site developed with Google Web Toolkit
• OWL API and HermiT reasoner for classification and faceted browsing
Summary
 • The KUPKB RDF store is a mashup of biological knowledge relating to the
   KUP domain

 • Ontologies provide the schema and a consistent data annotation mechanism

 • We expose this knowledge base through a simple web interface that real
   biologists can use, the iKUP

 • iKUP and KUPKB provide a faster mechanism for the biologist to survey the
   data in biological publications and help the hypothesis generation process

 • It is a testament to the tools and APIs that such applications are now being
   delivered at relatively low cost

Editor's Notes

  1. Renal physiology Human urinary protein map Renal pathophysiology Biomarker discovery
  5. Animate