SlideShare une entreprise Scribd logo
1  sur  49
Building a semantic integration framework to support a federated query environment in 5 steps Philip Ashworth UCB Celltech Dean Allemang TopQuadrant
Data Integration… Why? ,[object Object],[object Object],[object Object],[object Object],[object Object]
Data Integration… Problems Warehouse DB Project DB Project Marts Applications App DB App DB’s App DB’s Registration, Query DI, Query DI Query DI App DB’s
Data Integration… Problems ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Integration… Thoughts  Data Integration is clearly evolving But it is not fulfilling the needs If we identify the need… can we see what we should be doing?
Accessible Data True Integration Variety of Sources Align Concepts Data has Context All Data for All Projects Data Integration… Needs
Data Integration… There is a way! Open Linked Data Cloud Connected and linked data with context Created by a community Significant linking hubs appearing Significant scientific content A Valuable resource that will only Grow! Something we can learn from!
Data Integration… Starting an Evolutionary Leap  No one internally really knows about this Can’t just rip and replace old systems Have to do some ground work
Linked Data…The Quest ,[object Object],[object Object],[object Object],[object Object],[object Object]
Linked Data… The Quest Highly Repetitive & Promiscuous Highly Promiscuous & Repetitive
[object Object],[object Object],[object Object],[object Object],[object Object],Linked Data
Rest Services  (Abstraction layer) Semantic Integration Framework Knowledge Collation, Concept mapping, Distributed Query Result inference, Aggregation Increasing Ease of Development Decreasing knowledge of Semantic technologies The Idea Applications Business Process /  Workflow Automation PURL Data Sources RDBMS Oracle,Postgres SQL, mySql RDF Triple  Store MS Excel TXT Doc RDF Sparql EndPoint Sparql EndPoint Native
Step 1. Data Sources ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],RDBMS D2R SPARQL Endpoints Virtuoso RDF
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Step 1. Data Sources IDAC MOC PEP UCB Data Cloud Linked Open Data Cloud Abysis NBE Mart SEQ Bio2RDF PDB NBE WH ITrack PMT LDAP WKW UCB PDB Premier Sider Kegg cpd   Diseasome Kegg gl   Kegg dr  chebi Uniprot ec  geneid  RDF
Step 2: Integration Framework: ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Rest Services  (Abstraction layer) Semantic Integration Framework Knowledge Collation, Concept mapping, Distributed Query Result inference, Aggregation Applications Business Process / Workflow Automation PURL RDF Data Sources Understand Data Sources (concepts, access, props) Understand  Links Across Sources Automate some tasks Accessible Via Services Easy to wire up Understand UCB concepts Understand how UCB Concepts fit with source concepts Step 2: Integration Framework
Step 2: Integration Framework. ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Sem Int Framework
Step 2: Integration Framework. DB1 Dataset Ontology (VoID) UCB Concept Ontology (SKOS) Integration Framework Sem Int Framework narrowMatch UCB:Antibody DB1:Antibody UCB:Person DB1:User narrowMatch narrowMatch UCB:Project DB1:Project
Step 2: Integration Framework. DB1 Dataset Ontology (VoID) UCB Concept Ontology (SKOS) DB2:Person UCB:Person DB1:User DB3:Employee DB2 DB3 DB3:Contact Sem Int Framework narrowMatch narrowMatch narrowMatch narrowMatch
Step 2: Integration Framework. DB1 Dataset Ontology (VoID) UCB Concept Ontology (SKOS) DB2:Person UCB:Person DB1:User DB3:Employee DB2 DB3 DB3:Contact Person_DB1_DB2 Person_DB1_DB3 Linksets Sem Int Framework narrowMatch narrowMatch narrowMatch narrowMatch
Step 2: Integration Framework. Dataset Ontology (VoID) UCB Concept Ontology (SKOS) Sem Int Framework 2 3 1 10 7 4 8 5 9 6 12 11
Step 3: Rest Services ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Rest Services
Step 3: Rest Services Dataset Ontology (VoID) UCB Concept Ontology (SKOS) DB2 DB3 Keyword Search Get Info Find UCB:Person “phil” Search DB1:User Tell me the sub-types of UCB:Person Here are the resources for “phil” ldap:U0xx10x, itrack:101, moc:scordisp etc…. Search DB3:Employee Search DB3:Contact Search DB2:Person Rest Services Can the linksets tell us any info? Tell me the datasets for the sub-types DB1
Step 3: Rest Services Dataset Ontology (VoID) UCB Concept Ontology (SKOS) Keyword Search Get Info Tell me the super-types of all resources Retrieve DB1:U0xx10x Tell me about moc:scordisp Here is everything I know about it. DB2 DB3 Retrieve   DB2:scordisp Retrieve   DB3:philscordis Tell me everything about this resource? Rest Services DB1
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Step 4: Building an Application 1 Applications
Step 4: Data Exploration UCB Concepts Search submitted to “Keyword Search” Service Applications
Step 4: Data Exploration Results Displayed. Index shows inference is already taking place Applications
Step 4: Data Exploration Drag Instance to basket, Initiates “Get Info” Service call Applications
Step 4: Data Exploration Select Instance Data Displayed per Source Applications
Step 4: Data Exploration Links to other data items Applications
Step 4: Data Exploration Displays Sparse data Submit Instance to“Get info” service Applications
Step 4: Data Exploration More Detailed Information Applications
Step 4: Data Exploration He has another interaction.  Lets Explore. Applications
Step 4: Data Exploration Applications
Step 4: Data Exploration Applications Data cached as we navigated Concept Explorer. Can now be investigated.
Step 4: Data Exploration Structure concept Keyword Search pulls data from internal and external data sources Add to basket  After detailed Information retrieved a second Structure has been identified without a keyword search Integrated Internal and External data Applications
Step 4: Data Exploration Applications
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Step 4: Building an Application 2 Applications
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Step 4: Building an Application 3 Applications
Step 4:  Knowledge Base Semantic Integration Framework Keyword Search Get Info Data Sources App Service “ Tell me about the protein with  Gene ID X ” and I want to know about  Literature Refs ,  Sequences ,  Descriptions,  Structure …… etc. Applications
Step 4: Knowledge Base  Applications
Step 4: Knowledge Base  Applications
Step 4: Knowledge Base  Applications
Step 4: Knowledge Base  Applications
Step 4: Knowledge Base  Applications
Step 5: Purl Server ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],PURL
Conclusions & Business value ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Conclusions & Business value ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Questions?

Contenu connexe

Tendances

Software Analytics with Jupyter, Pandas, jQAssistant, and Neo4j [Neo4j Online...
Software Analytics with Jupyter, Pandas, jQAssistant, and Neo4j [Neo4j Online...Software Analytics with Jupyter, Pandas, jQAssistant, and Neo4j [Neo4j Online...
Software Analytics with Jupyter, Pandas, jQAssistant, and Neo4j [Neo4j Online...
Markus Harrer
 

Tendances (20)

Disrupting Data Discovery
Disrupting Data DiscoveryDisrupting Data Discovery
Disrupting Data Discovery
 
Vital AI: Big Data Modeling
Vital AI: Big Data ModelingVital AI: Big Data Modeling
Vital AI: Big Data Modeling
 
Windows Azure: Lessons From The Field
Windows Azure: Lessons From The FieldWindows Azure: Lessons From The Field
Windows Azure: Lessons From The Field
 
Course 4 : Big Data Structuring, Integration and Management Systems by Daan G...
Course 4 : Big Data Structuring, Integration and Management Systems by Daan G...Course 4 : Big Data Structuring, Integration and Management Systems by Daan G...
Course 4 : Big Data Structuring, Integration and Management Systems by Daan G...
 
Wikidata as a hub for the linked data cloud
Wikidata as a hub for the linked data cloudWikidata as a hub for the linked data cloud
Wikidata as a hub for the linked data cloud
 
Amundsen: From discovering to security data
Amundsen: From discovering to security dataAmundsen: From discovering to security data
Amundsen: From discovering to security data
 
Automated and Explainable Deep Learning for Clinical Language Understanding a...
Automated and Explainable Deep Learning for Clinical Language Understanding a...Automated and Explainable Deep Learning for Clinical Language Understanding a...
Automated and Explainable Deep Learning for Clinical Language Understanding a...
 
Hibernate
HibernateHibernate
Hibernate
 
Data Discovery & Trust through Metadata
Data Discovery & Trust through MetadataData Discovery & Trust through Metadata
Data Discovery & Trust through Metadata
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
 
DataHub
DataHubDataHub
DataHub
 
The Apache Solr Semantic Knowledge Graph
The Apache Solr Semantic Knowledge GraphThe Apache Solr Semantic Knowledge Graph
The Apache Solr Semantic Knowledge Graph
 
ChemConnect: Characterizing CombusAon KineAc Data with ontologies and meta-­‐...
ChemConnect: Characterizing CombusAon KineAc Data with ontologies and meta-­‐...ChemConnect: Characterizing CombusAon KineAc Data with ontologies and meta-­‐...
ChemConnect: Characterizing CombusAon KineAc Data with ontologies and meta-­‐...
 
Software Analytics with Jupyter, Pandas, jQAssistant, and Neo4j [Neo4j Online...
Software Analytics with Jupyter, Pandas, jQAssistant, and Neo4j [Neo4j Online...Software Analytics with Jupyter, Pandas, jQAssistant, and Neo4j [Neo4j Online...
Software Analytics with Jupyter, Pandas, jQAssistant, and Neo4j [Neo4j Online...
 
Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science
 
Total Data Industry Report
Total Data Industry ReportTotal Data Industry Report
Total Data Industry Report
 
OrientDB: Unlock the Value of Document Data Relationships
OrientDB: Unlock the Value of Document Data RelationshipsOrientDB: Unlock the Value of Document Data Relationships
OrientDB: Unlock the Value of Document Data Relationships
 
Einführung in Neo4j
Einführung in Neo4jEinführung in Neo4j
Einführung in Neo4j
 
Graph Databases - Where Do We Do the Modeling Part?
Graph Databases - Where Do We Do the Modeling Part?Graph Databases - Where Do We Do the Modeling Part?
Graph Databases - Where Do We Do the Modeling Part?
 
Providing Tools for Author Evaluation - A case study
Providing Tools for Author Evaluation - A case studyProviding Tools for Author Evaluation - A case study
Providing Tools for Author Evaluation - A case study
 

En vedette

Sem tech 2011 v8
Sem tech 2011 v8Sem tech 2011 v8
Sem tech 2011 v8
dallemang
 

En vedette (6)

Sem tech 2011 v8
Sem tech 2011 v8Sem tech 2011 v8
Sem tech 2011 v8
 
Lotico oct 2010
Lotico oct 2010Lotico oct 2010
Lotico oct 2010
 
LA CIRUGÍA ESTÉTICA A FAVOR O EN CONTRA
LA CIRUGÍA ESTÉTICA A FAVOR O EN CONTRA LA CIRUGÍA ESTÉTICA A FAVOR O EN CONTRA
LA CIRUGÍA ESTÉTICA A FAVOR O EN CONTRA
 
Teaching Students with Emojis, Emoticons, & Textspeak
Teaching Students with Emojis, Emoticons, & TextspeakTeaching Students with Emojis, Emoticons, & Textspeak
Teaching Students with Emojis, Emoticons, & Textspeak
 
Hype vs. Reality: The AI Explainer
Hype vs. Reality: The AI ExplainerHype vs. Reality: The AI Explainer
Hype vs. Reality: The AI Explainer
 
Study: The Future of VR, AR and Self-Driving Cars
Study: The Future of VR, AR and Self-Driving CarsStudy: The Future of VR, AR and Self-Driving Cars
Study: The Future of VR, AR and Self-Driving Cars
 

Similaire à Sem tech 2011 v8

Document Based Data Modeling Technique
Document Based Data Modeling TechniqueDocument Based Data Modeling Technique
Document Based Data Modeling Technique
Carmen Sanborn
 
ALM Search Presentation for the VSS Arch Council
ALM Search Presentation for the VSS Arch CouncilALM Search Presentation for the VSS Arch Council
ALM Search Presentation for the VSS Arch Council
Sunita Shrivastava
 
The Recent Pronouncement Of The World Wide Web (Www) Had
The Recent Pronouncement Of The World Wide Web (Www) HadThe Recent Pronouncement Of The World Wide Web (Www) Had
The Recent Pronouncement Of The World Wide Web (Www) Had
Deborah Gastineau
 
Apprendre Via les Objets Xin Chen
Apprendre Via les Objets  Xin ChenApprendre Via les Objets  Xin Chen
Apprendre Via les Objets Xin Chen
cecilechen85
 
Cloud and Bid data Dr.VK.pdf
Cloud and Bid data Dr.VK.pdfCloud and Bid data Dr.VK.pdf
Cloud and Bid data Dr.VK.pdf
kalai75
 

Similaire à Sem tech 2011 v8 (20)

Document Based Data Modeling Technique
Document Based Data Modeling TechniqueDocument Based Data Modeling Technique
Document Based Data Modeling Technique
 
The Evolution of Metadata: LinkedIn's Story [Strata NYC 2019]
The Evolution of Metadata: LinkedIn's Story [Strata NYC 2019]The Evolution of Metadata: LinkedIn's Story [Strata NYC 2019]
The Evolution of Metadata: LinkedIn's Story [Strata NYC 2019]
 
SemTech 2010: Pelorus Platform
SemTech 2010: Pelorus PlatformSemTech 2010: Pelorus Platform
SemTech 2010: Pelorus Platform
 
Spark Based Distributed Deep Learning Framework For Big Data Applications
Spark Based Distributed Deep Learning Framework For Big Data Applications Spark Based Distributed Deep Learning Framework For Big Data Applications
Spark Based Distributed Deep Learning Framework For Big Data Applications
 
A Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataA Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate Data
 
My Master's Thesis
My Master's ThesisMy Master's Thesis
My Master's Thesis
 
ALM Search Presentation for the VSS Arch Council
ALM Search Presentation for the VSS Arch CouncilALM Search Presentation for the VSS Arch Council
ALM Search Presentation for the VSS Arch Council
 
disertation
disertationdisertation
disertation
 
NoSQL Basics - a quick tour
NoSQL Basics - a quick tourNoSQL Basics - a quick tour
NoSQL Basics - a quick tour
 
Myth Busters II: BI Tools and Data Virtualization are Interchangeable
Myth Busters II: BI Tools and Data Virtualization are InterchangeableMyth Busters II: BI Tools and Data Virtualization are Interchangeable
Myth Busters II: BI Tools and Data Virtualization are Interchangeable
 
Stanford DeepDive Framework
Stanford DeepDive FrameworkStanford DeepDive Framework
Stanford DeepDive Framework
 
The Recent Pronouncement Of The World Wide Web (Www) Had
The Recent Pronouncement Of The World Wide Web (Www) HadThe Recent Pronouncement Of The World Wide Web (Www) Had
The Recent Pronouncement Of The World Wide Web (Www) Had
 
Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, Confluent
Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, ConfluentApache Kafka and the Data Mesh | Ben Stopford and Michael Noll, Confluent
Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, Confluent
 
How Linked Data Can Speed Information Discovery
How Linked Data Can Speed Information DiscoveryHow Linked Data Can Speed Information Discovery
How Linked Data Can Speed Information Discovery
 
70487.pdf
70487.pdf70487.pdf
70487.pdf
 
Apprendre Via les Objets Xin Chen
Apprendre Via les Objets  Xin ChenApprendre Via les Objets  Xin Chen
Apprendre Via les Objets Xin Chen
 
Cloud and Bid data Dr.VK.pdf
Cloud and Bid data Dr.VK.pdfCloud and Bid data Dr.VK.pdf
Cloud and Bid data Dr.VK.pdf
 
What Impact Will Entity Framework Have On Architecture
What Impact Will Entity Framework Have On ArchitectureWhat Impact Will Entity Framework Have On Architecture
What Impact Will Entity Framework Have On Architecture
 
Erciyes university
Erciyes universityErciyes university
Erciyes university
 
Session 0.0 poster minutes madness
Session 0.0   poster minutes madnessSession 0.0   poster minutes madness
Session 0.0 poster minutes madness
 

Dernier

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
ssuserdda66b
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
KarakKing
 

Dernier (20)

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 

Sem tech 2011 v8

  • 1. Building a semantic integration framework to support a federated query environment in 5 steps Philip Ashworth UCB Celltech Dean Allemang TopQuadrant
  • 2.
  • 3. Data Integration… Problems Warehouse DB Project DB Project Marts Applications App DB App DB’s App DB’s Registration, Query DI, Query DI Query DI App DB’s
  • 4.
  • 5. Data Integration… Thoughts Data Integration is clearly evolving But it is not fulfilling the needs If we identify the need… can we see what we should be doing?
  • 6. Accessible Data True Integration Variety of Sources Align Concepts Data has Context All Data for All Projects Data Integration… Needs
  • 7. Data Integration… There is a way! Open Linked Data Cloud Connected and linked data with context Created by a community Significant linking hubs appearing Significant scientific content A Valuable resource that will only Grow! Something we can learn from!
  • 8. Data Integration… Starting an Evolutionary Leap No one internally really knows about this Can’t just rip and replace old systems Have to do some ground work
  • 9.
  • 10. Linked Data… The Quest Highly Repetitive & Promiscuous Highly Promiscuous & Repetitive
  • 11.
  • 12. Rest Services (Abstraction layer) Semantic Integration Framework Knowledge Collation, Concept mapping, Distributed Query Result inference, Aggregation Increasing Ease of Development Decreasing knowledge of Semantic technologies The Idea Applications Business Process / Workflow Automation PURL Data Sources RDBMS Oracle,Postgres SQL, mySql RDF Triple Store MS Excel TXT Doc RDF Sparql EndPoint Sparql EndPoint Native
  • 13.
  • 14.
  • 15.
  • 16. Rest Services (Abstraction layer) Semantic Integration Framework Knowledge Collation, Concept mapping, Distributed Query Result inference, Aggregation Applications Business Process / Workflow Automation PURL RDF Data Sources Understand Data Sources (concepts, access, props) Understand Links Across Sources Automate some tasks Accessible Via Services Easy to wire up Understand UCB concepts Understand how UCB Concepts fit with source concepts Step 2: Integration Framework
  • 17.
  • 18. Step 2: Integration Framework. DB1 Dataset Ontology (VoID) UCB Concept Ontology (SKOS) Integration Framework Sem Int Framework narrowMatch UCB:Antibody DB1:Antibody UCB:Person DB1:User narrowMatch narrowMatch UCB:Project DB1:Project
  • 19. Step 2: Integration Framework. DB1 Dataset Ontology (VoID) UCB Concept Ontology (SKOS) DB2:Person UCB:Person DB1:User DB3:Employee DB2 DB3 DB3:Contact Sem Int Framework narrowMatch narrowMatch narrowMatch narrowMatch
  • 20. Step 2: Integration Framework. DB1 Dataset Ontology (VoID) UCB Concept Ontology (SKOS) DB2:Person UCB:Person DB1:User DB3:Employee DB2 DB3 DB3:Contact Person_DB1_DB2 Person_DB1_DB3 Linksets Sem Int Framework narrowMatch narrowMatch narrowMatch narrowMatch
  • 21. Step 2: Integration Framework. Dataset Ontology (VoID) UCB Concept Ontology (SKOS) Sem Int Framework 2 3 1 10 7 4 8 5 9 6 12 11
  • 22.
  • 23. Step 3: Rest Services Dataset Ontology (VoID) UCB Concept Ontology (SKOS) DB2 DB3 Keyword Search Get Info Find UCB:Person “phil” Search DB1:User Tell me the sub-types of UCB:Person Here are the resources for “phil” ldap:U0xx10x, itrack:101, moc:scordisp etc…. Search DB3:Employee Search DB3:Contact Search DB2:Person Rest Services Can the linksets tell us any info? Tell me the datasets for the sub-types DB1
  • 24. Step 3: Rest Services Dataset Ontology (VoID) UCB Concept Ontology (SKOS) Keyword Search Get Info Tell me the super-types of all resources Retrieve DB1:U0xx10x Tell me about moc:scordisp Here is everything I know about it. DB2 DB3 Retrieve DB2:scordisp Retrieve DB3:philscordis Tell me everything about this resource? Rest Services DB1
  • 25.
  • 26. Step 4: Data Exploration UCB Concepts Search submitted to “Keyword Search” Service Applications
  • 27. Step 4: Data Exploration Results Displayed. Index shows inference is already taking place Applications
  • 28. Step 4: Data Exploration Drag Instance to basket, Initiates “Get Info” Service call Applications
  • 29. Step 4: Data Exploration Select Instance Data Displayed per Source Applications
  • 30. Step 4: Data Exploration Links to other data items Applications
  • 31. Step 4: Data Exploration Displays Sparse data Submit Instance to“Get info” service Applications
  • 32. Step 4: Data Exploration More Detailed Information Applications
  • 33. Step 4: Data Exploration He has another interaction. Lets Explore. Applications
  • 34. Step 4: Data Exploration Applications
  • 35. Step 4: Data Exploration Applications Data cached as we navigated Concept Explorer. Can now be investigated.
  • 36. Step 4: Data Exploration Structure concept Keyword Search pulls data from internal and external data sources Add to basket After detailed Information retrieved a second Structure has been identified without a keyword search Integrated Internal and External data Applications
  • 37. Step 4: Data Exploration Applications
  • 38.
  • 39.
  • 40. Step 4: Knowledge Base Semantic Integration Framework Keyword Search Get Info Data Sources App Service “ Tell me about the protein with Gene ID X ” and I want to know about Literature Refs , Sequences , Descriptions, Structure …… etc. Applications
  • 41. Step 4: Knowledge Base Applications
  • 42. Step 4: Knowledge Base Applications
  • 43. Step 4: Knowledge Base Applications
  • 44. Step 4: Knowledge Base Applications
  • 45. Step 4: Knowledge Base Applications
  • 46.
  • 47.
  • 48.