SlideShare une entreprise Scribd logo
1  sur  1
Télécharger pour lire hors ligne
Pay-as-you-go OWL Query Answering 
Using a Triple Store 
Yujiao Zhou, Yavor Nenov, Bernardo Cuenca Grau and Ian Horrocks 
Problem Setting 
‣ Ontology Σ — a set of rules of the form φ(x) → Vi ∃yi ψ(x, yi) 
‣ Data D — a set of ground atoms of the form P(a) 
‣ Conjunctive queries — FO formula of the form q(x) ← ∃y ψ(x, y) 
where ψ and φ are conjunctions of atoms. 
Pay-as-you-go Approach 
Intuition 
‣ to delegate the bulk of the 
computational workload to a 
highly scalable datalog reasoner 
! 
‣ to minimise the use of a fully-fledged 
reasoner 
Diagram Over-approx to datalog 
‣ Disjunctive knowledge 
Evaluation 
‣ Existential knowledge 
{A, . . .} 
‣ Evaluated on LUBM(100,1000), UOBM(1, 60, 500), FLY, DBPedia+travel 
and NPD FactPages. 
Average time without OWL 2 reasoning 
triple store OWL 2 reasoner 
ELHO Lower 
Upper 
Data 
Ontology 
L=LRL ∪ LEL ∪ … U 
U 
D 
Query 
Datalog Engine 
Fragment 
Summarisation 
Summary 
Datalog 
Engine 
Datalog 
Engine 
Full Reasoner Q 
F 
Dependency Analysis 
F 
Full Reasoner Q 
Output 
L = U 
Tracking by datalog encoding 
σ(cert(q, F)) ⊆ cert(q, σ(F)) 
Rule out 
non-answers 
Incomplete endomorphisms 
Arrange calls to the 
reasoner according 
to the dependencies 
heuristically 
Acknowledgements Average time 
Lower 
Data 
Done 
This work was supported by the Royal Society, the EPSRC projects 
Score!, ExODA, and MaSI3, and the FP7 project OPTIQUE. 
! 
! 
! 
! 
‣ upper bound U 
answer of q w.r.t the resulting set of rules U(Σ) and D. 
Lower bounds 
‣ basic lower bound LRL 
answer of q w.r.t. the datalog fragment of Σ and D; 
‣ EL lower bound LEL 
answer of q w.r.t. the ELHO fragment of Σ and D. 
Tracking encoding in datalog 
Intuition: to compute all the rules and facts that participate in a proof 
of q(a) in Σ∪D. 
This goal can be archived using datalog encoding. 
‣ Example: 
‣ If B1(x1),…,Bm(xm) → H(x) is a rule in U(Σ), 
Ht(x), B1 (x1), . . . , Bm (xm) → S(cr)∧B1t (x1 )∧ . . . ∧Bmt(xm ) is added to 
the tracking rule. 
‣ Involved rules: {r | S(cr) is derived} 
Involved facts: {P(a) ∈ D | Pt(a) is derived} 
Summarisation & dependency between answers 
‣ Let σ be the summary function, σ(cert(q, F)) ⊆ cert(q, σ(F)) 
‣ If there is an endomorphism from a to b in F, then 
a ∈ cert(q, F) implies b ∈ cert(q, F) 
{. . . , A u B, . . .} 
{C} {C} 
x1 
{A, . . .} 
x2 
R R 
{A, . . .} 
c {C} 
x1 
{A, . . .} 
x2 
R R 
{. . . , A t B, . . .} 
DL Ontology Dataset Queries 
LUBM(n) SHI 93 ~100,000n 14 (std)+10 
UOBM(n) SHIN 314 ~200,000n 1!5 
FLY SRI 144,407 6,308 
88 
5 
DBPedia SHOIN 1,757 12,119,662 441 (atomic) 
NPD SHIF 819 3,817,079 329 (atomic) 
LUBM(1000) UOBM(100) FLY DBPedia NPD 
Queries 22/24 12/15 5/5 439/441 294/329 
Time(s) 18.4 0.7 0.2 0.3 0.1 
LUBM(100) UOBM(1) FLY DBPedia NPD 
Time(s) 29.6 1.8 0.2 3 3

Contenu connexe

Tendances

An adaptive algorithm for detection of duplicate records
An adaptive algorithm for detection of duplicate recordsAn adaptive algorithm for detection of duplicate records
An adaptive algorithm for detection of duplicate records
Likan Patra
 

Tendances (20)

Formal methods 8 - category theory (last one)
Formal methods   8 - category theory (last one)Formal methods   8 - category theory (last one)
Formal methods 8 - category theory (last one)
 
Variational inference using implicit distributions
Variational inference using implicit distributionsVariational inference using implicit distributions
Variational inference using implicit distributions
 
Sol8
Sol8Sol8
Sol8
 
Truth, deduction, computation lecture i (last one)
Truth, deduction, computation   lecture i (last one)Truth, deduction, computation   lecture i (last one)
Truth, deduction, computation lecture i (last one)
 
Lecture 7 2011
Lecture 7 2011Lecture 7 2011
Lecture 7 2011
 
Bolt: Building A Distributed ndarray
Bolt: Building A Distributed ndarrayBolt: Building A Distributed ndarray
Bolt: Building A Distributed ndarray
 
Non determinism through type isomophism
Non determinism through type isomophismNon determinism through type isomophism
Non determinism through type isomophism
 
Improving estimates for discrete polynomial averaging operators
Improving estimates for discrete polynomial averaging operatorsImproving estimates for discrete polynomial averaging operators
Improving estimates for discrete polynomial averaging operators
 
Binary Session Types for Psi-Calculi (APLAS 2016)
Binary Session Types for Psi-Calculi (APLAS 2016)Binary Session Types for Psi-Calculi (APLAS 2016)
Binary Session Types for Psi-Calculi (APLAS 2016)
 
RuleML2015: Learning Characteristic Rules in Geographic Information Systems
RuleML2015: Learning Characteristic Rules in Geographic Information SystemsRuleML2015: Learning Characteristic Rules in Geographic Information Systems
RuleML2015: Learning Characteristic Rules in Geographic Information Systems
 
Paper Review: An exact mapping between the Variational Renormalization Group ...
Paper Review: An exact mapping between the Variational Renormalization Group ...Paper Review: An exact mapping between the Variational Renormalization Group ...
Paper Review: An exact mapping between the Variational Renormalization Group ...
 
Slides econ-lm
Slides econ-lmSlides econ-lm
Slides econ-lm
 
An adaptive algorithm for detection of duplicate records
An adaptive algorithm for detection of duplicate recordsAn adaptive algorithm for detection of duplicate records
An adaptive algorithm for detection of duplicate records
 
A Unifying Review of Gaussian Linear Models (Roweis 1999)
A Unifying Review of Gaussian Linear Models (Roweis 1999)A Unifying Review of Gaussian Linear Models (Roweis 1999)
A Unifying Review of Gaussian Linear Models (Roweis 1999)
 
Per3 logika
Per3 logikaPer3 logika
Per3 logika
 
Volume computation and applications
Volume computation and applications Volume computation and applications
Volume computation and applications
 
Chapter7
Chapter7Chapter7
Chapter7
 
NICE Implementations of Variational Inference
NICE Implementations of Variational Inference NICE Implementations of Variational Inference
NICE Implementations of Variational Inference
 
KNN, SVM, Naive bayes classifiers
KNN, SVM, Naive bayes classifiersKNN, SVM, Naive bayes classifiers
KNN, SVM, Naive bayes classifiers
 
Cs6503 theory of computation november december 2015 be cse anna university q...
Cs6503 theory of computation november december 2015  be cse anna university q...Cs6503 theory of computation november december 2015  be cse anna university q...
Cs6503 theory of computation november december 2015 be cse anna university q...
 

En vedette

Sem facet paper
Sem facet paperSem facet paper
Sem facet paper
DBOnto
 
Aggregating Semantic Annotators Paper
Aggregating Semantic Annotators PaperAggregating Semantic Annotators Paper
Aggregating Semantic Annotators Paper
DBOnto
 
ROSeAnn Presentation
ROSeAnn PresentationROSeAnn Presentation
ROSeAnn Presentation
DBOnto
 

En vedette (20)

PDQ Poster
PDQ PosterPDQ Poster
PDQ Poster
 
Sem facet paper
Sem facet paperSem facet paper
Sem facet paper
 
PAGOdA paper
PAGOdA paperPAGOdA paper
PAGOdA paper
 
Aggregating Semantic Annotators Paper
Aggregating Semantic Annotators PaperAggregating Semantic Annotators Paper
Aggregating Semantic Annotators Paper
 
RDFox Poster
RDFox PosterRDFox Poster
RDFox Poster
 
ArtForm - Dynamic analysis of JavaScript validation in web forms - Poster
ArtForm - Dynamic analysis of JavaScript validation in web forms - PosterArtForm - Dynamic analysis of JavaScript validation in web forms - Poster
ArtForm - Dynamic analysis of JavaScript validation in web forms - Poster
 
Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...
Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...
Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...
 
SemFacet paper
SemFacet paperSemFacet paper
SemFacet paper
 
Optique - poster
Optique - posterOptique - poster
Optique - poster
 
Query Distributed RDF Graphs: The Effects of Partitioning Paper
Query Distributed RDF Graphs: The Effects of Partitioning PaperQuery Distributed RDF Graphs: The Effects of Partitioning Paper
Query Distributed RDF Graphs: The Effects of Partitioning Paper
 
Diadem DBOnto Kick Off meeting
Diadem DBOnto Kick Off meetingDiadem DBOnto Kick Off meeting
Diadem DBOnto Kick Off meeting
 
SemFacet Poster
SemFacet PosterSemFacet Poster
SemFacet Poster
 
ROSeAnn Presentation
ROSeAnn PresentationROSeAnn Presentation
ROSeAnn Presentation
 
Overview of Dan Olteanu's Research presentation
Overview of Dan Olteanu's Research presentationOverview of Dan Olteanu's Research presentation
Overview of Dan Olteanu's Research presentation
 
PAGOdA Presentation
PAGOdA PresentationPAGOdA Presentation
PAGOdA Presentation
 
PDQ: Proof-driven Querying presentation
PDQ: Proof-driven Querying presentationPDQ: Proof-driven Querying presentation
PDQ: Proof-driven Querying presentation
 
Optique presentation
Optique presentationOptique presentation
Optique presentation
 
Semantic Faceted Search with SemFacet presentation
Semantic Faceted Search with SemFacet presentationSemantic Faceted Search with SemFacet presentation
Semantic Faceted Search with SemFacet presentation
 
Welcome by Ian Horrocks
Welcome by Ian HorrocksWelcome by Ian Horrocks
Welcome by Ian Horrocks
 
DIADEM: domain-centric intelligent automated data extraction methodology Pres...
DIADEM: domain-centric intelligent automated data extraction methodology Pres...DIADEM: domain-centric intelligent automated data extraction methodology Pres...
DIADEM: domain-centric intelligent automated data extraction methodology Pres...
 

Similaire à PAGOdA poster

Boolean Programs and Quantified Propositional Proof System -
Boolean Programs and Quantified Propositional Proof System - Boolean Programs and Quantified Propositional Proof System -
Boolean Programs and Quantified Propositional Proof System -
Michael Soltys
 
Lecture 3 qualtifed rules of inference
Lecture 3 qualtifed rules of inferenceLecture 3 qualtifed rules of inference
Lecture 3 qualtifed rules of inference
asimnawaz54
 
Query Answering in Probabilistic Datalog+/{ Ontologies under Group Preferences
Query Answering in Probabilistic Datalog+/{ Ontologies under Group PreferencesQuery Answering in Probabilistic Datalog+/{ Ontologies under Group Preferences
Query Answering in Probabilistic Datalog+/{ Ontologies under Group Preferences
Oana Tifrea-Marciuska
 
02-VariableLengthCodes_pres.pdf
02-VariableLengthCodes_pres.pdf02-VariableLengthCodes_pres.pdf
02-VariableLengthCodes_pres.pdf
JunZhao68
 

Similaire à PAGOdA poster (20)

L03 ai - knowledge representation using logic
L03 ai - knowledge representation using logicL03 ai - knowledge representation using logic
L03 ai - knowledge representation using logic
 
Algorithmic foundations.docx
Algorithmic foundations.docxAlgorithmic foundations.docx
Algorithmic foundations.docx
 
IVR - Chapter 1 - Introduction
IVR - Chapter 1 - IntroductionIVR - Chapter 1 - Introduction
IVR - Chapter 1 - Introduction
 
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
 
從 VAE 走向深度學習新理論
從 VAE 走向深度學習新理論從 VAE 走向深度學習新理論
從 VAE 走向深度學習新理論
 
HPWFcorePRES--FUR2016
HPWFcorePRES--FUR2016HPWFcorePRES--FUR2016
HPWFcorePRES--FUR2016
 
The Chase in Database Theory
The Chase in Database TheoryThe Chase in Database Theory
The Chase in Database Theory
 
Predicate logic_2(Artificial Intelligence)
Predicate logic_2(Artificial Intelligence)Predicate logic_2(Artificial Intelligence)
Predicate logic_2(Artificial Intelligence)
 
Boolean Programs and Quantified Propositional Proof System -
Boolean Programs and Quantified Propositional Proof System - Boolean Programs and Quantified Propositional Proof System -
Boolean Programs and Quantified Propositional Proof System -
 
Testing for mixtures by seeking components
Testing for mixtures by seeking componentsTesting for mixtures by seeking components
Testing for mixtures by seeking components
 
Delayed acceptance for Metropolis-Hastings algorithms
Delayed acceptance for Metropolis-Hastings algorithmsDelayed acceptance for Metropolis-Hastings algorithms
Delayed acceptance for Metropolis-Hastings algorithms
 
Lecture 3 qualtifed rules of inference
Lecture 3 qualtifed rules of inferenceLecture 3 qualtifed rules of inference
Lecture 3 qualtifed rules of inference
 
Query Answering in Probabilistic Datalog+/{ Ontologies under Group Preferences
Query Answering in Probabilistic Datalog+/{ Ontologies under Group PreferencesQuery Answering in Probabilistic Datalog+/{ Ontologies under Group Preferences
Query Answering in Probabilistic Datalog+/{ Ontologies under Group Preferences
 
Locality-sensitive hashing for search in metric space
Locality-sensitive hashing for search in metric space Locality-sensitive hashing for search in metric space
Locality-sensitive hashing for search in metric space
 
Murphy: Machine learning A probabilistic perspective: Ch.9
Murphy: Machine learning A probabilistic perspective: Ch.9Murphy: Machine learning A probabilistic perspective: Ch.9
Murphy: Machine learning A probabilistic perspective: Ch.9
 
Approximate Tree Kernels
Approximate Tree KernelsApproximate Tree Kernels
Approximate Tree Kernels
 
Unit III Knowledge Representation in AI K.Sundar,AP/CSE,VEC
Unit III  Knowledge Representation in AI   K.Sundar,AP/CSE,VECUnit III  Knowledge Representation in AI   K.Sundar,AP/CSE,VEC
Unit III Knowledge Representation in AI K.Sundar,AP/CSE,VEC
 
Meta-learning and the ELBO
Meta-learning and the ELBOMeta-learning and the ELBO
Meta-learning and the ELBO
 
Bayesian Deep Learning
Bayesian Deep LearningBayesian Deep Learning
Bayesian Deep Learning
 
02-VariableLengthCodes_pres.pdf
02-VariableLengthCodes_pres.pdf02-VariableLengthCodes_pres.pdf
02-VariableLengthCodes_pres.pdf
 

Dernier

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Dernier (20)

What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 

PAGOdA poster

  • 1. Pay-as-you-go OWL Query Answering Using a Triple Store Yujiao Zhou, Yavor Nenov, Bernardo Cuenca Grau and Ian Horrocks Problem Setting ‣ Ontology Σ — a set of rules of the form φ(x) → Vi ∃yi ψ(x, yi) ‣ Data D — a set of ground atoms of the form P(a) ‣ Conjunctive queries — FO formula of the form q(x) ← ∃y ψ(x, y) where ψ and φ are conjunctions of atoms. Pay-as-you-go Approach Intuition ‣ to delegate the bulk of the computational workload to a highly scalable datalog reasoner ! ‣ to minimise the use of a fully-fledged reasoner Diagram Over-approx to datalog ‣ Disjunctive knowledge Evaluation ‣ Existential knowledge {A, . . .} ‣ Evaluated on LUBM(100,1000), UOBM(1, 60, 500), FLY, DBPedia+travel and NPD FactPages. Average time without OWL 2 reasoning triple store OWL 2 reasoner ELHO Lower Upper Data Ontology L=LRL ∪ LEL ∪ … U U D Query Datalog Engine Fragment Summarisation Summary Datalog Engine Datalog Engine Full Reasoner Q F Dependency Analysis F Full Reasoner Q Output L = U Tracking by datalog encoding σ(cert(q, F)) ⊆ cert(q, σ(F)) Rule out non-answers Incomplete endomorphisms Arrange calls to the reasoner according to the dependencies heuristically Acknowledgements Average time Lower Data Done This work was supported by the Royal Society, the EPSRC projects Score!, ExODA, and MaSI3, and the FP7 project OPTIQUE. ! ! ! ! ‣ upper bound U answer of q w.r.t the resulting set of rules U(Σ) and D. Lower bounds ‣ basic lower bound LRL answer of q w.r.t. the datalog fragment of Σ and D; ‣ EL lower bound LEL answer of q w.r.t. the ELHO fragment of Σ and D. Tracking encoding in datalog Intuition: to compute all the rules and facts that participate in a proof of q(a) in Σ∪D. This goal can be archived using datalog encoding. ‣ Example: ‣ If B1(x1),…,Bm(xm) → H(x) is a rule in U(Σ), Ht(x), B1 (x1), . . . , Bm (xm) → S(cr)∧B1t (x1 )∧ . . . ∧Bmt(xm ) is added to the tracking rule. ‣ Involved rules: {r | S(cr) is derived} Involved facts: {P(a) ∈ D | Pt(a) is derived} Summarisation & dependency between answers ‣ Let σ be the summary function, σ(cert(q, F)) ⊆ cert(q, σ(F)) ‣ If there is an endomorphism from a to b in F, then a ∈ cert(q, F) implies b ∈ cert(q, F) {. . . , A u B, . . .} {C} {C} x1 {A, . . .} x2 R R {A, . . .} c {C} x1 {A, . . .} x2 R R {. . . , A t B, . . .} DL Ontology Dataset Queries LUBM(n) SHI 93 ~100,000n 14 (std)+10 UOBM(n) SHIN 314 ~200,000n 1!5 FLY SRI 144,407 6,308 88 5 DBPedia SHOIN 1,757 12,119,662 441 (atomic) NPD SHIF 819 3,817,079 329 (atomic) LUBM(1000) UOBM(100) FLY DBPedia NPD Queries 22/24 12/15 5/5 439/441 294/329 Time(s) 18.4 0.7 0.2 0.3 0.1 LUBM(100) UOBM(1) FLY DBPedia NPD Time(s) 29.6 1.8 0.2 3 3