SlideShare une entreprise Scribd logo
1  sur  1
Télécharger pour lire hors ligne
Finding Plans from Proofs 
PDQ: Proof-driven Query Answering over Web-based Data Michael Benedikt, Julien Leblay, Efthymia Tsamoura - Oxford University 
Supported by EPSRC grant EP/H017690/1, Query-driven Data Acquisition from Web-based Data Sources 
Project homepage: 
http://www.cs.ox.ac.uk/projects/pdq/ 
Contact: 
firstname.lastname@cs.ox.ac.uk 
Example: online services for geographic information 
r1: Places(id, name, type, coordinates, ...) information about places (e.g. city, country, continent, lake, etc.) 
r2: BelongsTo(source, target) containment between places, "China belongs to Asia". 
r3: Countries(id, name, iso_code, ...) information about countries. 
φ1:Places(x, y, Country, ...) ↔ Countries(x, y, ...) 
Query for countries in Asia: not answerable without considering constraints. 
SELECT p1.name FROM BelongsTo AS bt 
JOIN Places AS p1 ON p1.id=bt.source 
JOIN Places AS p2 ON p2.id=bt.target 
WHERE p1.type = ’Country’ AND p2.name = ’Asia’ 
Pre-processing steps create auxiliary schema by adding relations InferredAccPlaces, InferredAccBelongsTo, InferredAccCountries, Accessible and constraints: 
φ’1: InferredAccPlaces(x, y, Country, ...) ↔ InferredAccCountries(x, y,...) 
α1: Accessible(y)∧Places(x, y , z, ...) 
→ InferredAccPlaces(x, y, z, ...)∧Accessible(x)∧Accessible(z)∧... 
α2: Accessible(x)∧BelongsTo(x, y) → InferredAccBelongsTo(x, y)∧Accessible(y) 
α3: Countries(x, y, z, ...) → InferredAccCountries(x, y, z, ...)∧Accessible(x)∧… 
α4: … 
Context Web data sources which may have: 
•overlapping information, 
•access restrictions. As a result: 
•There may be no web query plan for a given user query. 
•There may be many plans using different sources with different costs. Need to reason about Integrity constraints and access limitations. PDQ System for determining a query plan in the presence of web-based sources. 
i.constraint-aware 
ii.access-aware – abiding by access restrictions, 
iii.cost-aware – making use of any cost information Approach: generating query plans from proofs that a query is answerable. 
Input 
S: Schema 〈R, Σ〉, R set of relations with access methods (free, limited, inaccessible), Σ set of integrity constraints (TGDs). 
Q: Conjunctive query over S. 
f: Cost function on evaluation plans. 
Output 
Pbest: plan with minimal cost. 
Step 1: Pre-processing 
S augmented with new relations and axioms modelling the access restrictions. 
A goal query Qinferred is created based on the relations of the augmented schema. 
Q is grounded to form the initial state of the plan search. 
Step 2: Basic search step 
Each state is closed under firing of 
rules (blue arrows) other than accessibility axioms (denoted αi). 
Every possible firing of accessibility axioms (red arrows) gives a new candidate state, inheriting all the facts of its ancestors. 
Step 3: Plans and costs 
Each new state gives a plan, to which a cost is assigned (orange circles). 
If state corresponds to a match with Qinferred and its plan’s cost is lower than the best so far, it becomes the new best state. 
Queries over Web Data 
Architecture & User Experience 
User interface for creating and editing schemas and queries 
Interactive exploration of the planner’s search space. 
Online execution of plans. 
User interface for creating and configuring planning sessions. 
Dashboard 
Architecture 
Runtime 
Planner 
InferredAccPlaces(id2, "Asia", c2, …), 
Accessible(id2), Accessible(c2), … 
T’1 ⇐ Places ⇐ ("퐴푠푖푎") 
InferredAccPlaces(id2, "Asia", c2, …), 
Accessible(id2), Accessible(c2), … 
T2 ⇐ Places ⇐("퐴푠푖푎") 
T3 := T1 ⋈ T2 
InferredAccBelongsTo(id1, id2) 
T4 ⇐ BelongsTo ⇐ π source (T3) 
T5 := π name ( T3 ⋈ T4 ) 
Places(id1, name1, "Country", …), Places(id2, "Asia", …), 
BelongsTo(id1, id2), Accessible("Asia"), Accessible("Country") 
Initial State 
Countries(id1, name1, c1, …) 
φ1 
Goal : Qinferred(name) ← InferredAccPlaces(id1, name1, "Country", …) 
∧ InferredAccPlaces(id2, "Asia", …)∧ InferredAccBelongsTo(id1, id2) 
φ‘1 
α1 
α1 
α2 
α3 
InferredAccCountries(id1, name1, c1, …), Accessible(id1), Accessible(name1), Accessible(c1) 
T1 ⇐ Countries ⇐ Ø 
InferredAccPlaces(id1, name1, "Country", …) 
InferredAccCountries(id1, name1, c1, …), Accessible(id1), Accessible(name1), … 
T’2 ⇐ Countries ⇐ Ø 
T’3 := T’1 ⋈ T’2 
InferredAccPlaces(id1, name1, "Country", …) 
φ‘1 
α3 
InferredAccBelongsTo(id1, id2) 
T’4 ⇐ BelongsTo ⇐ π source (T’3) T‘5 := π name ( T‘3 ⋈ T‘4 ) 
α2 
3 
2 
25 
35 
45 
55 
Models free access on Countries

Contenu connexe

En vedette

Sem facet paper
Sem facet paperSem facet paper
Sem facet paper
DBOnto
 
Aggregating Semantic Annotators Paper
Aggregating Semantic Annotators PaperAggregating Semantic Annotators Paper
Aggregating Semantic Annotators Paper
DBOnto
 
ROSeAnn Presentation
ROSeAnn PresentationROSeAnn Presentation
ROSeAnn Presentation
DBOnto
 

En vedette (19)

Sem facet paper
Sem facet paperSem facet paper
Sem facet paper
 
RDFox Poster
RDFox PosterRDFox Poster
RDFox Poster
 
PAGOdA paper
PAGOdA paperPAGOdA paper
PAGOdA paper
 
Optique - poster
Optique - posterOptique - poster
Optique - poster
 
Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...
Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...
Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...
 
PAGOdA poster
PAGOdA posterPAGOdA poster
PAGOdA poster
 
Aggregating Semantic Annotators Paper
Aggregating Semantic Annotators PaperAggregating Semantic Annotators Paper
Aggregating Semantic Annotators Paper
 
SemFacet paper
SemFacet paperSemFacet paper
SemFacet paper
 
Query Distributed RDF Graphs: The Effects of Partitioning Paper
Query Distributed RDF Graphs: The Effects of Partitioning PaperQuery Distributed RDF Graphs: The Effects of Partitioning Paper
Query Distributed RDF Graphs: The Effects of Partitioning Paper
 
Semantic Faceted Search with SemFacet presentation
Semantic Faceted Search with SemFacet presentationSemantic Faceted Search with SemFacet presentation
Semantic Faceted Search with SemFacet presentation
 
SemFacet Poster
SemFacet PosterSemFacet Poster
SemFacet Poster
 
Optique presentation
Optique presentationOptique presentation
Optique presentation
 
PDQ: Proof-driven Querying presentation
PDQ: Proof-driven Querying presentationPDQ: Proof-driven Querying presentation
PDQ: Proof-driven Querying presentation
 
PAGOdA Presentation
PAGOdA PresentationPAGOdA Presentation
PAGOdA Presentation
 
Diadem DBOnto Kick Off meeting
Diadem DBOnto Kick Off meetingDiadem DBOnto Kick Off meeting
Diadem DBOnto Kick Off meeting
 
Overview of Dan Olteanu's Research presentation
Overview of Dan Olteanu's Research presentationOverview of Dan Olteanu's Research presentation
Overview of Dan Olteanu's Research presentation
 
ROSeAnn Presentation
ROSeAnn PresentationROSeAnn Presentation
ROSeAnn Presentation
 
DIADEM: domain-centric intelligent automated data extraction methodology Pres...
DIADEM: domain-centric intelligent automated data extraction methodology Pres...DIADEM: domain-centric intelligent automated data extraction methodology Pres...
DIADEM: domain-centric intelligent automated data extraction methodology Pres...
 
Parallel Datalog Reasoning in RDFox Presentation
Parallel Datalog Reasoning in RDFox PresentationParallel Datalog Reasoning in RDFox Presentation
Parallel Datalog Reasoning in RDFox Presentation
 

Similaire à PDQ Poster

probabilistic ranking
probabilistic rankingprobabilistic ranking
probabilistic ranking
FELIX75
 
Towards controlling evolutionary dynamics through network geometry: some very...
Towards controlling evolutionary dynamics through network geometry: some very...Towards controlling evolutionary dynamics through network geometry: some very...
Towards controlling evolutionary dynamics through network geometry: some very...
Kolja Kleineberg
 
Query Translation for Ontology-extended Data Sources
Query Translation for Ontology-extended Data SourcesQuery Translation for Ontology-extended Data Sources
Query Translation for Ontology-extended Data Sources
Jie Bao
 

Similaire à PDQ Poster (20)

EDBT 2015: Summer School Overview
EDBT 2015: Summer School OverviewEDBT 2015: Summer School Overview
EDBT 2015: Summer School Overview
 
Text Mining Applied to SQL Queries: a Case Study for SDSS SkyServer
Text Mining Applied to SQL Queries: a Case Study for SDSS SkyServerText Mining Applied to SQL Queries: a Case Study for SDSS SkyServer
Text Mining Applied to SQL Queries: a Case Study for SDSS SkyServer
 
probabilistic ranking
probabilistic rankingprobabilistic ranking
probabilistic ranking
 
A Survey of Entity Ranking over RDF Graphs
A Survey of Entity Ranking over RDF GraphsA Survey of Entity Ranking over RDF Graphs
A Survey of Entity Ranking over RDF Graphs
 
Fosdem 2013 petra selmer flexible querying of graph data
Fosdem 2013 petra selmer   flexible querying of graph dataFosdem 2013 petra selmer   flexible querying of graph data
Fosdem 2013 petra selmer flexible querying of graph data
 
Open Analytics Environment
Open Analytics EnvironmentOpen Analytics Environment
Open Analytics Environment
 
Graph Analyses with Python and NetworkX
Graph Analyses with Python and NetworkXGraph Analyses with Python and NetworkX
Graph Analyses with Python and NetworkX
 
High-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and ModelingHigh-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and Modeling
 
Graph technology meetup slides
Graph technology meetup slidesGraph technology meetup slides
Graph technology meetup slides
 
Sigir 2011 proceedings
Sigir 2011 proceedingsSigir 2011 proceedings
Sigir 2011 proceedings
 
Entity-Relationship Queries over Wikipedia
Entity-Relationship Queries over WikipediaEntity-Relationship Queries over Wikipedia
Entity-Relationship Queries over Wikipedia
 
Bayesian Network 을 활용한 예측 분석
Bayesian Network 을 활용한 예측 분석Bayesian Network 을 활용한 예측 분석
Bayesian Network 을 활용한 예측 분석
 
Evolution of Graph Algorithms – Benefits and Challenges
Evolution of Graph Algorithms – Benefits and ChallengesEvolution of Graph Algorithms – Benefits and Challenges
Evolution of Graph Algorithms – Benefits and Challenges
 
Intelligent Methods in Models of Text Information Retrieval: Implications for...
Intelligent Methods in Models of Text Information Retrieval: Implications for...Intelligent Methods in Models of Text Information Retrieval: Implications for...
Intelligent Methods in Models of Text Information Retrieval: Implications for...
 
Summary of SIGIR 2011 Papers
Summary of SIGIR 2011 PapersSummary of SIGIR 2011 Papers
Summary of SIGIR 2011 Papers
 
FScaFi: A Core Calculus for Collective Adaptive Systems Programming
FScaFi: A Core Calculus for Collective Adaptive Systems ProgrammingFScaFi: A Core Calculus for Collective Adaptive Systems Programming
FScaFi: A Core Calculus for Collective Adaptive Systems Programming
 
Spatial Database Systems
Spatial Database SystemsSpatial Database Systems
Spatial Database Systems
 
Towards controlling evolutionary dynamics through network geometry: some very...
Towards controlling evolutionary dynamics through network geometry: some very...Towards controlling evolutionary dynamics through network geometry: some very...
Towards controlling evolutionary dynamics through network geometry: some very...
 
Query Translation for Ontology-extended Data Sources
Query Translation for Ontology-extended Data SourcesQuery Translation for Ontology-extended Data Sources
Query Translation for Ontology-extended Data Sources
 
Euro30 2019 - Benchmarking tree approaches on street data
Euro30 2019 - Benchmarking tree approaches on street dataEuro30 2019 - Benchmarking tree approaches on street data
Euro30 2019 - Benchmarking tree approaches on street data
 

Dernier

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Dernier (20)

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 

PDQ Poster

  • 1. Finding Plans from Proofs PDQ: Proof-driven Query Answering over Web-based Data Michael Benedikt, Julien Leblay, Efthymia Tsamoura - Oxford University Supported by EPSRC grant EP/H017690/1, Query-driven Data Acquisition from Web-based Data Sources Project homepage: http://www.cs.ox.ac.uk/projects/pdq/ Contact: firstname.lastname@cs.ox.ac.uk Example: online services for geographic information r1: Places(id, name, type, coordinates, ...) information about places (e.g. city, country, continent, lake, etc.) r2: BelongsTo(source, target) containment between places, "China belongs to Asia". r3: Countries(id, name, iso_code, ...) information about countries. φ1:Places(x, y, Country, ...) ↔ Countries(x, y, ...) Query for countries in Asia: not answerable without considering constraints. SELECT p1.name FROM BelongsTo AS bt JOIN Places AS p1 ON p1.id=bt.source JOIN Places AS p2 ON p2.id=bt.target WHERE p1.type = ’Country’ AND p2.name = ’Asia’ Pre-processing steps create auxiliary schema by adding relations InferredAccPlaces, InferredAccBelongsTo, InferredAccCountries, Accessible and constraints: φ’1: InferredAccPlaces(x, y, Country, ...) ↔ InferredAccCountries(x, y,...) α1: Accessible(y)∧Places(x, y , z, ...) → InferredAccPlaces(x, y, z, ...)∧Accessible(x)∧Accessible(z)∧... α2: Accessible(x)∧BelongsTo(x, y) → InferredAccBelongsTo(x, y)∧Accessible(y) α3: Countries(x, y, z, ...) → InferredAccCountries(x, y, z, ...)∧Accessible(x)∧… α4: … Context Web data sources which may have: •overlapping information, •access restrictions. As a result: •There may be no web query plan for a given user query. •There may be many plans using different sources with different costs. Need to reason about Integrity constraints and access limitations. PDQ System for determining a query plan in the presence of web-based sources. i.constraint-aware ii.access-aware – abiding by access restrictions, iii.cost-aware – making use of any cost information Approach: generating query plans from proofs that a query is answerable. Input S: Schema 〈R, Σ〉, R set of relations with access methods (free, limited, inaccessible), Σ set of integrity constraints (TGDs). Q: Conjunctive query over S. f: Cost function on evaluation plans. Output Pbest: plan with minimal cost. Step 1: Pre-processing S augmented with new relations and axioms modelling the access restrictions. A goal query Qinferred is created based on the relations of the augmented schema. Q is grounded to form the initial state of the plan search. Step 2: Basic search step Each state is closed under firing of rules (blue arrows) other than accessibility axioms (denoted αi). Every possible firing of accessibility axioms (red arrows) gives a new candidate state, inheriting all the facts of its ancestors. Step 3: Plans and costs Each new state gives a plan, to which a cost is assigned (orange circles). If state corresponds to a match with Qinferred and its plan’s cost is lower than the best so far, it becomes the new best state. Queries over Web Data Architecture & User Experience User interface for creating and editing schemas and queries Interactive exploration of the planner’s search space. Online execution of plans. User interface for creating and configuring planning sessions. Dashboard Architecture Runtime Planner InferredAccPlaces(id2, "Asia", c2, …), Accessible(id2), Accessible(c2), … T’1 ⇐ Places ⇐ ("퐴푠푖푎") InferredAccPlaces(id2, "Asia", c2, …), Accessible(id2), Accessible(c2), … T2 ⇐ Places ⇐("퐴푠푖푎") T3 := T1 ⋈ T2 InferredAccBelongsTo(id1, id2) T4 ⇐ BelongsTo ⇐ π source (T3) T5 := π name ( T3 ⋈ T4 ) Places(id1, name1, "Country", …), Places(id2, "Asia", …), BelongsTo(id1, id2), Accessible("Asia"), Accessible("Country") Initial State Countries(id1, name1, c1, …) φ1 Goal : Qinferred(name) ← InferredAccPlaces(id1, name1, "Country", …) ∧ InferredAccPlaces(id2, "Asia", …)∧ InferredAccBelongsTo(id1, id2) φ‘1 α1 α1 α2 α3 InferredAccCountries(id1, name1, c1, …), Accessible(id1), Accessible(name1), Accessible(c1) T1 ⇐ Countries ⇐ Ø InferredAccPlaces(id1, name1, "Country", …) InferredAccCountries(id1, name1, c1, …), Accessible(id1), Accessible(name1), … T’2 ⇐ Countries ⇐ Ø T’3 := T’1 ⋈ T’2 InferredAccPlaces(id1, name1, "Country", …) φ‘1 α3 InferredAccBelongsTo(id1, id2) T’4 ⇐ BelongsTo ⇐ π source (T’3) T‘5 := π name ( T‘3 ⋈ T‘4 ) α2 3 2 25 35 45 55 Models free access on Countries