SlideShare une entreprise Scribd logo
1  sur  10
Télécharger pour lire hors ligne
David Chaves-Fraga, Ontology Engineering Group
Universidad Politécnica de Madrid, Spain
Claudio Gutierrez, CIWS & Universidad de Chile

Oscar Corcho, OEG - UPM

On the Role of the GRAPH
Clause in the Performance
of Federated SPARQL
Queries
dchaves@email

@dchavesf
22/10/2017

PROFILES17
On the Role of GRAPH Clause in the Performance of Federated SPARQL queries
Federated Query and SPARQL endpoint
2
Improve federated query evaluation by systematically using the
GRAPH clause to rewrite a query into a semantically equivalent one
G0
G1 G2
G5
G3
G4
<URI>
<URI>
<URI> <URI>
<URI>
SPARQL endpoint
select * where {
service <s1> {GP1}
service <s2> {GP2}
service <s3> {GP3}
}
Federated Query
On the Role of GRAPH Clause in the Performance of Federated SPARQL queries
Motivation
3
•Many graph patterns bind results from only one of the graphs in the
SPARQL endpoints
•The server can have more information and optimize the retrieval
On the Role of GRAPH Clause in the Performance of Federated SPARQL queries
Reducing the space of a federated SPARQL query
4
•Hypothesis: This method adds efficiency in the evaluation of the query
•Assumption: The identification of the GP
select * where {
service <s1> {GP1}
service <s2> {GP2}
service <s3> {GP3}
}
select * from named <s1-g2>, <s2-g3>, <s3-g1>{
service <s1> {graph <s1-g2> {GP1}}
service <s2> {graph <s2-g3> {GP2}}
service <s3> {graph <s3-g1> {GP3}}
}
On the Role of GRAPH Clause in the Performance of Federated SPARQL queries
Queries and experiment
5
•We use the real public instances of SPARQL endpoints of:
• DBpedia
• KB3city
• Linkedmdb
•5 queries - two different approaches (with/without GRAPH clause):
• 5 executions per approach with the disabled cache.
•Queries available at: https://github.com/dachafra/federated-graph-queries
Problems:
•There is not exist any benchmarking for federated SPARQL queries
that includes SERVICE and GRAPH clauses.
•Deal with availability and real performance of the SPARQL endpoints
On the Role of GRAPH Clause in the Performance of Federated SPARQL queries
Preliminary results
6
Q1 Q2 Q3 Q4 Q5
GRAPH 13,51 3,22 1,98 1,73 8,56
SERVICE 14,63 4,36 2,88 1,69 9,12
•5%-10% improvement
•Q2-Q3-Q4: kb.3city.com SPARQL endpoint
• 73 graphs
• Almost 5 millions of triples
•Q1-Q5: dbpedia.com SPARQL endpoint:
• 19 graphs
• 285 millions of triples
On the Role of GRAPH Clause in the Performance of Federated SPARQL queries
Preliminary results
6
Q1 Q2 Q3 Q4 Q5
GRAPH 13,51 3,22 1,98 1,73 8,56
SERVICE 14,63 4,36 2,88 1,69 9,12
•5%-10% improvement
•Q2-Q3-Q4: kb.3city.com SPARQL endpoint
• 73 graphs
• Almost 5 millions of triples
•Q1-Q5: dbpedia.com SPARQL endpoint:
• 19 graphs
• 285 millions of triples
Virtuoso puts the triples ina single table
On the Role of GRAPH Clause in the Performance of Federated SPARQL queries
Conclusions: The role of the GRAPH clause
7
Conclusions about GRAPH clause:
1. is semantically ambiguous
2. offers a view of how the data
is conceptually ordered but
not physically
3. is a SPARQL clause that
seems to not allow any type of
optimization (at the moment)
DBpedia graphs and triples
On the Role of GRAPH Clause in the Performance of Federated SPARQL queries
Future work
8
•Create a Benchmarking for federated queries with SERVICE and
GRAPH
•Apply the approach to other triple stores
•Carry out semantic equivalence between open data portal and a
SPARQL endpoint
•Link unique graph patterns with their pertained graphs
•Analyze why Virtuoso and Apache Jena can execute the approach
David Chaves-Fraga, Ontology Engineering Group
Universidad Politécnica de Madrid, Spain
Claudio Gutierrez, CIWS & Universidad de Chile

Oscar Corcho, OEG - UPM

On the Role of the GRAPH
Clause in the Performance
of Federated SPARQL
Queries
dchaves@email

@dchavesf
22/10/2017

PROFILES17

Contenu connexe

Tendances

How to build an ETL pipeline with Apache Beam on Google Cloud Dataflow
How to build an ETL pipeline with Apache Beam on Google Cloud DataflowHow to build an ETL pipeline with Apache Beam on Google Cloud Dataflow
How to build an ETL pipeline with Apache Beam on Google Cloud DataflowLucas Arruda
 
Towards a comprehensive call ontology for research 2.0
Towards a comprehensive call ontology for research 2.0Towards a comprehensive call ontology for research 2.0
Towards a comprehensive call ontology for research 2.0Vladimir Tomberg
 
Begin at the beginning: Feature selection for Big Data by Amparo Alonso at Bi...
Begin at the beginning: Feature selection for Big Data by Amparo Alonso at Bi...Begin at the beginning: Feature selection for Big Data by Amparo Alonso at Bi...
Begin at the beginning: Feature selection for Big Data by Amparo Alonso at Bi...Big Data Spain
 
Everything You Always Wanted to Know About Kafka’s Rebalance Protocol but Wer...
Everything You Always Wanted to Know About Kafka’s Rebalance Protocol but Wer...Everything You Always Wanted to Know About Kafka’s Rebalance Protocol but Wer...
Everything You Always Wanted to Know About Kafka’s Rebalance Protocol but Wer...confluent
 
Processing genetic data at scale
Processing genetic data at scaleProcessing genetic data at scale
Processing genetic data at scaleMark Schroering
 
6 Lowpan Rpl Tutorial Code
6 Lowpan Rpl Tutorial Code6 Lowpan Rpl Tutorial Code
6 Lowpan Rpl Tutorial CodePhdtopiccom
 
PGConf APAC 2018 - Lightening Talk #3: How To Contribute to PostgreSQL
PGConf APAC 2018 - Lightening Talk #3: How To Contribute to PostgreSQLPGConf APAC 2018 - Lightening Talk #3: How To Contribute to PostgreSQL
PGConf APAC 2018 - Lightening Talk #3: How To Contribute to PostgreSQLPGConf APAC
 
MLSD18. Summary of Morning Sessions
MLSD18. Summary of Morning SessionsMLSD18. Summary of Morning Sessions
MLSD18. Summary of Morning SessionsBigML, Inc
 
qCube: Efficient integration of range query operators over a high dimension d...
qCube: Efficient integration of range query operators over a high dimension d...qCube: Efficient integration of range query operators over a high dimension d...
qCube: Efficient integration of range query operators over a high dimension d...Rodrigo Rocha Silva
 
MLSD18. Basic Transformations - QCRI
MLSD18. Basic Transformations - QCRIMLSD18. Basic Transformations - QCRI
MLSD18. Basic Transformations - QCRIBigML, Inc
 
Real-Time Supply Chain Analytics with Machine Learning, Kafka, and Spark
Real-Time Supply Chain Analytics with Machine Learning, Kafka, and SparkReal-Time Supply Chain Analytics with Machine Learning, Kafka, and Spark
Real-Time Supply Chain Analytics with Machine Learning, Kafka, and SparkSingleStore
 

Tendances (13)

IMDb Data Integration
IMDb Data IntegrationIMDb Data Integration
IMDb Data Integration
 
How to build an ETL pipeline with Apache Beam on Google Cloud Dataflow
How to build an ETL pipeline with Apache Beam on Google Cloud DataflowHow to build an ETL pipeline with Apache Beam on Google Cloud Dataflow
How to build an ETL pipeline with Apache Beam on Google Cloud Dataflow
 
Towards a comprehensive call ontology for research 2.0
Towards a comprehensive call ontology for research 2.0Towards a comprehensive call ontology for research 2.0
Towards a comprehensive call ontology for research 2.0
 
The R Ecosystem
The R EcosystemThe R Ecosystem
The R Ecosystem
 
Begin at the beginning: Feature selection for Big Data by Amparo Alonso at Bi...
Begin at the beginning: Feature selection for Big Data by Amparo Alonso at Bi...Begin at the beginning: Feature selection for Big Data by Amparo Alonso at Bi...
Begin at the beginning: Feature selection for Big Data by Amparo Alonso at Bi...
 
Everything You Always Wanted to Know About Kafka’s Rebalance Protocol but Wer...
Everything You Always Wanted to Know About Kafka’s Rebalance Protocol but Wer...Everything You Always Wanted to Know About Kafka’s Rebalance Protocol but Wer...
Everything You Always Wanted to Know About Kafka’s Rebalance Protocol but Wer...
 
Processing genetic data at scale
Processing genetic data at scaleProcessing genetic data at scale
Processing genetic data at scale
 
6 Lowpan Rpl Tutorial Code
6 Lowpan Rpl Tutorial Code6 Lowpan Rpl Tutorial Code
6 Lowpan Rpl Tutorial Code
 
PGConf APAC 2018 - Lightening Talk #3: How To Contribute to PostgreSQL
PGConf APAC 2018 - Lightening Talk #3: How To Contribute to PostgreSQLPGConf APAC 2018 - Lightening Talk #3: How To Contribute to PostgreSQL
PGConf APAC 2018 - Lightening Talk #3: How To Contribute to PostgreSQL
 
MLSD18. Summary of Morning Sessions
MLSD18. Summary of Morning SessionsMLSD18. Summary of Morning Sessions
MLSD18. Summary of Morning Sessions
 
qCube: Efficient integration of range query operators over a high dimension d...
qCube: Efficient integration of range query operators over a high dimension d...qCube: Efficient integration of range query operators over a high dimension d...
qCube: Efficient integration of range query operators over a high dimension d...
 
MLSD18. Basic Transformations - QCRI
MLSD18. Basic Transformations - QCRIMLSD18. Basic Transformations - QCRI
MLSD18. Basic Transformations - QCRI
 
Real-Time Supply Chain Analytics with Machine Learning, Kafka, and Spark
Real-Time Supply Chain Analytics with Machine Learning, Kafka, and SparkReal-Time Supply Chain Analytics with Machine Learning, Kafka, and Spark
Real-Time Supply Chain Analytics with Machine Learning, Kafka, and Spark
 

Similaire à On the Role of the GRAPH Clause in the Performance of Federated SPARQL Queries

WBDB 2015 Performance Evaluation of Spark SQL using BigBench
WBDB 2015 Performance Evaluation of Spark SQL using BigBenchWBDB 2015 Performance Evaluation of Spark SQL using BigBench
WBDB 2015 Performance Evaluation of Spark SQL using BigBencht_ivanov
 
Scaling your GraphQL applications with Dgraph
Scaling your GraphQL applications with DgraphScaling your GraphQL applications with Dgraph
Scaling your GraphQL applications with DgraphKarthic Rao
 
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result VisualizationRethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result VisualizationOlaf Hartig
 
Recent Developments In SparkR For Advanced Analytics
Recent Developments In SparkR For Advanced AnalyticsRecent Developments In SparkR For Advanced Analytics
Recent Developments In SparkR For Advanced AnalyticsDatabricks
 
How easy (or hard) it is to monitor your graph ql service performance
How easy (or hard) it is to monitor your graph ql service performanceHow easy (or hard) it is to monitor your graph ql service performance
How easy (or hard) it is to monitor your graph ql service performanceLuca Mattia Ferrari
 
final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)Ankit Rathi
 
GraphQL Schema Stitching with Prisma & Contentful
GraphQL Schema Stitching with Prisma & ContentfulGraphQL Schema Stitching with Prisma & Contentful
GraphQL Schema Stitching with Prisma & ContentfulNikolas Burk
 
What could go wrong with a GraphQL query and can OpenTelemetry help? KubeCon...
What could go wrong  with a GraphQL query and can OpenTelemetry help? KubeCon...What could go wrong  with a GraphQL query and can OpenTelemetry help? KubeCon...
What could go wrong with a GraphQL query and can OpenTelemetry help? KubeCon...SonjaChevre
 
GPORCA: Query Optimization as a Service
GPORCA: Query Optimization as a ServiceGPORCA: Query Optimization as a Service
GPORCA: Query Optimization as a ServicePivotalOpenSourceHub
 
Graphql for Frontend Developers Simplifying Data Fetching.docx
Graphql for Frontend Developers Simplifying Data Fetching.docxGraphql for Frontend Developers Simplifying Data Fetching.docx
Graphql for Frontend Developers Simplifying Data Fetching.docxssuser5583681
 
Geographica: A Benchmark for Geospatial RDF Stores
Geographica: A Benchmark for Geospatial RDF StoresGeographica: A Benchmark for Geospatial RDF Stores
Geographica: A Benchmark for Geospatial RDF StoresKostis Kyzirakos
 
Sempala - Interactive SPARQL Query Processing on Hadoop
Sempala - Interactive SPARQL Query Processing on HadoopSempala - Interactive SPARQL Query Processing on Hadoop
Sempala - Interactive SPARQL Query Processing on HadoopAlexander Schätzle
 
Training Week: GraphQL 2022
Training Week: GraphQL 2022Training Week: GraphQL 2022
Training Week: GraphQL 2022Neo4j
 
GraphQL - Piękne API w Twojej Aplikacji - KrakowGraphAcademy
GraphQL - Piękne API w Twojej Aplikacji - KrakowGraphAcademyGraphQL - Piękne API w Twojej Aplikacji - KrakowGraphAcademy
GraphQL - Piękne API w Twojej Aplikacji - KrakowGraphAcademyMarcinStachniuk
 
CONDG April 23 2020 - Baskar Rao - GraphQL
CONDG April 23 2020 - Baskar Rao - GraphQLCONDG April 23 2020 - Baskar Rao - GraphQL
CONDG April 23 2020 - Baskar Rao - GraphQLMatthew Groves
 

Similaire à On the Role of the GRAPH Clause in the Performance of Federated SPARQL Queries (20)

WBDB 2015 Performance Evaluation of Spark SQL using BigBench
WBDB 2015 Performance Evaluation of Spark SQL using BigBenchWBDB 2015 Performance Evaluation of Spark SQL using BigBench
WBDB 2015 Performance Evaluation of Spark SQL using BigBench
 
23003471225PPT.pptx
23003471225PPT.pptx23003471225PPT.pptx
23003471225PPT.pptx
 
Scaling your GraphQL applications with Dgraph
Scaling your GraphQL applications with DgraphScaling your GraphQL applications with Dgraph
Scaling your GraphQL applications with Dgraph
 
GraphQL
GraphQLGraphQL
GraphQL
 
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result VisualizationRethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
 
Recent Developments In SparkR For Advanced Analytics
Recent Developments In SparkR For Advanced AnalyticsRecent Developments In SparkR For Advanced Analytics
Recent Developments In SparkR For Advanced Analytics
 
How easy (or hard) it is to monitor your graph ql service performance
How easy (or hard) it is to monitor your graph ql service performanceHow easy (or hard) it is to monitor your graph ql service performance
How easy (or hard) it is to monitor your graph ql service performance
 
final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)
 
GraphQL Schema Stitching with Prisma & Contentful
GraphQL Schema Stitching with Prisma & ContentfulGraphQL Schema Stitching with Prisma & Contentful
GraphQL Schema Stitching with Prisma & Contentful
 
What could go wrong with a GraphQL query and can OpenTelemetry help? KubeCon...
What could go wrong  with a GraphQL query and can OpenTelemetry help? KubeCon...What could go wrong  with a GraphQL query and can OpenTelemetry help? KubeCon...
What could go wrong with a GraphQL query and can OpenTelemetry help? KubeCon...
 
GPORCA: Query Optimization as a Service
GPORCA: Query Optimization as a ServiceGPORCA: Query Optimization as a Service
GPORCA: Query Optimization as a Service
 
GraphQL + relay
GraphQL + relayGraphQL + relay
GraphQL + relay
 
GraphQL-ify your APIs
GraphQL-ify your APIsGraphQL-ify your APIs
GraphQL-ify your APIs
 
Graphql for Frontend Developers Simplifying Data Fetching.docx
Graphql for Frontend Developers Simplifying Data Fetching.docxGraphql for Frontend Developers Simplifying Data Fetching.docx
Graphql for Frontend Developers Simplifying Data Fetching.docx
 
GraphQL Fundamentals
GraphQL FundamentalsGraphQL Fundamentals
GraphQL Fundamentals
 
Geographica: A Benchmark for Geospatial RDF Stores
Geographica: A Benchmark for Geospatial RDF StoresGeographica: A Benchmark for Geospatial RDF Stores
Geographica: A Benchmark for Geospatial RDF Stores
 
Sempala - Interactive SPARQL Query Processing on Hadoop
Sempala - Interactive SPARQL Query Processing on HadoopSempala - Interactive SPARQL Query Processing on Hadoop
Sempala - Interactive SPARQL Query Processing on Hadoop
 
Training Week: GraphQL 2022
Training Week: GraphQL 2022Training Week: GraphQL 2022
Training Week: GraphQL 2022
 
GraphQL - Piękne API w Twojej Aplikacji - KrakowGraphAcademy
GraphQL - Piękne API w Twojej Aplikacji - KrakowGraphAcademyGraphQL - Piękne API w Twojej Aplikacji - KrakowGraphAcademy
GraphQL - Piękne API w Twojej Aplikacji - KrakowGraphAcademy
 
CONDG April 23 2020 - Baskar Rao - GraphQL
CONDG April 23 2020 - Baskar Rao - GraphQLCONDG April 23 2020 - Baskar Rao - GraphQL
CONDG April 23 2020 - Baskar Rao - GraphQL
 

Plus de David Chaves-Fraga

Enlazando datos abiertos en el transporte público
Enlazando datos abiertos en el transporte públicoEnlazando datos abiertos en el transporte público
Enlazando datos abiertos en el transporte públicoDavid Chaves-Fraga
 
Open Summer of Code Spain 2018
Open Summer of Code Spain 2018Open Summer of Code Spain 2018
Open Summer of Code Spain 2018David Chaves-Fraga
 
The tripscore Linked Data client: calculating specific summaries over large t...
The tripscore Linked Data client: calculating specific summaries over large t...The tripscore Linked Data client: calculating specific summaries over large t...
The tripscore Linked Data client: calculating specific summaries over large t...David Chaves-Fraga
 
Tripscore - DataMad2017 (Madrid)
Tripscore - DataMad2017 (Madrid)Tripscore - DataMad2017 (Madrid)
Tripscore - DataMad2017 (Madrid)David Chaves-Fraga
 
TransportDCAT-AP and PhD Thesis at Civic Lab Brussels
TransportDCAT-AP and PhD Thesis at Civic Lab BrusselsTransportDCAT-AP and PhD Thesis at Civic Lab Brussels
TransportDCAT-AP and PhD Thesis at Civic Lab BrusselsDavid Chaves-Fraga
 
Semantic interoperability in Transport Domain
Semantic interoperability in Transport DomainSemantic interoperability in Transport Domain
Semantic interoperability in Transport DomainDavid Chaves-Fraga
 

Plus de David Chaves-Fraga (6)

Enlazando datos abiertos en el transporte público
Enlazando datos abiertos en el transporte públicoEnlazando datos abiertos en el transporte público
Enlazando datos abiertos en el transporte público
 
Open Summer of Code Spain 2018
Open Summer of Code Spain 2018Open Summer of Code Spain 2018
Open Summer of Code Spain 2018
 
The tripscore Linked Data client: calculating specific summaries over large t...
The tripscore Linked Data client: calculating specific summaries over large t...The tripscore Linked Data client: calculating specific summaries over large t...
The tripscore Linked Data client: calculating specific summaries over large t...
 
Tripscore - DataMad2017 (Madrid)
Tripscore - DataMad2017 (Madrid)Tripscore - DataMad2017 (Madrid)
Tripscore - DataMad2017 (Madrid)
 
TransportDCAT-AP and PhD Thesis at Civic Lab Brussels
TransportDCAT-AP and PhD Thesis at Civic Lab BrusselsTransportDCAT-AP and PhD Thesis at Civic Lab Brussels
TransportDCAT-AP and PhD Thesis at Civic Lab Brussels
 
Semantic interoperability in Transport Domain
Semantic interoperability in Transport DomainSemantic interoperability in Transport Domain
Semantic interoperability in Transport Domain
 

Dernier

Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 

Dernier (20)

Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 

On the Role of the GRAPH Clause in the Performance of Federated SPARQL Queries

  • 1. David Chaves-Fraga, Ontology Engineering Group Universidad Politécnica de Madrid, Spain Claudio Gutierrez, CIWS & Universidad de Chile Oscar Corcho, OEG - UPM On the Role of the GRAPH Clause in the Performance of Federated SPARQL Queries dchaves@email @dchavesf 22/10/2017 PROFILES17
  • 2. On the Role of GRAPH Clause in the Performance of Federated SPARQL queries Federated Query and SPARQL endpoint 2 Improve federated query evaluation by systematically using the GRAPH clause to rewrite a query into a semantically equivalent one G0 G1 G2 G5 G3 G4 <URI> <URI> <URI> <URI> <URI> SPARQL endpoint select * where { service <s1> {GP1} service <s2> {GP2} service <s3> {GP3} } Federated Query
  • 3. On the Role of GRAPH Clause in the Performance of Federated SPARQL queries Motivation 3 •Many graph patterns bind results from only one of the graphs in the SPARQL endpoints •The server can have more information and optimize the retrieval
  • 4. On the Role of GRAPH Clause in the Performance of Federated SPARQL queries Reducing the space of a federated SPARQL query 4 •Hypothesis: This method adds efficiency in the evaluation of the query •Assumption: The identification of the GP select * where { service <s1> {GP1} service <s2> {GP2} service <s3> {GP3} } select * from named <s1-g2>, <s2-g3>, <s3-g1>{ service <s1> {graph <s1-g2> {GP1}} service <s2> {graph <s2-g3> {GP2}} service <s3> {graph <s3-g1> {GP3}} }
  • 5. On the Role of GRAPH Clause in the Performance of Federated SPARQL queries Queries and experiment 5 •We use the real public instances of SPARQL endpoints of: • DBpedia • KB3city • Linkedmdb •5 queries - two different approaches (with/without GRAPH clause): • 5 executions per approach with the disabled cache. •Queries available at: https://github.com/dachafra/federated-graph-queries Problems: •There is not exist any benchmarking for federated SPARQL queries that includes SERVICE and GRAPH clauses. •Deal with availability and real performance of the SPARQL endpoints
  • 6. On the Role of GRAPH Clause in the Performance of Federated SPARQL queries Preliminary results 6 Q1 Q2 Q3 Q4 Q5 GRAPH 13,51 3,22 1,98 1,73 8,56 SERVICE 14,63 4,36 2,88 1,69 9,12 •5%-10% improvement •Q2-Q3-Q4: kb.3city.com SPARQL endpoint • 73 graphs • Almost 5 millions of triples •Q1-Q5: dbpedia.com SPARQL endpoint: • 19 graphs • 285 millions of triples
  • 7. On the Role of GRAPH Clause in the Performance of Federated SPARQL queries Preliminary results 6 Q1 Q2 Q3 Q4 Q5 GRAPH 13,51 3,22 1,98 1,73 8,56 SERVICE 14,63 4,36 2,88 1,69 9,12 •5%-10% improvement •Q2-Q3-Q4: kb.3city.com SPARQL endpoint • 73 graphs • Almost 5 millions of triples •Q1-Q5: dbpedia.com SPARQL endpoint: • 19 graphs • 285 millions of triples Virtuoso puts the triples ina single table
  • 8. On the Role of GRAPH Clause in the Performance of Federated SPARQL queries Conclusions: The role of the GRAPH clause 7 Conclusions about GRAPH clause: 1. is semantically ambiguous 2. offers a view of how the data is conceptually ordered but not physically 3. is a SPARQL clause that seems to not allow any type of optimization (at the moment) DBpedia graphs and triples
  • 9. On the Role of GRAPH Clause in the Performance of Federated SPARQL queries Future work 8 •Create a Benchmarking for federated queries with SERVICE and GRAPH •Apply the approach to other triple stores •Carry out semantic equivalence between open data portal and a SPARQL endpoint •Link unique graph patterns with their pertained graphs •Analyze why Virtuoso and Apache Jena can execute the approach
  • 10. David Chaves-Fraga, Ontology Engineering Group Universidad Politécnica de Madrid, Spain Claudio Gutierrez, CIWS & Universidad de Chile Oscar Corcho, OEG - UPM On the Role of the GRAPH Clause in the Performance of Federated SPARQL Queries dchaves@email @dchavesf 22/10/2017 PROFILES17