SlideShare a Scribd company logo
1 of 28
Download to read offline
YABench:
A Comprehensive Framework for RDF
Stream Processor Correctness and
Performance Assessment
Maxim Kolchin, Peter Wetz, Elmar Kiesling, A Min Tjoa
ITMO University, Russia | TU Wien, Austria
The 16th International Conference on Web Engineering 2016, Lugano, Switzerland
RDF Stream Processing (RSP)
RDF Stream - a potentially infinite sequence of time-varying
data elements encoded in RDF
Continuous query - a query registered over streams that in
most cases are observed through windows
Query results - similarly to SPARQL they can be tuples, RDF
dataset or a new RDF Stream
2
State of the art
■ LSBench (2012)
■ SRBench (2012)
■ CSRBench (2013)
■ CityBench (2015)
Details can be found at W3C RSP Community Group’s Wiki: https://www.w3.
org/community/rsp/wiki/RSP_Benchmarking
3
Our contribution
■ We propose a benchmarking framework for RDF Stream Processing
engines that focuses on correctness and performance
○ Stream generator (generates configurable RDF stream)
○ Oracle (validates correctness of the results)
○ Runner (measures performance of an RSP engine)
■ We run a benchmark with the window-based RDF stream processing
engines:
○ C-SPARQL
○ CQELS
4
Requirements
■ Scalable and configurable input
■ Comprehensive correctness checking
■ Flexible queries
■ Reproducibility
5
Architecture
1. Define tests,
2. Generate data streams,
3. Run the tests with a given
engine,
a. Performance metrics
are collected in a
separate process,
4. At the end validate the
results with the oracle.
6
Architecture: Reporting tool
7
Validation against CSRBench
We validated the correctness checking functionality of YABench by
reproducing the CSRBench* benchmark.
CSRBench defines 7 queries for C-SPARQL, CQELS and SPARQLstream
engines.
Datasets, test configurations and results are available online: github.
com/YABench/csrbench-validation
*Daniele Dell’Aglio, et al. “On Correctness in RDF Stream Processor Benchmarking”, 2013
8
Validation against CSRBench (C-SPARQL)
Query
C-SPARQL
CSRBench YABench
Q1 ✓ ✓
Q2 ✓ ✓
Q3 ✓ ✓*
Q4 ✓ ✓
Q5 ✗ ✗
Q6 ✓ ✓*
Q7 ✓ ✓*
* - the results are the same, but
because of timing discrepancies
some results sometimes present in
the subsequent window
9
Validation against CSRBench (CQELS)
Query
CQELS
CSRBench YABench
Q1 ✓ ✓
Q2 ✓ ✓
Q3 ✓ ✓
Q4 ✗ ✗
Q5 ✓ ✓
Q6 ✗ ✗
Q7 ✗ ✗**
** - indicates that the query did not
execute successfully on the CQELS
engine. The engine crashed before
returning the query results
10
Benchmark
We reuse queries introduced by CSRBench, but we’re able to parametrize them, e.
g. window size, window slide, filter values, etc.
Measure:
- Precision and recall,
- Window and result size, and delay,
- Memory and CPU usage, # of threads
We run each test 10 times, to compute the distribution of precision/recall.
Detailed results are available online: github.com/YABench/yabench-one
11
Benchmark: Data Stream Model
A data stream is generated
based on:
■ Number of weather
stations,
■ Time interval between
two observations of a
single station,
■ Duration of the stream,
■ A seed for the
randomize function
12
Benchmark: Queries
Experiment 1: SELECT + FILTER
Experiment 2: SELECT + AVG + FILTER
Experiment 3: joining of triples from different timestamps
Experiment 4: demonstrates the use of gracious mode which implemented by the
oracle to eliminate the timing discrepancy issues of the engines
13
Experiment 1 (precision/recall): 50 stations
14
Experiment 1 (precision/recall): 1000 stations
15
Experiment 1 (precision/recall): 10000 stations
16
Experiment 1 (memory usage): 50 stations
17
Experiment 1 (memory usage): 1000 stations
18
Experiment 1 (memory usage): 10000 stations
19
Experiment 1 (delay): 50 stations
20
Experiment 1 (delay): 1000 stations
21
Experiment 1 (delay): 10000 stations
22
Experiment 1 (C-SPARQL): delay vs result size
23
Architecture: Gracious mode
In this mode the oracle tries to adjusts its window scope to match the scope of an
actual window, by moving the left and right borders to back and/or forth while the
precision and recall grows.
It allows to:
(a) confirm our assumption on why precision and recall are low,
(b) reconstruct and visualize the actual window borders
24
Experiment 4: gracious vs non-gracious modes
(a) In non-gracious (default) mode (b) In gracious mode
C-SPARQL
25
Experiment 4: gracious vs non-gracious modes
(a) In non-gracious (default) mode (b) In gracious mode
CQELS
26
Conclusion
■ We build a framework for benchmarking RSP engines which allows to assess
their correctness and performance
■ We run a benchmark which revealed some insides:
○ CQELS shows better precision/recall for simple queries,
○ C-SPARQL is slightly more memory efficient than CQELS,
○ C-SPARQL outperformes CQELS in terms of delay for more complex queries, which is mainly
caused by a different reporting strategy
■ By introducing gracious mode we’re able to estimate the extent of the timing
discrepancy
27
Thank you!
github.com/YABench

More Related Content

What's hot

State Management in Apache Flink : Consistent Stateful Distributed Stream Pro...
State Management in Apache Flink : Consistent Stateful Distributed Stream Pro...State Management in Apache Flink : Consistent Stateful Distributed Stream Pro...
State Management in Apache Flink : Consistent Stateful Distributed Stream Pro...Paris Carbone
 
Large-Margin Multiple Kernel Learning for Discriminative Features Selection a...
Large-Margin Multiple Kernel Learning for Discriminative Features Selection a...Large-Margin Multiple Kernel Learning for Discriminative Features Selection a...
Large-Margin Multiple Kernel Learning for Discriminative Features Selection a...babak hosseini
 
Christian Moscardi Presentation
Christian Moscardi PresentationChristian Moscardi Presentation
Christian Moscardi PresentationJoseph Chow
 
Poster Version 1.7
Poster Version 1.7Poster Version 1.7
Poster Version 1.7Dewey Potts
 
Apache Flink Training Workshop @ HadoopCon2016 - #1 System Overview
Apache Flink Training Workshop @ HadoopCon2016 - #1 System OverviewApache Flink Training Workshop @ HadoopCon2016 - #1 System Overview
Apache Flink Training Workshop @ HadoopCon2016 - #1 System OverviewApache Flink Taiwan User Group
 

What's hot (6)

State Management in Apache Flink : Consistent Stateful Distributed Stream Pro...
State Management in Apache Flink : Consistent Stateful Distributed Stream Pro...State Management in Apache Flink : Consistent Stateful Distributed Stream Pro...
State Management in Apache Flink : Consistent Stateful Distributed Stream Pro...
 
Large-Margin Multiple Kernel Learning for Discriminative Features Selection a...
Large-Margin Multiple Kernel Learning for Discriminative Features Selection a...Large-Margin Multiple Kernel Learning for Discriminative Features Selection a...
Large-Margin Multiple Kernel Learning for Discriminative Features Selection a...
 
Christian Moscardi Presentation
Christian Moscardi PresentationChristian Moscardi Presentation
Christian Moscardi Presentation
 
0xdata H2O Podcast
0xdata H2O Podcast0xdata H2O Podcast
0xdata H2O Podcast
 
Poster Version 1.7
Poster Version 1.7Poster Version 1.7
Poster Version 1.7
 
Apache Flink Training Workshop @ HadoopCon2016 - #1 System Overview
Apache Flink Training Workshop @ HadoopCon2016 - #1 System OverviewApache Flink Training Workshop @ HadoopCon2016 - #1 System Overview
Apache Flink Training Workshop @ HadoopCon2016 - #1 System Overview
 

Viewers also liked

NICE Master Class final presentation 25 11 14 (including workshops)
NICE Master Class final presentation 25 11 14 (including workshops)NICE Master Class final presentation 25 11 14 (including workshops)
NICE Master Class final presentation 25 11 14 (including workshops)NEQOS
 
semantic and social (intra)webs
semantic and social (intra)webssemantic and social (intra)webs
semantic and social (intra)websFabien Gandon
 
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...Oscar Corcho
 
Nice Tag Ontology : modeling tags as RDF named graphs
Nice Tag Ontology : modeling tags as RDF named graphsNice Tag Ontology : modeling tags as RDF named graphs
Nice Tag Ontology : modeling tags as RDF named graphsFabien Gandon
 
Speech Technology and Big Data
Speech Technology and Big DataSpeech Technology and Big Data
Speech Technology and Big DataEUCLID project
 
L avenir du web au prisme de la resource
L avenir du web au prisme de la resourceL avenir du web au prisme de la resource
L avenir du web au prisme de la resourceFabien Gandon
 
Data Science Curriculum for Professionals
Data Science Curriculum for ProfessionalsData Science Curriculum for Professionals
Data Science Curriculum for ProfessionalsEUCLID project
 

Viewers also liked (8)

NICE Master Class final presentation 25 11 14 (including workshops)
NICE Master Class final presentation 25 11 14 (including workshops)NICE Master Class final presentation 25 11 14 (including workshops)
NICE Master Class final presentation 25 11 14 (including workshops)
 
semantic and social (intra)webs
semantic and social (intra)webssemantic and social (intra)webs
semantic and social (intra)webs
 
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
 
Nice Tag Ontology : modeling tags as RDF named graphs
Nice Tag Ontology : modeling tags as RDF named graphsNice Tag Ontology : modeling tags as RDF named graphs
Nice Tag Ontology : modeling tags as RDF named graphs
 
Speech Technology and Big Data
Speech Technology and Big DataSpeech Technology and Big Data
Speech Technology and Big Data
 
Essai Takahashi
Essai TakahashiEssai Takahashi
Essai Takahashi
 
L avenir du web au prisme de la resource
L avenir du web au prisme de la resourceL avenir du web au prisme de la resource
L avenir du web au prisme de la resource
 
Data Science Curriculum for Professionals
Data Science Curriculum for ProfessionalsData Science Curriculum for Professionals
Data Science Curriculum for Professionals
 

Similar to YABench: A Comprehensive Framework for RDF Stream Processor Correctness and Performance Assessment

On the need for applications aware adaptive middleware in real-time RDF data ...
On the need for applications aware adaptive middleware in real-time RDF data ...On the need for applications aware adaptive middleware in real-time RDF data ...
On the need for applications aware adaptive middleware in real-time RDF data ...Zia Ush Shamszaman
 
Clipper: A Low-Latency Online Prediction Serving System
Clipper: A Low-Latency Online Prediction Serving SystemClipper: A Low-Latency Online Prediction Serving System
Clipper: A Low-Latency Online Prediction Serving SystemDatabricks
 
On correctness in RDF stream processor benchmarking
On correctness in RDF stream processor benchmarkingOn correctness in RDF stream processor benchmarking
On correctness in RDF stream processor benchmarkingDaniele Dell'Aglio
 
Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...
Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...
Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...Databricks
 
GTC16 - S6410 - Comparing OpenACC 2.5 and OpenMP 4.5
GTC16 - S6410 - Comparing OpenACC 2.5 and OpenMP 4.5GTC16 - S6410 - Comparing OpenACC 2.5 and OpenMP 4.5
GTC16 - S6410 - Comparing OpenACC 2.5 and OpenMP 4.5Jeff Larkin
 
FlinkML: Large Scale Machine Learning with Apache Flink
FlinkML: Large Scale Machine Learning with Apache FlinkFlinkML: Large Scale Machine Learning with Apache Flink
FlinkML: Large Scale Machine Learning with Apache FlinkTheodoros Vasiloudis
 
AHF_IDETC_2011_Jie
AHF_IDETC_2011_JieAHF_IDETC_2011_Jie
AHF_IDETC_2011_JieMDO_Lab
 
Clipper at UC Berkeley RISECamp 2017
Clipper at UC Berkeley RISECamp 2017Clipper at UC Berkeley RISECamp 2017
Clipper at UC Berkeley RISECamp 2017Dan Crankshaw
 
Interface for Performance Environment Autoconfiguration Framework
Interface for Performance Environment Autoconfiguration FrameworkInterface for Performance Environment Autoconfiguration Framework
Interface for Performance Environment Autoconfiguration FrameworkLiang Men
 
OpenACC Monthly Highlights: September 2021
OpenACC Monthly Highlights: September 2021OpenACC Monthly Highlights: September 2021
OpenACC Monthly Highlights: September 2021OpenACC
 
Big Data, Bigger Analytics
Big Data, Bigger AnalyticsBig Data, Bigger Analytics
Big Data, Bigger AnalyticsItzhak Kameli
 
AskTom: How to Make and Test Your Application "Oracle RAC Ready"?
AskTom: How to Make and Test Your Application "Oracle RAC Ready"?AskTom: How to Make and Test Your Application "Oracle RAC Ready"?
AskTom: How to Make and Test Your Application "Oracle RAC Ready"?Markus Michalewicz
 
Analytics DB Benchmark
Analytics DB BenchmarkAnalytics DB Benchmark
Analytics DB BenchmarkDenny Marton
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
Optimize Single Particle Orbital (SPO) Evaluations Based on B-splines
Optimize Single Particle Orbital (SPO) Evaluations Based on B-splinesOptimize Single Particle Orbital (SPO) Evaluations Based on B-splines
Optimize Single Particle Orbital (SPO) Evaluations Based on B-splinesIntel® Software
 
Technical_Report_on_ML_Library
Technical_Report_on_ML_LibraryTechnical_Report_on_ML_Library
Technical_Report_on_ML_LibrarySaurabh Chauhan
 

Similar to YABench: A Comprehensive Framework for RDF Stream Processor Correctness and Performance Assessment (20)

On the need for applications aware adaptive middleware in real-time RDF data ...
On the need for applications aware adaptive middleware in real-time RDF data ...On the need for applications aware adaptive middleware in real-time RDF data ...
On the need for applications aware adaptive middleware in real-time RDF data ...
 
Clipper: A Low-Latency Online Prediction Serving System
Clipper: A Low-Latency Online Prediction Serving SystemClipper: A Low-Latency Online Prediction Serving System
Clipper: A Low-Latency Online Prediction Serving System
 
On correctness in RDF stream processor benchmarking
On correctness in RDF stream processor benchmarkingOn correctness in RDF stream processor benchmarking
On correctness in RDF stream processor benchmarking
 
System mldl meetup
System mldl meetupSystem mldl meetup
System mldl meetup
 
Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...
Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...
Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...
 
GTC16 - S6410 - Comparing OpenACC 2.5 and OpenMP 4.5
GTC16 - S6410 - Comparing OpenACC 2.5 and OpenMP 4.5GTC16 - S6410 - Comparing OpenACC 2.5 and OpenMP 4.5
GTC16 - S6410 - Comparing OpenACC 2.5 and OpenMP 4.5
 
FlinkML: Large Scale Machine Learning with Apache Flink
FlinkML: Large Scale Machine Learning with Apache FlinkFlinkML: Large Scale Machine Learning with Apache Flink
FlinkML: Large Scale Machine Learning with Apache Flink
 
AHF_IDETC_2011_Jie
AHF_IDETC_2011_JieAHF_IDETC_2011_Jie
AHF_IDETC_2011_Jie
 
Clipper at UC Berkeley RISECamp 2017
Clipper at UC Berkeley RISECamp 2017Clipper at UC Berkeley RISECamp 2017
Clipper at UC Berkeley RISECamp 2017
 
Redux vs GraphQL
Redux vs GraphQLRedux vs GraphQL
Redux vs GraphQL
 
Interface for Performance Environment Autoconfiguration Framework
Interface for Performance Environment Autoconfiguration FrameworkInterface for Performance Environment Autoconfiguration Framework
Interface for Performance Environment Autoconfiguration Framework
 
OpenACC Monthly Highlights: September 2021
OpenACC Monthly Highlights: September 2021OpenACC Monthly Highlights: September 2021
OpenACC Monthly Highlights: September 2021
 
Big Data, Bigger Analytics
Big Data, Bigger AnalyticsBig Data, Bigger Analytics
Big Data, Bigger Analytics
 
AskTom: How to Make and Test Your Application "Oracle RAC Ready"?
AskTom: How to Make and Test Your Application "Oracle RAC Ready"?AskTom: How to Make and Test Your Application "Oracle RAC Ready"?
AskTom: How to Make and Test Your Application "Oracle RAC Ready"?
 
Analytics DB Benchmark
Analytics DB BenchmarkAnalytics DB Benchmark
Analytics DB Benchmark
 
Ikc 2015
Ikc 2015Ikc 2015
Ikc 2015
 
Smallsat 2021
Smallsat 2021Smallsat 2021
Smallsat 2021
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
Optimize Single Particle Orbital (SPO) Evaluations Based on B-splines
Optimize Single Particle Orbital (SPO) Evaluations Based on B-splinesOptimize Single Particle Orbital (SPO) Evaluations Based on B-splines
Optimize Single Particle Orbital (SPO) Evaluations Based on B-splines
 
Technical_Report_on_ML_Library
Technical_Report_on_ML_LibraryTechnical_Report_on_ML_Library
Technical_Report_on_ML_Library
 

Recently uploaded

Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Caco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionCaco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionPriyansha Singh
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Types of different blotting techniques.pptx
Types of different blotting techniques.pptxTypes of different blotting techniques.pptx
Types of different blotting techniques.pptxkhadijarafiq2012
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 

Recently uploaded (20)

Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Caco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionCaco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorption
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Types of different blotting techniques.pptx
Types of different blotting techniques.pptxTypes of different blotting techniques.pptx
Types of different blotting techniques.pptx
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 

YABench: A Comprehensive Framework for RDF Stream Processor Correctness and Performance Assessment

  • 1. YABench: A Comprehensive Framework for RDF Stream Processor Correctness and Performance Assessment Maxim Kolchin, Peter Wetz, Elmar Kiesling, A Min Tjoa ITMO University, Russia | TU Wien, Austria The 16th International Conference on Web Engineering 2016, Lugano, Switzerland
  • 2. RDF Stream Processing (RSP) RDF Stream - a potentially infinite sequence of time-varying data elements encoded in RDF Continuous query - a query registered over streams that in most cases are observed through windows Query results - similarly to SPARQL they can be tuples, RDF dataset or a new RDF Stream 2
  • 3. State of the art ■ LSBench (2012) ■ SRBench (2012) ■ CSRBench (2013) ■ CityBench (2015) Details can be found at W3C RSP Community Group’s Wiki: https://www.w3. org/community/rsp/wiki/RSP_Benchmarking 3
  • 4. Our contribution ■ We propose a benchmarking framework for RDF Stream Processing engines that focuses on correctness and performance ○ Stream generator (generates configurable RDF stream) ○ Oracle (validates correctness of the results) ○ Runner (measures performance of an RSP engine) ■ We run a benchmark with the window-based RDF stream processing engines: ○ C-SPARQL ○ CQELS 4
  • 5. Requirements ■ Scalable and configurable input ■ Comprehensive correctness checking ■ Flexible queries ■ Reproducibility 5
  • 6. Architecture 1. Define tests, 2. Generate data streams, 3. Run the tests with a given engine, a. Performance metrics are collected in a separate process, 4. At the end validate the results with the oracle. 6
  • 8. Validation against CSRBench We validated the correctness checking functionality of YABench by reproducing the CSRBench* benchmark. CSRBench defines 7 queries for C-SPARQL, CQELS and SPARQLstream engines. Datasets, test configurations and results are available online: github. com/YABench/csrbench-validation *Daniele Dell’Aglio, et al. “On Correctness in RDF Stream Processor Benchmarking”, 2013 8
  • 9. Validation against CSRBench (C-SPARQL) Query C-SPARQL CSRBench YABench Q1 ✓ ✓ Q2 ✓ ✓ Q3 ✓ ✓* Q4 ✓ ✓ Q5 ✗ ✗ Q6 ✓ ✓* Q7 ✓ ✓* * - the results are the same, but because of timing discrepancies some results sometimes present in the subsequent window 9
  • 10. Validation against CSRBench (CQELS) Query CQELS CSRBench YABench Q1 ✓ ✓ Q2 ✓ ✓ Q3 ✓ ✓ Q4 ✗ ✗ Q5 ✓ ✓ Q6 ✗ ✗ Q7 ✗ ✗** ** - indicates that the query did not execute successfully on the CQELS engine. The engine crashed before returning the query results 10
  • 11. Benchmark We reuse queries introduced by CSRBench, but we’re able to parametrize them, e. g. window size, window slide, filter values, etc. Measure: - Precision and recall, - Window and result size, and delay, - Memory and CPU usage, # of threads We run each test 10 times, to compute the distribution of precision/recall. Detailed results are available online: github.com/YABench/yabench-one 11
  • 12. Benchmark: Data Stream Model A data stream is generated based on: ■ Number of weather stations, ■ Time interval between two observations of a single station, ■ Duration of the stream, ■ A seed for the randomize function 12
  • 13. Benchmark: Queries Experiment 1: SELECT + FILTER Experiment 2: SELECT + AVG + FILTER Experiment 3: joining of triples from different timestamps Experiment 4: demonstrates the use of gracious mode which implemented by the oracle to eliminate the timing discrepancy issues of the engines 13
  • 16. Experiment 1 (precision/recall): 10000 stations 16
  • 17. Experiment 1 (memory usage): 50 stations 17
  • 18. Experiment 1 (memory usage): 1000 stations 18
  • 19. Experiment 1 (memory usage): 10000 stations 19
  • 20. Experiment 1 (delay): 50 stations 20
  • 21. Experiment 1 (delay): 1000 stations 21
  • 22. Experiment 1 (delay): 10000 stations 22
  • 23. Experiment 1 (C-SPARQL): delay vs result size 23
  • 24. Architecture: Gracious mode In this mode the oracle tries to adjusts its window scope to match the scope of an actual window, by moving the left and right borders to back and/or forth while the precision and recall grows. It allows to: (a) confirm our assumption on why precision and recall are low, (b) reconstruct and visualize the actual window borders 24
  • 25. Experiment 4: gracious vs non-gracious modes (a) In non-gracious (default) mode (b) In gracious mode C-SPARQL 25
  • 26. Experiment 4: gracious vs non-gracious modes (a) In non-gracious (default) mode (b) In gracious mode CQELS 26
  • 27. Conclusion ■ We build a framework for benchmarking RSP engines which allows to assess their correctness and performance ■ We run a benchmark which revealed some insides: ○ CQELS shows better precision/recall for simple queries, ○ C-SPARQL is slightly more memory efficient than CQELS, ○ C-SPARQL outperformes CQELS in terms of delay for more complex queries, which is mainly caused by a different reporting strategy ■ By introducing gracious mode we’re able to estimate the extent of the timing discrepancy 27