SlideShare une entreprise Scribd logo
1  sur  38
Optimized Index Structures
for Querying RDF from the
Web
Presented by : Mahdi Atawna
1
About the Paper
 The paper was published at the Third Latin
American Web Congress in 2005
 Have 56 citation.
Andreas Harth
National University
of Galway, Ireland
Prof. Stefan Decker
National University of
Galway, Ireland
2
Outline
 Overview of Semantic Web.
 Overview of Indexes.
 Paper motivation.
 Methodology.
 Experiment & Result.
 Conclusion.
3
Semantic Web
 Also called :
 Web 3.0.
 the Linked Data Web.
 the Web of Data…whatever you call it.
 the next major evolution in connecting information.
4
Why semantic web?
 It enables data to be linked from a source to any
other source.
 It can be understood by computers so that they can
perform increasingly sophisticated tasks on our
behalf.
5
6
Source: http://lod-cloud.net
Semantic Web Standards
 RDF (Resource Description Framework): The data modeling
language for the Semantic Web (like UML). All Semantic Web
information is stored and represented in the RDF.
 SPARQL : The query language of the Semantic Web.
 OWL (Web Ontology Language) The schema language, or
knowledge representation (KR) language, of the Semantic Web.
7
What is RDF?
 RDF is the data model of the Semantic Web.
 That means that all data in Semantic Web
technologies is represented as RDF.
 If you store Semantic Web data, it's in RDF.
 If you query Semantic Web data (typically using
SPARQL), it's RDF data. If you send Semantic Web
data to your friend, it's RDF.
8
9
10
Source : http://www.w3.org/TR/rdf11-primer/
RDF triples
 are representations of graph edges.
11
Subject Object
Predicate
Mahdi Hebron
born in
RDF example
@prefix foaf: <http://xmlns.com/foaf/0.1/ .
<http://example.org/bob#me>
foaf:topic_interest
<http://wikidata.org/entity/Q12418> .
12
RDF example
@prefix foaf: <http://xmlns.com/foaf/0.1/ .
<http://example.org/bob#me>
foaf:topic_interest
<http://wikidata.org/entity/Q12418> .
Subject
predicate
Object
13
Sparql query language
SELECT ?p ?o
{
<http://nasa.dataincubator.org/spacecraft/1968-089A> ?p ?o
}
14
What is database index?
 A database index is a data structure that improves the
speed of data retrieval operations on a database table at
the cost of additional writes and storage space to maintain
the index data structure.
 Index goal : The index structure enables fast retrieval of
data
15
16
Index example
key articles
Leonardo [104,70,12,98]
Mona Lisa [2,201,7,20,12]
Francesco [1,8,900,104]
17
Paper motivation
 Previous Systems provide a storage infrastructure for RDF data, but
index structure which do not support typical query scenarios
for data from the Web which results in poor query answering
performance in some cases.
18
Methodology
 The researchers present a new index structure that handle the
data from the Web .
 Implemented the index structure in a lightweight software called
YARS
19
RDF Index structures
 The authors suggested an index structure that contains two
sets:
1. Lexicon : covers the string representation of RDF graph
(r,l,b)
2. Quad indexes : cover the quads (triples).
20
1. Lexicon indexes
 NodeOID and OIDNode Index
 Keyword Index
21
1. Lexicon indexes
 NodeOID and OIDNode Index :
22
Key value
<http://www.harth.org/andreas/#me> 3
<http://decker.cn/stefan/> 14
<http://sw.deri.org/ aharth/foaf.rdf> 11
<http://www.deri.org/> 1
1. Lexicon indexes
 Keyword Index (Popular in search engines)
23
Key No of hits List of hits
“Andreas” 1 3
“Decker” 1 11
”Harth” 1 3
“Stefan” 2 11,13
2. Quad indexes
 Access Patterns
 Combined Indexes
 Occurrence Counts
24
2. Quad indexes
A- Access Patterns
25
No Access pattern No Access pattern
1 (?:?:?:?) 9 (s:?:o:c)
2 (s:?:?:?) 10 (?:?:o:c)
3 (s:p:?:?) 11 (?:?:o:?)
4 (s:p:o:?) 12 (?:?:?:c)
5 (s:p:o:c) 13 (s:?:?:c)
6 (?:p:?:?) 14 (s:p:?:c)
7 (?:p:o:?) 15 (?:p:?:c)
8 (?:p:o:c) 16 (s:?:o:?)
2. Quad indexes
A- Access Patterns
26
No Access pattern values
1 (?:?:?:?) [1,5,3]
2 (s:?:?:?) [2]
3 (s:p:?:?) [9,8,2,3]
4 (s:p:o:?) [1,3]
5 (s:p:o:c) [1]
6 (?:p:?:?) [76,9]
7 (?:p:o:?) [2,3]
2. Quad indexes
B- Combined Indexes
27
spoc poc osc csp cp os
(?:?:?:?) (?:p:?:?) (?:?:o:?)
(?:?:?:c) (?:p:?:c) (s:?:o:?)
(s:?:?:?) (?:p:o:?) (?:?:o:c)
(s:?:?:c)
(s:p:?:?) (?:p:o:c) (s:?:o:c)
(s:p:?:c)
(s:p:o:?)
(s:p:o:c)
2. Quad indexes
C- Occurrence Counts
28
No
Access
pattern
values count
1 (?:?:?:?) [1,5,3] 3
2 (s:?:?:?) [2] 1
3 (s:p:?:?) [9,8,2,3] 4
4 (s:p:o:?) [1,3] 2
5 (s:p:o:c) [1] 1
6 (?:p:?:?) [76,9] 2
7 (?:p:o:?) [2,3] 2
YARS
 Web application that built in JAVA.
 Has two parts:
 a storage component that handles both persistent and in-
memory indexes.
 a query handler to perform query processing and evaluation.
29
Experiment
 They evaluated the performance based on a dataset of 2.8
million triples (293 MB).
 The testing server has :
 Pentium-4 2.4 GHz
 4 GB RAM
 running Debian Sarge .
30
Experiment
They considered the following RDF stores for evaluation:
 Sesame.
 Kowari (failed to get a running version).
 Redland.
 Jena2. ([9] shows that Sesame generally supersedes Jena in
performance results)
31
Experiment
Experiment parts:
 Index Construction.
 Queries.
32
Result – index construction
System Index size (bytes)
Redland 2.164.019.200
Sesame MySQL 340.381.636
Sesame native 39.997.992
YARS 1.090.002.944
33
Table 8: Index size for the synthetic Univ20 dataset.
Result – index construction
34
Result - queries
No Query
1 ?x rdf:type univ:UndergraduateStudent
2 ?x ?p ”UndergraduateStudent0”
3 <http://www.University965.edu> ?p ?o
4 ?x univ:worksFor ?y
35
Query Redland Sesame MySQL Sesame Native YARS
1 0:10.48 0:18.87 1:05.16 0:18.41
2 0:44.14 0:00.73 0:00.55 0:00.49
3 0:44.15 0:00.46 0:00.47 0:00.32
4 3:04.21 0:03.42 0:01.95 0:00.47
Performance results for quad queries.
Conclusion
 The auther introduced query processing for RDF which is an I
portant issue in sematic web.
 YARS has some overhead for resolving the dependencies and
order in comparison with others.
36
Criticism
 - In experiment , the researchers removed “Kowari “ engine
because the cannot install it.
37
Questions?
Thank you
38

Contenu connexe

Tendances

14. Files - Data Structures using C++ by Varsha Patil
14. Files - Data Structures using C++ by Varsha Patil14. Files - Data Structures using C++ by Varsha Patil
14. Files - Data Structures using C++ by Varsha Patilwidespreadpromotion
 
6. Linked list - Data Structures using C++ by Varsha Patil
6. Linked list - Data Structures using C++ by Varsha Patil6. Linked list - Data Structures using C++ by Varsha Patil
6. Linked list - Data Structures using C++ by Varsha Patilwidespreadpromotion
 
3. Stack - Data Structures using C++ by Varsha Patil
3. Stack - Data Structures using C++ by Varsha Patil3. Stack - Data Structures using C++ by Varsha Patil
3. Stack - Data Structures using C++ by Varsha Patilwidespreadpromotion
 
LDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataLDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataOlaf Hartig
 
5. Queue - Data Structures using C++ by Varsha Patil
5. Queue - Data Structures using C++ by Varsha Patil5. Queue - Data Structures using C++ by Varsha Patil
5. Queue - Data Structures using C++ by Varsha Patilwidespreadpromotion
 
R Programming Language
R Programming LanguageR Programming Language
R Programming LanguageNareshKarela1
 
R programming groundup-basic-section-i
R programming groundup-basic-section-iR programming groundup-basic-section-i
R programming groundup-basic-section-iDr. Awase Khirni Syed
 
Introduction to the language R
Introduction to the language RIntroduction to the language R
Introduction to the language Rfbenault
 
Rdf conjunctive query selectivity estimation
Rdf conjunctive query selectivity estimationRdf conjunctive query selectivity estimation
Rdf conjunctive query selectivity estimationINRIA-OAK
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Olaf Hartig
 
final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)Ankit Rathi
 
R programming Fundamentals
R programming  FundamentalsR programming  Fundamentals
R programming FundamentalsRagia Ibrahim
 
Positional Data Organization and Compression in Web Inverted Indexes
Positional Data Organization and Compression in Web Inverted IndexesPositional Data Organization and Compression in Web Inverted Indexes
Positional Data Organization and Compression in Web Inverted IndexesLeonidas Akritidis
 
Introduction to Data Mining with R and Data Import/Export in R
Introduction to Data Mining with R and Data Import/Export in RIntroduction to Data Mining with R and Data Import/Export in R
Introduction to Data Mining with R and Data Import/Export in RYanchang Zhao
 
Why R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics PlatformWhy R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics PlatformSyracuse University
 
R programming Language , Rahul Singh
R programming Language , Rahul SinghR programming Language , Rahul Singh
R programming Language , Rahul SinghRavi Basil
 

Tendances (20)

14. Files - Data Structures using C++ by Varsha Patil
14. Files - Data Structures using C++ by Varsha Patil14. Files - Data Structures using C++ by Varsha Patil
14. Files - Data Structures using C++ by Varsha Patil
 
6. Linked list - Data Structures using C++ by Varsha Patil
6. Linked list - Data Structures using C++ by Varsha Patil6. Linked list - Data Structures using C++ by Varsha Patil
6. Linked list - Data Structures using C++ by Varsha Patil
 
3. Stack - Data Structures using C++ by Varsha Patil
3. Stack - Data Structures using C++ by Varsha Patil3. Stack - Data Structures using C++ by Varsha Patil
3. Stack - Data Structures using C++ by Varsha Patil
 
LDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataLDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked Data
 
Getting Started with R
Getting Started with RGetting Started with R
Getting Started with R
 
5. Queue - Data Structures using C++ by Varsha Patil
5. Queue - Data Structures using C++ by Varsha Patil5. Queue - Data Structures using C++ by Varsha Patil
5. Queue - Data Structures using C++ by Varsha Patil
 
R Programming Language
R Programming LanguageR Programming Language
R Programming Language
 
R programming groundup-basic-section-i
R programming groundup-basic-section-iR programming groundup-basic-section-i
R programming groundup-basic-section-i
 
Triple Stores
Triple StoresTriple Stores
Triple Stores
 
Introduction to the language R
Introduction to the language RIntroduction to the language R
Introduction to the language R
 
Rdf conjunctive query selectivity estimation
Rdf conjunctive query selectivity estimationRdf conjunctive query selectivity estimation
Rdf conjunctive query selectivity estimation
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
 
final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)
 
Data Structure
Data StructureData Structure
Data Structure
 
R programming Fundamentals
R programming  FundamentalsR programming  Fundamentals
R programming Fundamentals
 
Positional Data Organization and Compression in Web Inverted Indexes
Positional Data Organization and Compression in Web Inverted IndexesPositional Data Organization and Compression in Web Inverted Indexes
Positional Data Organization and Compression in Web Inverted Indexes
 
Efficient RDF Interchange (ERI) Format for RDF Data Streams
Efficient RDF Interchange (ERI) Format for RDF Data StreamsEfficient RDF Interchange (ERI) Format for RDF Data Streams
Efficient RDF Interchange (ERI) Format for RDF Data Streams
 
Introduction to Data Mining with R and Data Import/Export in R
Introduction to Data Mining with R and Data Import/Export in RIntroduction to Data Mining with R and Data Import/Export in R
Introduction to Data Mining with R and Data Import/Export in R
 
Why R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics PlatformWhy R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics Platform
 
R programming Language , Rahul Singh
R programming Language , Rahul SinghR programming Language , Rahul Singh
R programming Language , Rahul Singh
 

Similaire à Optimized index structures for querying rdf from the web

Semantic Web and Related Work at W3C
Semantic Web and Related Work at W3CSemantic Web and Related Work at W3C
Semantic Web and Related Work at W3CIvan Herman
 
Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Enrico Daga
 
Validating statistical Index Data represented in RDF using SPARQL Queries: Co...
Validating statistical Index Data represented in RDF using SPARQL Queries: Co...Validating statistical Index Data represented in RDF using SPARQL Queries: Co...
Validating statistical Index Data represented in RDF using SPARQL Queries: Co...Jose Emilio Labra Gayo
 
Modelling and Querying Lists in RDF. A Pragmatic Study
Modelling and Querying Lists in RDF. A Pragmatic StudyModelling and Querying Lists in RDF. A Pragmatic Study
Modelling and Querying Lists in RDF. A Pragmatic StudyAlbert Meroño-Peñuela
 
Towards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsTowards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsAlejandro Llaves
 
Towards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsTowards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsAlejandro Llaves
 
Strata NYC 2015 - What's coming for the Spark community
Strata NYC 2015 - What's coming for the Spark communityStrata NYC 2015 - What's coming for the Spark community
Strata NYC 2015 - What's coming for the Spark communityDatabricks
 
Ontology mapping for the semantic web
Ontology mapping for the semantic webOntology mapping for the semantic web
Ontology mapping for the semantic webWorawith Sangkatip
 
2021 04-20 apache arrow and its impact on the database industry.pptx
2021 04-20  apache arrow and its impact on the database industry.pptx2021 04-20  apache arrow and its impact on the database industry.pptx
2021 04-20 apache arrow and its impact on the database industry.pptxAndrew Lamb
 
ACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility DataACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility DataStuart Chalk
 
RDF Stream Processing: Let's React
RDF Stream Processing: Let's ReactRDF Stream Processing: Let's React
RDF Stream Processing: Let's ReactJean-Paul Calbimonte
 
What's new in spark 2.0?
What's new in spark 2.0?What's new in spark 2.0?
What's new in spark 2.0?Örjan Lundberg
 
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان داده
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان دادهمعرفی کاربردهای یادگیری عمیق و چالش های آن در کلان داده
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان دادهWeb Standards School
 
Nutch and lucene_framework
Nutch and lucene_frameworkNutch and lucene_framework
Nutch and lucene_frameworksamuelhard
 
CMPT470-usask-guest-lecture
CMPT470-usask-guest-lectureCMPT470-usask-guest-lecture
CMPT470-usask-guest-lectureMasud Rahman
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic WebIvan Herman
 
Spark Application Carousel: Highlights of Several Applications Built with Spark
Spark Application Carousel: Highlights of Several Applications Built with SparkSpark Application Carousel: Highlights of Several Applications Built with Spark
Spark Application Carousel: Highlights of Several Applications Built with SparkDatabricks
 
RDF APIs for .NET Framework
RDF APIs for .NET FrameworkRDF APIs for .NET Framework
RDF APIs for .NET FrameworkAdriana Ivanciu
 

Similaire à Optimized index structures for querying rdf from the web (20)

Semantic Web and Related Work at W3C
Semantic Web and Related Work at W3CSemantic Web and Related Work at W3C
Semantic Web and Related Work at W3C
 
Democratizing Big Semantic Data management
Democratizing Big Semantic Data managementDemocratizing Big Semantic Data management
Democratizing Big Semantic Data management
 
Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.
 
Validating statistical Index Data represented in RDF using SPARQL Queries: Co...
Validating statistical Index Data represented in RDF using SPARQL Queries: Co...Validating statistical Index Data represented in RDF using SPARQL Queries: Co...
Validating statistical Index Data represented in RDF using SPARQL Queries: Co...
 
Modelling and Querying Lists in RDF. A Pragmatic Study
Modelling and Querying Lists in RDF. A Pragmatic StudyModelling and Querying Lists in RDF. A Pragmatic Study
Modelling and Querying Lists in RDF. A Pragmatic Study
 
Towards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsTowards efficient processing of RDF data streams
Towards efficient processing of RDF data streams
 
Towards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsTowards efficient processing of RDF data streams
Towards efficient processing of RDF data streams
 
Strata NYC 2015 - What's coming for the Spark community
Strata NYC 2015 - What's coming for the Spark communityStrata NYC 2015 - What's coming for the Spark community
Strata NYC 2015 - What's coming for the Spark community
 
Ontology mapping for the semantic web
Ontology mapping for the semantic webOntology mapping for the semantic web
Ontology mapping for the semantic web
 
2021 04-20 apache arrow and its impact on the database industry.pptx
2021 04-20  apache arrow and its impact on the database industry.pptx2021 04-20  apache arrow and its impact on the database industry.pptx
2021 04-20 apache arrow and its impact on the database industry.pptx
 
ACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility DataACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility Data
 
RDF Stream Processing: Let's React
RDF Stream Processing: Let's ReactRDF Stream Processing: Let's React
RDF Stream Processing: Let's React
 
What's new in spark 2.0?
What's new in spark 2.0?What's new in spark 2.0?
What's new in spark 2.0?
 
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان داده
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان دادهمعرفی کاربردهای یادگیری عمیق و چالش های آن در کلان داده
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان داده
 
Nutch and lucene_framework
Nutch and lucene_frameworkNutch and lucene_framework
Nutch and lucene_framework
 
Metadata crosswalks
Metadata crosswalksMetadata crosswalks
Metadata crosswalks
 
CMPT470-usask-guest-lecture
CMPT470-usask-guest-lectureCMPT470-usask-guest-lecture
CMPT470-usask-guest-lecture
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic Web
 
Spark Application Carousel: Highlights of Several Applications Built with Spark
Spark Application Carousel: Highlights of Several Applications Built with SparkSpark Application Carousel: Highlights of Several Applications Built with Spark
Spark Application Carousel: Highlights of Several Applications Built with Spark
 
RDF APIs for .NET Framework
RDF APIs for .NET FrameworkRDF APIs for .NET Framework
RDF APIs for .NET Framework
 

Plus de Mahdi Atawneh

Improving ip geolocation using query logs
Improving ip geolocation using query logsImproving ip geolocation using query logs
Improving ip geolocation using query logsMahdi Atawneh
 
Improvement of shortest path algorithms using subgraphs heuristics
Improvement of shortest path algorithms using subgraphs heuristicsImprovement of shortest path algorithms using subgraphs heuristics
Improvement of shortest path algorithms using subgraphs heuristicsMahdi Atawneh
 
SILT: A Memory-Efficient, High-Performance Key-Value Store
SILT: A Memory-Efficient, High-Performance Key-Value StoreSILT: A Memory-Efficient, High-Performance Key-Value Store
SILT: A Memory-Efficient, High-Performance Key-Value StoreMahdi Atawneh
 
OWL reasoning with WebPIE: calculating the closer of 100 billion triples
OWL reasoning with WebPIE: calculating the closer of 100 billion triplesOWL reasoning with WebPIE: calculating the closer of 100 billion triples
OWL reasoning with WebPIE: calculating the closer of 100 billion triplesMahdi Atawneh
 
Bat algorithm explained. slides ppt pptx
Bat algorithm explained. slides ppt pptxBat algorithm explained. slides ppt pptx
Bat algorithm explained. slides ppt pptxMahdi Atawneh
 
01 nosql and multi model database
01   nosql and multi model database01   nosql and multi model database
01 nosql and multi model databaseMahdi Atawneh
 

Plus de Mahdi Atawneh (6)

Improving ip geolocation using query logs
Improving ip geolocation using query logsImproving ip geolocation using query logs
Improving ip geolocation using query logs
 
Improvement of shortest path algorithms using subgraphs heuristics
Improvement of shortest path algorithms using subgraphs heuristicsImprovement of shortest path algorithms using subgraphs heuristics
Improvement of shortest path algorithms using subgraphs heuristics
 
SILT: A Memory-Efficient, High-Performance Key-Value Store
SILT: A Memory-Efficient, High-Performance Key-Value StoreSILT: A Memory-Efficient, High-Performance Key-Value Store
SILT: A Memory-Efficient, High-Performance Key-Value Store
 
OWL reasoning with WebPIE: calculating the closer of 100 billion triples
OWL reasoning with WebPIE: calculating the closer of 100 billion triplesOWL reasoning with WebPIE: calculating the closer of 100 billion triples
OWL reasoning with WebPIE: calculating the closer of 100 billion triples
 
Bat algorithm explained. slides ppt pptx
Bat algorithm explained. slides ppt pptxBat algorithm explained. slides ppt pptx
Bat algorithm explained. slides ppt pptx
 
01 nosql and multi model database
01   nosql and multi model database01   nosql and multi model database
01 nosql and multi model database
 

Dernier

Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxAnaBeatriceAblay2
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaVirag Sontakke
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfadityarao40181
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,Virag Sontakke
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfakmcokerachita
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 

Dernier (20)

Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of India
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdf
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdf
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 

Optimized index structures for querying rdf from the web

  • 1. Optimized Index Structures for Querying RDF from the Web Presented by : Mahdi Atawna 1
  • 2. About the Paper  The paper was published at the Third Latin American Web Congress in 2005  Have 56 citation. Andreas Harth National University of Galway, Ireland Prof. Stefan Decker National University of Galway, Ireland 2
  • 3. Outline  Overview of Semantic Web.  Overview of Indexes.  Paper motivation.  Methodology.  Experiment & Result.  Conclusion. 3
  • 4. Semantic Web  Also called :  Web 3.0.  the Linked Data Web.  the Web of Data…whatever you call it.  the next major evolution in connecting information. 4
  • 5. Why semantic web?  It enables data to be linked from a source to any other source.  It can be understood by computers so that they can perform increasingly sophisticated tasks on our behalf. 5
  • 7. Semantic Web Standards  RDF (Resource Description Framework): The data modeling language for the Semantic Web (like UML). All Semantic Web information is stored and represented in the RDF.  SPARQL : The query language of the Semantic Web.  OWL (Web Ontology Language) The schema language, or knowledge representation (KR) language, of the Semantic Web. 7
  • 8. What is RDF?  RDF is the data model of the Semantic Web.  That means that all data in Semantic Web technologies is represented as RDF.  If you store Semantic Web data, it's in RDF.  If you query Semantic Web data (typically using SPARQL), it's RDF data. If you send Semantic Web data to your friend, it's RDF. 8
  • 9. 9
  • 11. RDF triples  are representations of graph edges. 11 Subject Object Predicate Mahdi Hebron born in
  • 12. RDF example @prefix foaf: <http://xmlns.com/foaf/0.1/ . <http://example.org/bob#me> foaf:topic_interest <http://wikidata.org/entity/Q12418> . 12
  • 13. RDF example @prefix foaf: <http://xmlns.com/foaf/0.1/ . <http://example.org/bob#me> foaf:topic_interest <http://wikidata.org/entity/Q12418> . Subject predicate Object 13
  • 14. Sparql query language SELECT ?p ?o { <http://nasa.dataincubator.org/spacecraft/1968-089A> ?p ?o } 14
  • 15. What is database index?  A database index is a data structure that improves the speed of data retrieval operations on a database table at the cost of additional writes and storage space to maintain the index data structure.  Index goal : The index structure enables fast retrieval of data 15
  • 16. 16
  • 17. Index example key articles Leonardo [104,70,12,98] Mona Lisa [2,201,7,20,12] Francesco [1,8,900,104] 17
  • 18. Paper motivation  Previous Systems provide a storage infrastructure for RDF data, but index structure which do not support typical query scenarios for data from the Web which results in poor query answering performance in some cases. 18
  • 19. Methodology  The researchers present a new index structure that handle the data from the Web .  Implemented the index structure in a lightweight software called YARS 19
  • 20. RDF Index structures  The authors suggested an index structure that contains two sets: 1. Lexicon : covers the string representation of RDF graph (r,l,b) 2. Quad indexes : cover the quads (triples). 20
  • 21. 1. Lexicon indexes  NodeOID and OIDNode Index  Keyword Index 21
  • 22. 1. Lexicon indexes  NodeOID and OIDNode Index : 22 Key value <http://www.harth.org/andreas/#me> 3 <http://decker.cn/stefan/> 14 <http://sw.deri.org/ aharth/foaf.rdf> 11 <http://www.deri.org/> 1
  • 23. 1. Lexicon indexes  Keyword Index (Popular in search engines) 23 Key No of hits List of hits “Andreas” 1 3 “Decker” 1 11 ”Harth” 1 3 “Stefan” 2 11,13
  • 24. 2. Quad indexes  Access Patterns  Combined Indexes  Occurrence Counts 24
  • 25. 2. Quad indexes A- Access Patterns 25 No Access pattern No Access pattern 1 (?:?:?:?) 9 (s:?:o:c) 2 (s:?:?:?) 10 (?:?:o:c) 3 (s:p:?:?) 11 (?:?:o:?) 4 (s:p:o:?) 12 (?:?:?:c) 5 (s:p:o:c) 13 (s:?:?:c) 6 (?:p:?:?) 14 (s:p:?:c) 7 (?:p:o:?) 15 (?:p:?:c) 8 (?:p:o:c) 16 (s:?:o:?)
  • 26. 2. Quad indexes A- Access Patterns 26 No Access pattern values 1 (?:?:?:?) [1,5,3] 2 (s:?:?:?) [2] 3 (s:p:?:?) [9,8,2,3] 4 (s:p:o:?) [1,3] 5 (s:p:o:c) [1] 6 (?:p:?:?) [76,9] 7 (?:p:o:?) [2,3]
  • 27. 2. Quad indexes B- Combined Indexes 27 spoc poc osc csp cp os (?:?:?:?) (?:p:?:?) (?:?:o:?) (?:?:?:c) (?:p:?:c) (s:?:o:?) (s:?:?:?) (?:p:o:?) (?:?:o:c) (s:?:?:c) (s:p:?:?) (?:p:o:c) (s:?:o:c) (s:p:?:c) (s:p:o:?) (s:p:o:c)
  • 28. 2. Quad indexes C- Occurrence Counts 28 No Access pattern values count 1 (?:?:?:?) [1,5,3] 3 2 (s:?:?:?) [2] 1 3 (s:p:?:?) [9,8,2,3] 4 4 (s:p:o:?) [1,3] 2 5 (s:p:o:c) [1] 1 6 (?:p:?:?) [76,9] 2 7 (?:p:o:?) [2,3] 2
  • 29. YARS  Web application that built in JAVA.  Has two parts:  a storage component that handles both persistent and in- memory indexes.  a query handler to perform query processing and evaluation. 29
  • 30. Experiment  They evaluated the performance based on a dataset of 2.8 million triples (293 MB).  The testing server has :  Pentium-4 2.4 GHz  4 GB RAM  running Debian Sarge . 30
  • 31. Experiment They considered the following RDF stores for evaluation:  Sesame.  Kowari (failed to get a running version).  Redland.  Jena2. ([9] shows that Sesame generally supersedes Jena in performance results) 31
  • 32. Experiment Experiment parts:  Index Construction.  Queries. 32
  • 33. Result – index construction System Index size (bytes) Redland 2.164.019.200 Sesame MySQL 340.381.636 Sesame native 39.997.992 YARS 1.090.002.944 33 Table 8: Index size for the synthetic Univ20 dataset.
  • 34. Result – index construction 34
  • 35. Result - queries No Query 1 ?x rdf:type univ:UndergraduateStudent 2 ?x ?p ”UndergraduateStudent0” 3 <http://www.University965.edu> ?p ?o 4 ?x univ:worksFor ?y 35 Query Redland Sesame MySQL Sesame Native YARS 1 0:10.48 0:18.87 1:05.16 0:18.41 2 0:44.14 0:00.73 0:00.55 0:00.49 3 0:44.15 0:00.46 0:00.47 0:00.32 4 3:04.21 0:03.42 0:01.95 0:00.47 Performance results for quad queries.
  • 36. Conclusion  The auther introduced query processing for RDF which is an I portant issue in sematic web.  YARS has some overhead for resolving the dependencies and order in comparison with others. 36
  • 37. Criticism  - In experiment , the researchers removed “Kowari “ engine because the cannot install it. 37