SlideShare une entreprise Scribd logo
1  sur  30
Télécharger pour lire hors ligne
Slider: an Efficient Incremental Reasoner
Jules Chevalier
jules.chevalier@univ-st-etienne.fr
LT2C, Télécom Saint Etienne, Université Jean Monnet
Octobre 2014
Supervisors :
Fréférique Laforest
Christophe Gravier
Julien Subercaze
Summary
Introduction
State of the art
Contribution
Experimental results
Conclusion
2 / 27
Semantic Web & Description Logic
Formalises concepts to represent them
Standardizes this representation
Makes it readable for both humans and computers
Links these data together
Allows automatic operations on these data
Integrity constraint validation
Query the knowledge base
Extraction of implicit data
3 / 27
Semantic Web & Description Logic
Formalises concepts to represent them
Standardizes this representation
Makes it readable for both humans and computers
Links these data together
Allows automatic operations on these data
Integrity constraint validation
Query the knowledge base
Extraction of implicit data = Reasoning
4 / 27
Reasoning : Forward Chaining VS Backward Chaining
Abraham
Homer Marge
Liza Bart
What we know :
Abraham father Homer
Homer father Liza
Homer father Bart
Marge mother Liza
Marge mother bart
5 / 27
Reasoning : Forward Chaining VS Backward Chaining
Abraham
Homer Marge
Liza Bart
What we know :
Abraham father Homer
Homer father Liza
Homer father Bart
Marge mother Liza
Marge mother bart
What Forward Chaining do :
Abraham grandfather Liza
Abraham grandfather Bart
...
Abraham grandfather Liza ? → yes
5 / 27
Reasoning : Forward Chaining VS Backward Chaining
Abraham
Homer Marge
Liza Bart
What we know :
Abraham father Homer
Homer father Liza
Homer father Bart
Marge mother Liza
Marge mother bart
What Forward Chaining do :
Abraham grandfather Liza
Abraham grandfather Bart
...
Abraham grandfather Liza ? → yes
What Backward Chaining do :
Abraham grandfather Liza ?
Abraham father X & X father Liza ?
Abraham father Homer &
Homer father Liza → yes
5 / 27
Rule-based Reasoning
Rules
An antecedent: Allows the rule to be executed
A consequent: The statement inferred
c1 subClassOf c2, x type c1
(cax-sco)
x type c2
Fragments
A fragment is a set of inference rules
Semantic Web standards suggest different pre defined
fragments (RDFS, OWL Lite, OWL Full, OWL DL, ...)
The more they have a high expressivity, the more the
operations are complex (from P to NEXPTIME)
Choosing one fragment is trade off between expressivity and
computational complexity
6 / 27
Problematic
What we want to do
Efficient forward-reasoning over Big Data (NP complete for
most complex ontologies)
7 / 27
Problematic
What we want to do
Efficient forward-reasoning over Big Data (NP complete for
most complex ontologies)
What are the problems
Rules form a cyclic graph
Complexity depends on the fragment !
The amount of triples generated is quite unpredictable
The complexity also depends on data !
Big Data is not static
We need to handle data streams !
7 / 27
Summary
Introduction
State of the art
Contribution
Experimental results
Conclusion
8 / 27
Batch reasoning approaches
WebPie : a Web-scale Parallel Inference Engine
2009 - Jacopo Urbani Thesis [6]
Uses MapReduce for OWL Horst and RDFS reasoning
2011 - Fix some issues to improve OWL Horst reasoning [7]
Duplicates limitation
Indexation for sameAs
Greedy scheduling
Cleaner Job after some rules, or at the end
MapResolve [5]
Based their work on WebPie to develop a OWL Horst reasoner
Use 3 sets for triples : usable, used, inferred
Limits overheads, optimise
Points out MapReduce limitations
9 / 27
Analysis : MapReduce approaches
MapReduce
Framework
Allows to implement distributed
tasks
The Hadoop framework
Best suited to batch process huge
amounts of data
MapReduce requires an acyclic
dataflow
Jobs run in isolation
Not suitable network shuffling
Hadoop distributed file system
WebPie and MapResolve
Contributions
Only provide batch reasoning
Nodes must wait for each other
Generate a lot of duplicates
Fragment dependant
Naive partitioning
Critical letter for WebPie [4]
10 / 27
Incremental solutions
History Matters: Incremental Ontology Reasoning Using
Modules [2]
Maintains classification of ontologies as they evolve
Provides encouraging results
Not viable for static hierarchy of ontologies
Not adapted on high number of nominals
Incremental Reasoning in OWL EL without Bookkeeping [3]
Handles both addition and deletion of knowledge
Incremental classification of TBox
Limited to the classification on the TBox
Dedicated to the EL+ fragment
11 / 27
Summary
Introduction
State of the art
Contribution
Experimental results
Conclusion
12 / 27
Proposed solution
Slider
Parallel and Scalable Execution
Rules mapped to independent modules
Multiple rule instances allowed to run in parallel
Duplicates Limitation
Shared triple store
Vertical partitioning [1] and multiple indexing
Data Stream Support
Streamed architecture
Parallel parsing/reasoning
Fragment’s Customization
Dynamic support of ruleset
ρdf and RDFS natively supported
Extendible to any other fragment
13 / 27
Architecture
TRIPLE STORE
Evolving
Data
New
triples
Explicit Triples
Implicit Triples
Streamed Triples
R2
Buffer R3
Distributor R3
Distributor R2
Distributor R1
Buffer R2
Buffer R1
R2R1
R2
R1R3
Input Manager Rules Buffers Thread Pool Distributors
R1
R1
R2
R2
R3
R3
Incoming
triples
Concurrent Access
Rule Modules
Input
Manager
R1
R2
R3
R3
R1
R2
14 / 27
Architecture
Input Manager
Receives incoming triples
Sends them to
The triple store
The rules buffers
Rules Buffers
A buffer for each rule
Run the rule when full
Run the rule when
timed-out
Ensures completeness
Thread Pool
Manages a pool instances
Ensures scalability
Rule instance
Execute the inference
Access concurrently the
triple store
Distributor
Stores inferred triples
Dispatches them to the
buffers
15 / 27
Inference: cax-sco
16 / 27
Triple Store
Vertical Partitioning
2
1
3
4
5
7
6
8
9
(1,2,3)
(4,2,5)
(6,7,8)
(6,7,9)
TRIPLES ENCODING
Near-optimal indexing
Indexing by predicates, subjects
and objects
Best trade-off for nearly all rules
from the OWL fragments
Concurrent Access
ReentrantReadWriteLocks
ensure concurrency
Write lock to add triples
Read lock for other methods
Duplicates Elimination
HashMap of MultiMaps∗
Bans duplicates
Ensures uniqueness of triples
∗
Google’s Guava libraries
17 / 27
Rules Dependency Graph
Directed graph
Edges represent rules
A → B: B can use the output
of A
Created at initialisation time
Used to route new triples by
The input manager
The distributors
PRP-
DOM
CAX-
SCO
PRP-
RNG
PRP-
SPO1
SCM-
SCO
SCM-
DOM2
SCM-
RNG2
SCM-
SPO
Universal Input
Rules Dependency Graph for ρdf
18 / 27
Architecture
TRIPLE STORE
Evolving
Data
New
triples
Explicit Triples
Implicit Triples
Streamed Triples
R2
Buffer R3
Distributor R3
Distributor R2
Distributor R1
Buffer R2
Buffer R1
R2R1
R2
R1R3
Input Manager Rules Buffers Thread Pool Distributors
R1
R1
R2
R2
R3
R3
Incoming
triples
Concurrent Access
Rule Modules
Input
Manager
R1
R2
R3
R3
R1
R2
19 / 27
Summary
Introduction
State of the art
Contribution
Experimental results
Conclusion
20 / 27
Experimentations
Baseline
OWLIM-SE (Standard Edition)
Semantic repository with reasoning features
Fastest reasoner available to the best of our knowledge
Outperforms Jena and Sesame
Natively supports RDFS, custom rule configuration for ρdf
Dataset
13 ontologies from 3 sets:
2 Real life ontologies: WordNet and Wikipedia
5 generated by BSBM, from 100,000 to 5 million triples
6 subClassOf ontologies (closure computation, duplicates
intensive)
21 / 27
Experiments
ρdf reasoning RDFS reasoning
Ontology OWLIM Slider OWLIM Slider
BSBM_100k 9.907s 4.636s 7.487s 4.558s
BSBM_200k 13.338s 6.059s 11.064s 6.198s
BSBM_500k 23.595s 11.133s 20.580s 10.984s
BSBM_1M 39.364s 22.357s 35.602s 22.192s
BSBM_5M 170.151s 126.292s 160.699s 127.037s
wikipedia 18.802s 17.422s 17.186s 22.443s
wordnet - - 15.075s 8.828s
subClassOf10 3.507s 1.209s 1.423s 1.216s
subClassOf20 3.730s 1.316s 1.536s 1.330s
subClassOf50 4.159s 1.615s 1.865s 1.583s
subClassOf100 4.397s 1.827s 2.242s 1.805s
subClassOf200 4.962s 2.210s 2.837s 2.170s
subClassOf500 9.862s 8.102s 7.584s 7.625s
Improvement
Average 71.47%
RDFS 36.08%
ρdf 106.86%
RDFS
owlimse
slider
5
10
15
20
25
30
35
40
InferenceTime
(inseconds)
ρdf
5
10
15
20
25
30
35
40
45
BSBM
_100k
BSBM
_200k
BSBM
_500k
BSBM
_1Mw
ikipediaw
ordnet
subClassO
f10
subClassO
f20
subClassO
f50
subClassO
f100
subClassO
f200
subClassO
f500
owlimse
slider
InferenceTime
(inseconds)
Inference time for Slider and OWLIM-SE on ρdf and RDFS
22 / 27
Demonstration
23 / 27
Summary
Introduction
State of the art
Contribution
Experimental results
Conclusion
24 / 27
Conclusion and Future Work
Slider
Efficient incremental rule-based reasoning
Fragment agnocism
Data streams support
Improvement of 71.47% in average against baseline
Future Work
Implementation of new rulesets
Just-in-time optimisation of rules scheduling
Use of historical statistics for adaptation
25 / 27
Bibliography I
[1] Abadi, D. J., Marcus, A., Madden, S. R., and Hollenbach, K.
Scalable semantic web data management using vertical partitioning.
In Proceedings of the 33rd International Conference on Very Large Data Bases (2007), VLDB ’07, VLDB
Endowment, pp. 411–422.
[2] Cuenca Grau, B., Halaschek-Wiener, C., and Kazakov, Y.
History matters: Incremental ontology reasoning using modules.
In The Semantic Web. 2007.
[3] Kazakov, Y., and Klinov, P.
Incremental reasoning in owl el without bookkeeping.
In ISWC 2013. 2013.
[4] Patel-Schneider, P.
Letter: Comments on WebPIE: A Web-scale parallel inference engine using MapReduce.
Web Semantics: Science, Services and Agents on . . . 15 (Sept. 2012), 69–70.
[5] Schlicht, A., and Stuckenschmidt, H.
Mapresolve.
Web Reasoning and Rule Systems 6902 (2011), 294–299.
[6] Urbani, J.
RDFS/OWL reasoning using the MapReduce framework.
PhD thesis, Vrije Universiteit - Faculty of Sciences, 2009.
[7] Urbani, J., Kotoulas, S., Maassen, J., van Harmelen, F., and Bal, H. E.
WebPIE: A Web-scale parallel inference engine using MapReduce.
J. Web Sem. 10 (2012), 59–75.
26 / 27
Slider: an Efficient Incremental Reasoner
Thank you for your attention
jules.chevalier@univ-st-etienne.fr
https://github.com/juleschevalier/slider
http://demo-satin.telecom-st-etienne.fr/slider/
27 / 27

Contenu connexe

Similaire à Slider: an Efficient Incremental Reasoner, by Jules Chevalier

This is the way - Holistic (Network) Automation
This is the way - Holistic (Network) AutomationThis is the way - Holistic (Network) Automation
This is the way - Holistic (Network) Automation
Maximilan Wilhelm
 
Producer consumer-problems
Producer consumer-problemsProducer consumer-problems
Producer consumer-problems
Richard Ashworth
 
Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...
Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...
Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...
Laurens De Vocht
 
Scimakelatex.83323.robson+medeiros+de+araujo
Scimakelatex.83323.robson+medeiros+de+araujoScimakelatex.83323.robson+medeiros+de+araujo
Scimakelatex.83323.robson+medeiros+de+araujo
Robson Araujo
 
Deploying the producer consumer problem using homogeneous modalities
Deploying the producer consumer problem using homogeneous modalitiesDeploying the producer consumer problem using homogeneous modalities
Deploying the producer consumer problem using homogeneous modalities
Fredrick Ishengoma
 
Brian Klumpe Unification of Producer Consumer Key Pairs
Brian Klumpe Unification of Producer Consumer Key PairsBrian Klumpe Unification of Producer Consumer Key Pairs
Brian Klumpe Unification of Producer Consumer Key Pairs
Brian_Klumpe
 
Introduction to Cluster Computing and Map Reduce (from Google)
Introduction to Cluster Computing and Map Reduce  (from Google)Introduction to Cluster Computing and Map Reduce  (from Google)
Introduction to Cluster Computing and Map Reduce (from Google)
Sri Prasanna
 
Comparing Write-Ahead Logging and the Memory Bus Using
Comparing Write-Ahead Logging and the Memory Bus UsingComparing Write-Ahead Logging and the Memory Bus Using
Comparing Write-Ahead Logging and the Memory Bus Using
jorgerodriguessimao
 
Naveen nimmu sdn future of networking
Naveen nimmu sdn   future of networkingNaveen nimmu sdn   future of networking
Naveen nimmu sdn future of networking
OpenSourceIndia
 
Naveen nimmu sdn future of networking
Naveen nimmu sdn   future of networkingNaveen nimmu sdn   future of networking
Naveen nimmu sdn future of networking
suniltomar04
 

Similaire à Slider: an Efficient Incremental Reasoner, by Jules Chevalier (20)

Scimakelatex.93126.cocoon.bobbin
Scimakelatex.93126.cocoon.bobbinScimakelatex.93126.cocoon.bobbin
Scimakelatex.93126.cocoon.bobbin
 
This is the way - Holistic (Network) Automation
This is the way - Holistic (Network) AutomationThis is the way - Holistic (Network) Automation
This is the way - Holistic (Network) Automation
 
Model checking
Model checkingModel checking
Model checking
 
Producer consumer-problems
Producer consumer-problemsProducer consumer-problems
Producer consumer-problems
 
Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...
Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...
Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...
 
Scimakelatex.83323.robson+medeiros+de+araujo
Scimakelatex.83323.robson+medeiros+de+araujoScimakelatex.83323.robson+medeiros+de+araujo
Scimakelatex.83323.robson+medeiros+de+araujo
 
RAMSES: Robust Analytic Models for Science at Extreme Scales
RAMSES: Robust Analytic Models for Science at Extreme ScalesRAMSES: Robust Analytic Models for Science at Extreme Scales
RAMSES: Robust Analytic Models for Science at Extreme Scales
 
Constructing Operating Systems and E-Commerce
Constructing Operating Systems and E-CommerceConstructing Operating Systems and E-Commerce
Constructing Operating Systems and E-Commerce
 
Deploying the producer consumer problem using homogeneous modalities
Deploying the producer consumer problem using homogeneous modalitiesDeploying the producer consumer problem using homogeneous modalities
Deploying the producer consumer problem using homogeneous modalities
 
The Power of Motif Counting Theory, Algorithms, and Applications for Large Gr...
The Power of Motif Counting Theory, Algorithms, and Applications for Large Gr...The Power of Motif Counting Theory, Algorithms, and Applications for Large Gr...
The Power of Motif Counting Theory, Algorithms, and Applications for Large Gr...
 
Crypto
CryptoCrypto
Crypto
 
Keynote at AImWD
Keynote at AImWDKeynote at AImWD
Keynote at AImWD
 
Brian Klumpe Unification of Producer Consumer Key Pairs
Brian Klumpe Unification of Producer Consumer Key PairsBrian Klumpe Unification of Producer Consumer Key Pairs
Brian Klumpe Unification of Producer Consumer Key Pairs
 
Introduction to Cluster Computing and Map Reduce (from Google)
Introduction to Cluster Computing and Map Reduce  (from Google)Introduction to Cluster Computing and Map Reduce  (from Google)
Introduction to Cluster Computing and Map Reduce (from Google)
 
[ACNA2022] Hadoop Vectored IO_ your data just got faster!.pdf
[ACNA2022] Hadoop Vectored IO_ your data just got faster!.pdf[ACNA2022] Hadoop Vectored IO_ your data just got faster!.pdf
[ACNA2022] Hadoop Vectored IO_ your data just got faster!.pdf
 
User biglm
User biglmUser biglm
User biglm
 
IEEE Parallel and distributed system 2016 Title and Abstract
IEEE Parallel and distributed system 2016 Title and AbstractIEEE Parallel and distributed system 2016 Title and Abstract
IEEE Parallel and distributed system 2016 Title and Abstract
 
Comparing Write-Ahead Logging and the Memory Bus Using
Comparing Write-Ahead Logging and the Memory Bus UsingComparing Write-Ahead Logging and the Memory Bus Using
Comparing Write-Ahead Logging and the Memory Bus Using
 
Naveen nimmu sdn future of networking
Naveen nimmu sdn   future of networkingNaveen nimmu sdn   future of networking
Naveen nimmu sdn future of networking
 
Naveen nimmu sdn future of networking
Naveen nimmu sdn   future of networkingNaveen nimmu sdn   future of networking
Naveen nimmu sdn future of networking
 

Plus de opencloudware

OpenCloudware Collaborative project presented at Cloud Expo Europe 2012 in Lo...
OpenCloudware Collaborative project presented at Cloud Expo Europe 2012 in Lo...OpenCloudware Collaborative project presented at Cloud Expo Europe 2012 in Lo...
OpenCloudware Collaborative project presented at Cloud Expo Europe 2012 in Lo...
opencloudware
 
OpenCloudware Collaborative Project presented at Cloud Expo Europe 2012 in Lo...
OpenCloudware Collaborative Project presented at Cloud Expo Europe 2012 in Lo...OpenCloudware Collaborative Project presented at Cloud Expo Europe 2012 in Lo...
OpenCloudware Collaborative Project presented at Cloud Expo Europe 2012 in Lo...
opencloudware
 

Plus de opencloudware (6)

"Knowledge Enabled Real-Time Recommendation System", by Jules Chevalier and S...
"Knowledge Enabled Real-Time Recommendation System", by Jules Chevalier and S..."Knowledge Enabled Real-Time Recommendation System", by Jules Chevalier and S...
"Knowledge Enabled Real-Time Recommendation System", by Jules Chevalier and S...
 
Opencloudware, The vApp Lifecycle Management Solution for Multi-Cloud - Cloud...
Opencloudware, The vApp Lifecycle Management Solution for Multi-Cloud - Cloud...Opencloudware, The vApp Lifecycle Management Solution for Multi-Cloud - Cloud...
Opencloudware, The vApp Lifecycle Management Solution for Multi-Cloud - Cloud...
 
OpenCloudware: the Cloud application lifecycle management platform
OpenCloudware: the Cloud application lifecycle management platformOpenCloudware: the Cloud application lifecycle management platform
OpenCloudware: the Cloud application lifecycle management platform
 
Think to PaaS for Multi-IaaS Cloud Computing: the OpenCloudware collaborative...
Think to PaaS for Multi-IaaS Cloud Computing: the OpenCloudware collaborative...Think to PaaS for Multi-IaaS Cloud Computing: the OpenCloudware collaborative...
Think to PaaS for Multi-IaaS Cloud Computing: the OpenCloudware collaborative...
 
OpenCloudware Collaborative project presented at Cloud Expo Europe 2012 in Lo...
OpenCloudware Collaborative project presented at Cloud Expo Europe 2012 in Lo...OpenCloudware Collaborative project presented at Cloud Expo Europe 2012 in Lo...
OpenCloudware Collaborative project presented at Cloud Expo Europe 2012 in Lo...
 
OpenCloudware Collaborative Project presented at Cloud Expo Europe 2012 in Lo...
OpenCloudware Collaborative Project presented at Cloud Expo Europe 2012 in Lo...OpenCloudware Collaborative Project presented at Cloud Expo Europe 2012 in Lo...
OpenCloudware Collaborative Project presented at Cloud Expo Europe 2012 in Lo...
 

Dernier

一比一原版犹他大学毕业证如何办理
一比一原版犹他大学毕业证如何办理一比一原版犹他大学毕业证如何办理
一比一原版犹他大学毕业证如何办理
F
 
Abu Dhabi Escorts Service 0508644382 Escorts in Abu Dhabi
Abu Dhabi Escorts Service 0508644382 Escorts in Abu DhabiAbu Dhabi Escorts Service 0508644382 Escorts in Abu Dhabi
Abu Dhabi Escorts Service 0508644382 Escorts in Abu Dhabi
Monica Sydney
 
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girlsRussian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Monica Sydney
 
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
ydyuyu
 

Dernier (20)

Local Call Girls in Gomati 9332606886 HOT & SEXY Models beautiful and charmi...
Local Call Girls in Gomati  9332606886 HOT & SEXY Models beautiful and charmi...Local Call Girls in Gomati  9332606886 HOT & SEXY Models beautiful and charmi...
Local Call Girls in Gomati 9332606886 HOT & SEXY Models beautiful and charmi...
 
Down bad crying at the gym t shirtsDown bad crying at the gym t shirts
Down bad crying at the gym t shirtsDown bad crying at the gym t shirtsDown bad crying at the gym t shirtsDown bad crying at the gym t shirts
Down bad crying at the gym t shirtsDown bad crying at the gym t shirts
 
Call girls Service Canacona - 8250092165 Our call girls are sure to provide y...
Call girls Service Canacona - 8250092165 Our call girls are sure to provide y...Call girls Service Canacona - 8250092165 Our call girls are sure to provide y...
Call girls Service Canacona - 8250092165 Our call girls are sure to provide y...
 
Best SEO Services Company in Dallas | Best SEO Agency Dallas
Best SEO Services Company in Dallas | Best SEO Agency DallasBest SEO Services Company in Dallas | Best SEO Agency Dallas
Best SEO Services Company in Dallas | Best SEO Agency Dallas
 
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
 
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
 
South Bopal [ (Call Girls) in Ahmedabad ₹7.5k Pick Up & Drop With Cash Paymen...
South Bopal [ (Call Girls) in Ahmedabad ₹7.5k Pick Up & Drop With Cash Paymen...South Bopal [ (Call Girls) in Ahmedabad ₹7.5k Pick Up & Drop With Cash Paymen...
South Bopal [ (Call Girls) in Ahmedabad ₹7.5k Pick Up & Drop With Cash Paymen...
 
APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53
 
Call girls Service in Ajman 0505086370 Ajman call girls
Call girls Service in Ajman 0505086370 Ajman call girlsCall girls Service in Ajman 0505086370 Ajman call girls
Call girls Service in Ajman 0505086370 Ajman call girls
 
Meaning of On page SEO & its process in detail.
Meaning of On page SEO & its process in detail.Meaning of On page SEO & its process in detail.
Meaning of On page SEO & its process in detail.
 
一比一原版犹他大学毕业证如何办理
一比一原版犹他大学毕业证如何办理一比一原版犹他大学毕业证如何办理
一比一原版犹他大学毕业证如何办理
 
best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...
best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...
best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...
 
"Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency""Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency"
 
Abu Dhabi Escorts Service 0508644382 Escorts in Abu Dhabi
Abu Dhabi Escorts Service 0508644382 Escorts in Abu DhabiAbu Dhabi Escorts Service 0508644382 Escorts in Abu Dhabi
Abu Dhabi Escorts Service 0508644382 Escorts in Abu Dhabi
 
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girlsRussian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
 
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
 
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...
 
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
 
Story Board.pptxrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr
Story Board.pptxrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrStory Board.pptxrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr
Story Board.pptxrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr
 
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
 

Slider: an Efficient Incremental Reasoner, by Jules Chevalier

  • 1. Slider: an Efficient Incremental Reasoner Jules Chevalier jules.chevalier@univ-st-etienne.fr LT2C, Télécom Saint Etienne, Université Jean Monnet Octobre 2014 Supervisors : Fréférique Laforest Christophe Gravier Julien Subercaze
  • 2. Summary Introduction State of the art Contribution Experimental results Conclusion 2 / 27
  • 3. Semantic Web & Description Logic Formalises concepts to represent them Standardizes this representation Makes it readable for both humans and computers Links these data together Allows automatic operations on these data Integrity constraint validation Query the knowledge base Extraction of implicit data 3 / 27
  • 4. Semantic Web & Description Logic Formalises concepts to represent them Standardizes this representation Makes it readable for both humans and computers Links these data together Allows automatic operations on these data Integrity constraint validation Query the knowledge base Extraction of implicit data = Reasoning 4 / 27
  • 5. Reasoning : Forward Chaining VS Backward Chaining Abraham Homer Marge Liza Bart What we know : Abraham father Homer Homer father Liza Homer father Bart Marge mother Liza Marge mother bart 5 / 27
  • 6. Reasoning : Forward Chaining VS Backward Chaining Abraham Homer Marge Liza Bart What we know : Abraham father Homer Homer father Liza Homer father Bart Marge mother Liza Marge mother bart What Forward Chaining do : Abraham grandfather Liza Abraham grandfather Bart ... Abraham grandfather Liza ? → yes 5 / 27
  • 7. Reasoning : Forward Chaining VS Backward Chaining Abraham Homer Marge Liza Bart What we know : Abraham father Homer Homer father Liza Homer father Bart Marge mother Liza Marge mother bart What Forward Chaining do : Abraham grandfather Liza Abraham grandfather Bart ... Abraham grandfather Liza ? → yes What Backward Chaining do : Abraham grandfather Liza ? Abraham father X & X father Liza ? Abraham father Homer & Homer father Liza → yes 5 / 27
  • 8. Rule-based Reasoning Rules An antecedent: Allows the rule to be executed A consequent: The statement inferred c1 subClassOf c2, x type c1 (cax-sco) x type c2 Fragments A fragment is a set of inference rules Semantic Web standards suggest different pre defined fragments (RDFS, OWL Lite, OWL Full, OWL DL, ...) The more they have a high expressivity, the more the operations are complex (from P to NEXPTIME) Choosing one fragment is trade off between expressivity and computational complexity 6 / 27
  • 9. Problematic What we want to do Efficient forward-reasoning over Big Data (NP complete for most complex ontologies) 7 / 27
  • 10. Problematic What we want to do Efficient forward-reasoning over Big Data (NP complete for most complex ontologies) What are the problems Rules form a cyclic graph Complexity depends on the fragment ! The amount of triples generated is quite unpredictable The complexity also depends on data ! Big Data is not static We need to handle data streams ! 7 / 27
  • 11. Summary Introduction State of the art Contribution Experimental results Conclusion 8 / 27
  • 12. Batch reasoning approaches WebPie : a Web-scale Parallel Inference Engine 2009 - Jacopo Urbani Thesis [6] Uses MapReduce for OWL Horst and RDFS reasoning 2011 - Fix some issues to improve OWL Horst reasoning [7] Duplicates limitation Indexation for sameAs Greedy scheduling Cleaner Job after some rules, or at the end MapResolve [5] Based their work on WebPie to develop a OWL Horst reasoner Use 3 sets for triples : usable, used, inferred Limits overheads, optimise Points out MapReduce limitations 9 / 27
  • 13. Analysis : MapReduce approaches MapReduce Framework Allows to implement distributed tasks The Hadoop framework Best suited to batch process huge amounts of data MapReduce requires an acyclic dataflow Jobs run in isolation Not suitable network shuffling Hadoop distributed file system WebPie and MapResolve Contributions Only provide batch reasoning Nodes must wait for each other Generate a lot of duplicates Fragment dependant Naive partitioning Critical letter for WebPie [4] 10 / 27
  • 14. Incremental solutions History Matters: Incremental Ontology Reasoning Using Modules [2] Maintains classification of ontologies as they evolve Provides encouraging results Not viable for static hierarchy of ontologies Not adapted on high number of nominals Incremental Reasoning in OWL EL without Bookkeeping [3] Handles both addition and deletion of knowledge Incremental classification of TBox Limited to the classification on the TBox Dedicated to the EL+ fragment 11 / 27
  • 15. Summary Introduction State of the art Contribution Experimental results Conclusion 12 / 27
  • 16. Proposed solution Slider Parallel and Scalable Execution Rules mapped to independent modules Multiple rule instances allowed to run in parallel Duplicates Limitation Shared triple store Vertical partitioning [1] and multiple indexing Data Stream Support Streamed architecture Parallel parsing/reasoning Fragment’s Customization Dynamic support of ruleset ρdf and RDFS natively supported Extendible to any other fragment 13 / 27
  • 17. Architecture TRIPLE STORE Evolving Data New triples Explicit Triples Implicit Triples Streamed Triples R2 Buffer R3 Distributor R3 Distributor R2 Distributor R1 Buffer R2 Buffer R1 R2R1 R2 R1R3 Input Manager Rules Buffers Thread Pool Distributors R1 R1 R2 R2 R3 R3 Incoming triples Concurrent Access Rule Modules Input Manager R1 R2 R3 R3 R1 R2 14 / 27
  • 18. Architecture Input Manager Receives incoming triples Sends them to The triple store The rules buffers Rules Buffers A buffer for each rule Run the rule when full Run the rule when timed-out Ensures completeness Thread Pool Manages a pool instances Ensures scalability Rule instance Execute the inference Access concurrently the triple store Distributor Stores inferred triples Dispatches them to the buffers 15 / 27
  • 20. Triple Store Vertical Partitioning 2 1 3 4 5 7 6 8 9 (1,2,3) (4,2,5) (6,7,8) (6,7,9) TRIPLES ENCODING Near-optimal indexing Indexing by predicates, subjects and objects Best trade-off for nearly all rules from the OWL fragments Concurrent Access ReentrantReadWriteLocks ensure concurrency Write lock to add triples Read lock for other methods Duplicates Elimination HashMap of MultiMaps∗ Bans duplicates Ensures uniqueness of triples ∗ Google’s Guava libraries 17 / 27
  • 21. Rules Dependency Graph Directed graph Edges represent rules A → B: B can use the output of A Created at initialisation time Used to route new triples by The input manager The distributors PRP- DOM CAX- SCO PRP- RNG PRP- SPO1 SCM- SCO SCM- DOM2 SCM- RNG2 SCM- SPO Universal Input Rules Dependency Graph for ρdf 18 / 27
  • 22. Architecture TRIPLE STORE Evolving Data New triples Explicit Triples Implicit Triples Streamed Triples R2 Buffer R3 Distributor R3 Distributor R2 Distributor R1 Buffer R2 Buffer R1 R2R1 R2 R1R3 Input Manager Rules Buffers Thread Pool Distributors R1 R1 R2 R2 R3 R3 Incoming triples Concurrent Access Rule Modules Input Manager R1 R2 R3 R3 R1 R2 19 / 27
  • 23. Summary Introduction State of the art Contribution Experimental results Conclusion 20 / 27
  • 24. Experimentations Baseline OWLIM-SE (Standard Edition) Semantic repository with reasoning features Fastest reasoner available to the best of our knowledge Outperforms Jena and Sesame Natively supports RDFS, custom rule configuration for ρdf Dataset 13 ontologies from 3 sets: 2 Real life ontologies: WordNet and Wikipedia 5 generated by BSBM, from 100,000 to 5 million triples 6 subClassOf ontologies (closure computation, duplicates intensive) 21 / 27
  • 25. Experiments ρdf reasoning RDFS reasoning Ontology OWLIM Slider OWLIM Slider BSBM_100k 9.907s 4.636s 7.487s 4.558s BSBM_200k 13.338s 6.059s 11.064s 6.198s BSBM_500k 23.595s 11.133s 20.580s 10.984s BSBM_1M 39.364s 22.357s 35.602s 22.192s BSBM_5M 170.151s 126.292s 160.699s 127.037s wikipedia 18.802s 17.422s 17.186s 22.443s wordnet - - 15.075s 8.828s subClassOf10 3.507s 1.209s 1.423s 1.216s subClassOf20 3.730s 1.316s 1.536s 1.330s subClassOf50 4.159s 1.615s 1.865s 1.583s subClassOf100 4.397s 1.827s 2.242s 1.805s subClassOf200 4.962s 2.210s 2.837s 2.170s subClassOf500 9.862s 8.102s 7.584s 7.625s Improvement Average 71.47% RDFS 36.08% ρdf 106.86% RDFS owlimse slider 5 10 15 20 25 30 35 40 InferenceTime (inseconds) ρdf 5 10 15 20 25 30 35 40 45 BSBM _100k BSBM _200k BSBM _500k BSBM _1Mw ikipediaw ordnet subClassO f10 subClassO f20 subClassO f50 subClassO f100 subClassO f200 subClassO f500 owlimse slider InferenceTime (inseconds) Inference time for Slider and OWLIM-SE on ρdf and RDFS 22 / 27
  • 27. Summary Introduction State of the art Contribution Experimental results Conclusion 24 / 27
  • 28. Conclusion and Future Work Slider Efficient incremental rule-based reasoning Fragment agnocism Data streams support Improvement of 71.47% in average against baseline Future Work Implementation of new rulesets Just-in-time optimisation of rules scheduling Use of historical statistics for adaptation 25 / 27
  • 29. Bibliography I [1] Abadi, D. J., Marcus, A., Madden, S. R., and Hollenbach, K. Scalable semantic web data management using vertical partitioning. In Proceedings of the 33rd International Conference on Very Large Data Bases (2007), VLDB ’07, VLDB Endowment, pp. 411–422. [2] Cuenca Grau, B., Halaschek-Wiener, C., and Kazakov, Y. History matters: Incremental ontology reasoning using modules. In The Semantic Web. 2007. [3] Kazakov, Y., and Klinov, P. Incremental reasoning in owl el without bookkeeping. In ISWC 2013. 2013. [4] Patel-Schneider, P. Letter: Comments on WebPIE: A Web-scale parallel inference engine using MapReduce. Web Semantics: Science, Services and Agents on . . . 15 (Sept. 2012), 69–70. [5] Schlicht, A., and Stuckenschmidt, H. Mapresolve. Web Reasoning and Rule Systems 6902 (2011), 294–299. [6] Urbani, J. RDFS/OWL reasoning using the MapReduce framework. PhD thesis, Vrije Universiteit - Faculty of Sciences, 2009. [7] Urbani, J., Kotoulas, S., Maassen, J., van Harmelen, F., and Bal, H. E. WebPIE: A Web-scale parallel inference engine using MapReduce. J. Web Sem. 10 (2012), 59–75. 26 / 27
  • 30. Slider: an Efficient Incremental Reasoner Thank you for your attention jules.chevalier@univ-st-etienne.fr https://github.com/juleschevalier/slider http://demo-satin.telecom-st-etienne.fr/slider/ 27 / 27