SlideShare une entreprise Scribd logo
1  sur  22
Creating Knowledge out of Interlinked Data
          JIST 2012 – Page 1                                                      http://lod2.eu




  Improving the Performance of the
 DL-Learner SPARQL Component for
     Semantic Web Applications
                  Didier Cherix, Sebastian Hellmann, Jens Lehmann

                         http://slideshare.net/kurzum


                                                          http://dl-learner.org
                                                             http://lod2.eu


                                                            AKSW, Universität Leipzig
LOD2 Presentation . 02.09.2010 . Page                                      http://lod2.eu
JIST 2012 – Page 2                                               http://lod2.eu




              Motivation: 2007 - 2012

DL-Learner was developed in parallel to DBpedia at University Leipzig since 2007

DL-Learner is a tool for learning concepts in Description Logics (DLs) from user-
provided examples.

Worked very well for small to medium sized data sets, e.g. Carcinogenesis an other
ML problems from the UCI ML repository

Limit is the capacity of current OWL-DL reasoners

Challenge was (and is) to do reasoning-based, supervized Machine Learning on
the DBpedia Dataset (> 200 Mio triples) or larger datasets
JIST 2012 – Page 3         http://lod2.eu




 Introduction DL-Learner
JIST 2012 – Page 4                                    http://lod2.eu




 Introduction DL-Learner




                           Very large search space

                           Reasoner instance checks
JIST 2012 – Page 5         http://lod2.eu




 Introduction DL-Learner
JIST 2012 – Page 6                                             http://lod2.eu




              Introduction DL-Learner
DL-Learner heavily relies on instance checks for machine learning, so the OWL
Reasoner is the bottle neck

Underlying idea:
Only select relevant data for the Machine Learning Problem based on user-given
examples

→ Reduces the amount of triples that have to be given to a reasoner
→ Reduces complexity and size of the OWL schema

Brute-force approach:
Load all data into the OWL Reasoner, then do instance checks
→ infeasible for Dbpedia

Iterative approach (old component):
Iterate over all instances and fetch the data recursively
→ inefficient even with caching
JIST 2012 – Page 7         http://lod2.eu




 Introduction DL-Learner
JIST 2012 – Page 8         http://lod2.eu




 Introduction DL-Learner
JIST 2012 – Page 9         http://lod2.eu




 Introduction DL-Learner
JIST 2012 – Page 10        http://lod2.eu




 Introduction DL-Learner
JIST 2012 – Page 11        http://lod2.eu




 Introduction DL-Learner
JIST 2012 – Page 12                                              http://lod2.eu




 Introduction DL-Learner




                           Challenge:
   What is the most efficient way to retrieve such a fragment?
JIST 2012 – Page 13                           http://lod2.eu




           Improvements of the New Component

•   Step 1: Indexing the T-Box:
     • Download the OWL Schema and index it in memory
     • either via SPARQL or OWL file
JIST 2012 – Page 14                                             http://lod2.eu




              Improvements of the New Component

 •   Step 2: A-Box Queries




Parameter recursion depth:
Retrieve newly discovered bindings to ?o until a certain depth is reached.
JIST 2012 – Page 15                  http://lod2.eu




           Improvements of the New Component

•   Step 3: Typing the retrieved instances
JIST 2012 – Page 16                                              http://lod2.eu




           Improvements of the New Component

•   Step 4: T-Box Index:
    All “relevant” T-Box information is added via the index to the fragment.
    For each class already in the fragment. all superclasses and their
    equivalentClass axioms are added
JIST 2012 – Page 17       http://lod2.eu




             Benchmarking - Speed

For each class in DBpedia Ontology:
- 30 instances as positives
- 30 negatives from a sister class
JIST 2012 – Page 18                                     http://lod2.eu




 Benchmarking – F-Measure on the training data




             70% of the results for each class
     had an F-measure of 90-100% on the training data
JIST 2012 – Page 19                                              http://lod2.eu




              SPARQL Retrieval Component Impact

•    DL-Learner – http://dl-learner.org
•    DBpedia Navigator
•    Tiger Corpus Navigator
•    AutoSPARQL - http://autosparql.dl-learner.org/
•    HANNE – http://hanne.aksw.org
•    ORE - http://aksw.org/Projects/ORE


    Sebastian Hellmann, Jens Lehmann und Sören Auer:
    Learning of OWL Class Descriptions on Very Large Knowledge Bases
    In: International Journal on Semantic Web and Information Systems, 2009


     Web Applications
     Active Learning → User Interaction and Feedback
JIST 2012 – Page 20                                                 http://lod2.eu




           Future Work

•   Research Paper in Session 4b (tomorrow at 15:10)
    Navigation-induced Knowledge Engineering by Example
•   Caching + more sophisticated options
•   Large scale learning problems


                          http://slideshare.net/kurzum


                                Homepage: http://dl-learner.org
                                Source code:
                                http://sourceforge.net/projects/dl-learner/
JIST 2012 – Page 21                                                               http://lod2.eu




             Example




Sebastian Hellmann, Jens Lehmann, Jörg Unbehauen, Claus Stadler, Thanh Nghia Lam und Markus
Strohmaier: Navigation-induced Knowledge Engineering by Example
In: JIST 2012
JIST 2012 – Page 22                                              http://lod2.eu




          Example




Sebastian Hellmann, Jens Lehmann und Sören Auer:
Learning of OWL Class Descriptions on Very Large Knowledge Bases
In: International Journal on Semantic Web and Information Systems, 2009

Contenu connexe

En vedette

Linked Data in Linguistics for NLP and Web Annotation
Linked Data in Linguistics for NLP and Web AnnotationLinked Data in Linguistics for NLP and Web Annotation
Linked Data in Linguistics for NLP and Web AnnotationSebastian Hellmann
 
Lider Reference Model ld4lt session March, 3rd, 2015
Lider Reference Model ld4lt session  March, 3rd, 2015Lider Reference Model ld4lt session  March, 3rd, 2015
Lider Reference Model ld4lt session March, 3rd, 2015Sebastian Hellmann
 
NIF - Version 1.0 - 2011/10/23
NIF - Version 1.0 - 2011/10/23NIF - Version 1.0 - 2011/10/23
NIF - Version 1.0 - 2011/10/23Sebastian Hellmann
 
NLP2RDF Wortschatz and Linguistic LOD draft
NLP2RDF Wortschatz and Linguistic LOD draftNLP2RDF Wortschatz and Linguistic LOD draft
NLP2RDF Wortschatz and Linguistic LOD draftSebastian Hellmann
 
Linked Data for Abbreviations and Segmentation
Linked Data for Abbreviations and SegmentationLinked Data for Abbreviations and Segmentation
Linked Data for Abbreviations and SegmentationSebastian Hellmann
 

En vedette (7)

Linked Data in Linguistics for NLP and Web Annotation
Linked Data in Linguistics for NLP and Web AnnotationLinked Data in Linguistics for NLP and Web Annotation
Linked Data in Linguistics for NLP and Web Annotation
 
Introduction to LDL 2012
Introduction to LDL 2012Introduction to LDL 2012
Introduction to LDL 2012
 
Tool collection as linkeddata
Tool collection as linkeddataTool collection as linkeddata
Tool collection as linkeddata
 
Lider Reference Model ld4lt session March, 3rd, 2015
Lider Reference Model ld4lt session  March, 3rd, 2015Lider Reference Model ld4lt session  March, 3rd, 2015
Lider Reference Model ld4lt session March, 3rd, 2015
 
NIF - Version 1.0 - 2011/10/23
NIF - Version 1.0 - 2011/10/23NIF - Version 1.0 - 2011/10/23
NIF - Version 1.0 - 2011/10/23
 
NLP2RDF Wortschatz and Linguistic LOD draft
NLP2RDF Wortschatz and Linguistic LOD draftNLP2RDF Wortschatz and Linguistic LOD draft
NLP2RDF Wortschatz and Linguistic LOD draft
 
Linked Data for Abbreviations and Segmentation
Linked Data for Abbreviations and SegmentationLinked Data for Abbreviations and Segmentation
Linked Data for Abbreviations and Segmentation
 

Similaire à Improving the Performance of the DL-Learner SPARQL Component for Semantic Web Applications

Navigation-induced Knowledge Engineering by Example
 Navigation-induced Knowledge Engineering by Example Navigation-induced Knowledge Engineering by Example
Navigation-induced Knowledge Engineering by ExampleSebastian Hellmann
 
Integrating NLP using Linked Data
Integrating NLP using Linked DataIntegrating NLP using Linked Data
Integrating NLP using Linked DataSebastian Hellmann
 
NIF 2.0 Tutorial: Content Analysis and the Semantic Web
NIF 2.0 Tutorial: Content Analysis and the Semantic Web  NIF 2.0 Tutorial: Content Analysis and the Semantic Web
NIF 2.0 Tutorial: Content Analysis and the Semantic Web Sebastian Hellmann
 
Learning Outcomes & Learner Achievements Management in Higher Ed
Learning Outcomes & Learner Achievements Management in Higher EdLearning Outcomes & Learner Achievements Management in Higher Ed
Learning Outcomes & Learner Achievements Management in Higher EdJad Najjar
 
Linked Open Data Visualization
Linked Open Data VisualizationLinked Open Data Visualization
Linked Open Data VisualizationLaura Po
 
Pal gov.tutorial2.session14.lab rdf-dataintegration
Pal gov.tutorial2.session14.lab rdf-dataintegrationPal gov.tutorial2.session14.lab rdf-dataintegration
Pal gov.tutorial2.session14.lab rdf-dataintegrationMustafa Jarrar
 
Pal gov.tutorial2.session16.lab rd-fa
Pal gov.tutorial2.session16.lab rd-faPal gov.tutorial2.session16.lab rd-fa
Pal gov.tutorial2.session16.lab rd-faMustafa Jarrar
 
Pal gov.tutorial2.session8.lab owl
Pal gov.tutorial2.session8.lab owlPal gov.tutorial2.session8.lab owl
Pal gov.tutorial2.session8.lab owlMustafa Jarrar
 
Milleks meile õpitehnoloogia standardid?
Milleks meile õpitehnoloogia standardid?Milleks meile õpitehnoloogia standardid?
Milleks meile õpitehnoloogia standardid?Hans Põldoja
 

Similaire à Improving the Performance of the DL-Learner SPARQL Component for Semantic Web Applications (20)

Navigation-induced Knowledge Engineering by Example
 Navigation-induced Knowledge Engineering by Example Navigation-induced Knowledge Engineering by Example
Navigation-induced Knowledge Engineering by Example
 
LOD2 Webinar Series Classification and Quality Analysis with DL Learner and ORE
LOD2 Webinar Series Classification and Quality Analysis with DL Learner and ORELOD2 Webinar Series Classification and Quality Analysis with DL Learner and ORE
LOD2 Webinar Series Classification and Quality Analysis with DL Learner and ORE
 
LOD2 Webinar Series: CubeViz
LOD2 Webinar Series: CubeViz LOD2 Webinar Series: CubeViz
LOD2 Webinar Series: CubeViz
 
Integrating NLP using Linked Data
Integrating NLP using Linked DataIntegrating NLP using Linked Data
Integrating NLP using Linked Data
 
NIF 2.0 Tutorial: Content Analysis and the Semantic Web
NIF 2.0 Tutorial: Content Analysis and the Semantic Web  NIF 2.0 Tutorial: Content Analysis and the Semantic Web
NIF 2.0 Tutorial: Content Analysis and the Semantic Web
 
NoTube: Models & Semantics
NoTube: Models & SemanticsNoTube: Models & Semantics
NoTube: Models & Semantics
 
LOD2 Webinar: UnifiedViews
LOD2 Webinar: UnifiedViewsLOD2 Webinar: UnifiedViews
LOD2 Webinar: UnifiedViews
 
LOD2 Webinar Series: 3rd relase of the Stack
LOD2 Webinar Series: 3rd relase of the StackLOD2 Webinar Series: 3rd relase of the Stack
LOD2 Webinar Series: 3rd relase of the Stack
 
LOD2 Plenary Vienna 2012: WP4 - Reuse, Interlinking and Knowledge Fusion
LOD2 Plenary Vienna 2012: WP4 - Reuse, Interlinking and Knowledge FusionLOD2 Plenary Vienna 2012: WP4 - Reuse, Interlinking and Knowledge Fusion
LOD2 Plenary Vienna 2012: WP4 - Reuse, Interlinking and Knowledge Fusion
 
LOD2 Webinar Series: LOD2 in information and publishing industry
LOD2 Webinar Series: LOD2 in information and publishing industryLOD2 Webinar Series: LOD2 in information and publishing industry
LOD2 Webinar Series: LOD2 in information and publishing industry
 
Learning Outcomes & Learner Achievements Management in Higher Ed
Learning Outcomes & Learner Achievements Management in Higher EdLearning Outcomes & Learner Achievements Management in Higher Ed
Learning Outcomes & Learner Achievements Management in Higher Ed
 
Linked Open Data Visualization
Linked Open Data VisualizationLinked Open Data Visualization
Linked Open Data Visualization
 
Free Webinar: LOD2 Stack - 1st release
Free Webinar: LOD2 Stack - 1st releaseFree Webinar: LOD2 Stack - 1st release
Free Webinar: LOD2 Stack - 1st release
 
NIF 2.0 draft for Pisa
NIF 2.0 draft for PisaNIF 2.0 draft for Pisa
NIF 2.0 draft for Pisa
 
LOD2: State of Play WP3A - Knowledge Base Creation, Enrichment and Repair
LOD2: State of Play WP3A - Knowledge Base Creation, Enrichment and RepairLOD2: State of Play WP3A - Knowledge Base Creation, Enrichment and Repair
LOD2: State of Play WP3A - Knowledge Base Creation, Enrichment and Repair
 
Pal gov.tutorial2.session14.lab rdf-dataintegration
Pal gov.tutorial2.session14.lab rdf-dataintegrationPal gov.tutorial2.session14.lab rdf-dataintegration
Pal gov.tutorial2.session14.lab rdf-dataintegration
 
Pal gov.tutorial2.session16.lab rd-fa
Pal gov.tutorial2.session16.lab rd-faPal gov.tutorial2.session16.lab rd-fa
Pal gov.tutorial2.session16.lab rd-fa
 
Pal gov.tutorial2.session8.lab owl
Pal gov.tutorial2.session8.lab owlPal gov.tutorial2.session8.lab owl
Pal gov.tutorial2.session8.lab owl
 
LOD2 Webinar Series: Zemanta / Open refine
LOD2 Webinar Series: Zemanta / Open refine LOD2 Webinar Series: Zemanta / Open refine
LOD2 Webinar Series: Zemanta / Open refine
 
Milleks meile õpitehnoloogia standardid?
Milleks meile õpitehnoloogia standardid?Milleks meile õpitehnoloogia standardid?
Milleks meile õpitehnoloogia standardid?
 

Plus de Sebastian Hellmann

Linguistic Linked Open Data, Challenges, Approaches, Future Work
Linguistic Linked Open Data, Challenges, Approaches, Future WorkLinguistic Linked Open Data, Challenges, Approaches, Future Work
Linguistic Linked Open Data, Challenges, Approaches, Future WorkSebastian Hellmann
 
DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016Sebastian Hellmann
 
LD4LT Roadmap session 19_02_2015
LD4LT Roadmap session 19_02_2015LD4LT Roadmap session 19_02_2015
LD4LT Roadmap session 19_02_2015Sebastian Hellmann
 
DBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of DataDBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of DataSebastian Hellmann
 
NIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate reportNIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate reportSebastian Hellmann
 

Plus de Sebastian Hellmann (8)

KEDL DBpedia 2019
KEDL DBpedia  2019KEDL DBpedia  2019
KEDL DBpedia 2019
 
Linguistic Linked Open Data, Challenges, Approaches, Future Work
Linguistic Linked Open Data, Challenges, Approaches, Future WorkLinguistic Linked Open Data, Challenges, Approaches, Future Work
Linguistic Linked Open Data, Challenges, Approaches, Future Work
 
DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016
 
LD4LT Roadmap session 19_02_2015
LD4LT Roadmap session 19_02_2015LD4LT Roadmap session 19_02_2015
LD4LT Roadmap session 19_02_2015
 
DBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of DataDBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of Data
 
NIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate reportNIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate report
 
Thesis presentation
Thesis presentationThesis presentation
Thesis presentation
 
NIF - NLP Interchange Format
NIF - NLP Interchange FormatNIF - NLP Interchange Format
NIF - NLP Interchange Format
 

Dernier

The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 

Dernier (20)

The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 

Improving the Performance of the DL-Learner SPARQL Component for Semantic Web Applications

  • 1. Creating Knowledge out of Interlinked Data JIST 2012 – Page 1 http://lod2.eu Improving the Performance of the DL-Learner SPARQL Component for Semantic Web Applications Didier Cherix, Sebastian Hellmann, Jens Lehmann http://slideshare.net/kurzum http://dl-learner.org http://lod2.eu AKSW, Universität Leipzig LOD2 Presentation . 02.09.2010 . Page http://lod2.eu
  • 2. JIST 2012 – Page 2 http://lod2.eu Motivation: 2007 - 2012 DL-Learner was developed in parallel to DBpedia at University Leipzig since 2007 DL-Learner is a tool for learning concepts in Description Logics (DLs) from user- provided examples. Worked very well for small to medium sized data sets, e.g. Carcinogenesis an other ML problems from the UCI ML repository Limit is the capacity of current OWL-DL reasoners Challenge was (and is) to do reasoning-based, supervized Machine Learning on the DBpedia Dataset (> 200 Mio triples) or larger datasets
  • 3. JIST 2012 – Page 3 http://lod2.eu Introduction DL-Learner
  • 4. JIST 2012 – Page 4 http://lod2.eu Introduction DL-Learner Very large search space Reasoner instance checks
  • 5. JIST 2012 – Page 5 http://lod2.eu Introduction DL-Learner
  • 6. JIST 2012 – Page 6 http://lod2.eu Introduction DL-Learner DL-Learner heavily relies on instance checks for machine learning, so the OWL Reasoner is the bottle neck Underlying idea: Only select relevant data for the Machine Learning Problem based on user-given examples → Reduces the amount of triples that have to be given to a reasoner → Reduces complexity and size of the OWL schema Brute-force approach: Load all data into the OWL Reasoner, then do instance checks → infeasible for Dbpedia Iterative approach (old component): Iterate over all instances and fetch the data recursively → inefficient even with caching
  • 7. JIST 2012 – Page 7 http://lod2.eu Introduction DL-Learner
  • 8. JIST 2012 – Page 8 http://lod2.eu Introduction DL-Learner
  • 9. JIST 2012 – Page 9 http://lod2.eu Introduction DL-Learner
  • 10. JIST 2012 – Page 10 http://lod2.eu Introduction DL-Learner
  • 11. JIST 2012 – Page 11 http://lod2.eu Introduction DL-Learner
  • 12. JIST 2012 – Page 12 http://lod2.eu Introduction DL-Learner Challenge: What is the most efficient way to retrieve such a fragment?
  • 13. JIST 2012 – Page 13 http://lod2.eu Improvements of the New Component • Step 1: Indexing the T-Box: • Download the OWL Schema and index it in memory • either via SPARQL or OWL file
  • 14. JIST 2012 – Page 14 http://lod2.eu Improvements of the New Component • Step 2: A-Box Queries Parameter recursion depth: Retrieve newly discovered bindings to ?o until a certain depth is reached.
  • 15. JIST 2012 – Page 15 http://lod2.eu Improvements of the New Component • Step 3: Typing the retrieved instances
  • 16. JIST 2012 – Page 16 http://lod2.eu Improvements of the New Component • Step 4: T-Box Index: All “relevant” T-Box information is added via the index to the fragment. For each class already in the fragment. all superclasses and their equivalentClass axioms are added
  • 17. JIST 2012 – Page 17 http://lod2.eu Benchmarking - Speed For each class in DBpedia Ontology: - 30 instances as positives - 30 negatives from a sister class
  • 18. JIST 2012 – Page 18 http://lod2.eu Benchmarking – F-Measure on the training data 70% of the results for each class had an F-measure of 90-100% on the training data
  • 19. JIST 2012 – Page 19 http://lod2.eu SPARQL Retrieval Component Impact • DL-Learner – http://dl-learner.org • DBpedia Navigator • Tiger Corpus Navigator • AutoSPARQL - http://autosparql.dl-learner.org/ • HANNE – http://hanne.aksw.org • ORE - http://aksw.org/Projects/ORE Sebastian Hellmann, Jens Lehmann und Sören Auer: Learning of OWL Class Descriptions on Very Large Knowledge Bases In: International Journal on Semantic Web and Information Systems, 2009 Web Applications Active Learning → User Interaction and Feedback
  • 20. JIST 2012 – Page 20 http://lod2.eu Future Work • Research Paper in Session 4b (tomorrow at 15:10) Navigation-induced Knowledge Engineering by Example • Caching + more sophisticated options • Large scale learning problems http://slideshare.net/kurzum Homepage: http://dl-learner.org Source code: http://sourceforge.net/projects/dl-learner/
  • 21. JIST 2012 – Page 21 http://lod2.eu Example Sebastian Hellmann, Jens Lehmann, Jörg Unbehauen, Claus Stadler, Thanh Nghia Lam und Markus Strohmaier: Navigation-induced Knowledge Engineering by Example In: JIST 2012
  • 22. JIST 2012 – Page 22 http://lod2.eu Example Sebastian Hellmann, Jens Lehmann und Sören Auer: Learning of OWL Class Descriptions on Very Large Knowledge Bases In: International Journal on Semantic Web and Information Systems, 2009