Creating Knowledge out of Interlinked Data




        LOD2 Paris Meeting:
        WP3 Overview
        Knowledge Base Creation, Enrichment and Repair




                                                    Jens Lehmann
                                                        AKSW, Universität Leipzig
LOD2 Presentation . 02.09.2010 . Page                                http://lod2.eu




Outline




      • General WP3 Overview (Jens Lehmann)
            • WP structure
            • Deliverables
            • Progress

      • Task 3.2 Report: NLP2RDF + NIF (Sebastian Hellmann)




LOD2 Event . 06.09.2010 . Page 2                              http://lod2.eu




WP 3 Task Overview


      • Research WP, 76 PMs, InfAI (37), NUIG (10), FUB (17),
          OpenLink (5), Exalead (7)

      • 3.1: Provenance-Aware Extraction of Linked Data from Existing
          Structured Formats

      • 3.2: Provenance-Aware Extraction of Linked Data from
          Unstructured and Semi-Structured Sources

      • 3.3: Knowledge Base Schema Enrichment
      • 3.4: Knowledge Base Repair
      • 3.5: Web Linkage Validator





WP 3 Goals


  • General Goal: creation, improvement, repair of knowledge bases
  • Focus: very large knowledge bases, diverse knowledge,
       web/linked data

  • Refine existing triplification approaches (Virtuoso Sponger,
       RDF Views, Triplify, D2R)

  • Improve schema of knowledge bases based on data
  • Fix problems in knowledge bases, e.g. inconsistencies
  • Techniques: Semi-automatic machine learning, ontology
       debugging, NLP, shallow parsing etc.





WP 3 Task 3.1

      • Provenance-Aware Extraction of Linked Data from Existing
          Structured Formats (spreadsheets, relational databases, CMS,
          logs, XML documents)

      • Partners: FUB, InfAI, OpenLink, Exalead
      • Provide: process description + tools
      • Standardisation of RDB2RDF mapping
      • Draws on existing tools/frameworks:
           • D2R (FUB)
           • Triplify (InfAI)
           • Virtuoso Sponger (OpenLink)

      • Deliverables: State-of-the-Art Report (M6), D2R release (M20),
          Triplify release (M20)
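
A minimal sketch of what provenance-aware extraction from a relational source can look like. This is not how D2R, Triplify or the Virtuoso Sponger work internally; the function name row_to_triples, the http://example.org/ namespace and the use of dcterms:source as the provenance property are all assumptions made for illustration.

```python
from urllib.parse import quote

# Assumption: dcterms:source is used to record where a row came from.
DCT_SOURCE = "http://purl.org/dc/terms/source"


def row_to_triples(base, table, key, row, source):
    """Turn one relational row into N-Triples lines plus a provenance triple."""
    subject = f"<{base}{table}/{quote(str(row[key]))}>"
    lines = []
    for column, value in row.items():
        if column == key or value is None:
            continue  # the key is already in the subject IRI; NULLs yield no triple
        literal = (
            str(value)
            .replace("\\", "\\\\")
            .replace('"', '\\"')
            .replace("\n", "\\n")
        )
        lines.append(f'{subject} <{base}{table}#{column}> "{literal}" .')
    # Provenance: record which source the row was extracted from.
    lines.append(f"{subject} <{DCT_SOURCE}> <{source}> .")
    return lines


# Hypothetical table and row, for illustration only.
triples = row_to_triples(
    base="http://example.org/",
    table="person",
    key="id",
    row={"id": 7, "name": "Ada", "email": None},
    source="http://example.org/db/person",
)
print("\n".join(triples))
```

Every value is emitted as a plain literal here; a real mapping would also need datatypes, foreign keys turned into links, and a richer provenance vocabulary.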




WP 3 Task 3.1 - Progress


      • D2R Server MetaData Extension (allows adding licensing and
          provenance output to D2R server)

      • Deliverable 3.1.1 completed: state of the art report about
          knowledge extraction from structured sources
           • 200+ tools collected at http://data.lod2.eu/2011/tools/
           • http://en.wikipedia.org/wiki/Knowledge_extraction created

      • Addition of RDF2Triggers to RDF Views in Virtuoso: enables
          materialisation and synchronisation of RDF views as physical
          triples

      • Virtuoso sponger cartridges extended





WP 3 Task 3.2

  • Provenance-Aware Extraction of Linked Data from Unstructured
      and Semi-Structured Sources (HTML, PDF, Office documents with
      metadata)

  • Partners: FUB, InfAI, OpenLink, Exalead
  • NLP techniques / text understanding (combine approaches, not
      invent them)

  • Draws on existing tools:
        • NLP2RDF (InfAI)
        • Stanford Parser, ASV toolkit, Zemanta, Ontos API (all external)
        • DBpedia (FUB, InfAI, OpenLink)

  • Deliverables: NLP2RDF release (M8), DBpedia Live (M8), DBpedia
      Framework Extension (M20)




WP 3 Task 3.2 - Progress



  • NLP2RDF + NIF: presented by Sebastian
  • DBpedia Live:
        • New server acquired
        • Running at http://live.dbpedia.org/sparql/ (beta version)

  • DBpedia I18N committee founded and multi-language support
       extended

  • DBpedia Spotlight released (http://dbpedia.org/spotlight): tool for
       annotating mentions of DBpedia resources in text
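
A minimal sketch of how a client could address the DBpedia Live SPARQL endpoint listed above. It only builds the request URL following the SPARQL Protocol convention of passing the query text in the "query" parameter, and sends nothing over the network. The example query, the Leipzig resource and the function name build_query_url are assumptions, and the beta endpoint may no longer be reachable at that address.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Endpoint address as given on the slide (beta version at the time).
ENDPOINT = "http://live.dbpedia.org/sparql/"

# Hypothetical example query: labels of one DBpedia resource.
QUERY = """SELECT ?label WHERE {
  <http://dbpedia.org/resource/Leipzig> <http://www.w3.org/2000/01/rdf-schema#label> ?label
} LIMIT 5"""


def build_query_url(endpoint, query):
    """Build a SPARQL Protocol GET URL; the query goes in the 'query' parameter."""
    return endpoint + "?" + urlencode({"query": query})


url = build_query_url(ENDPOINT, QUERY)
print(url)

# Round trip: decoding the URL gives back the original query text.
decoded = parse_qs(urlsplit(url).query)["query"][0]
print(decoded == QUERY)
```

The resulting URL can be fetched with any HTTP client; the desired result format would be requested through the Accept header.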







WP 3 Task 3.3

  • Knowledge Base Schema Enrichment
  • Partners: InfAI
  • Suggests OWL Schema Axioms to Knowledge Base Maintainers
      (Definitions, Super Classes, Disjointness)

  • Tightly coupled to Task 3.4
  • Adapts existing approaches to work with very large Linked Data
      knowledge bases

  • Uses DL-Learner (InfAI) and external ontology learning approaches
  • Deliverables: Enrichment Method Report (M12), User Interface
      (M24), Evaluation (M36)
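
A minimal sketch of the idea behind suggesting super class and disjointness axioms from instance data. It is a naive set comparison, not the DL-Learner algorithm: class A is proposed as a subclass of B when every known instance of A is also an instance of B, and two classes are proposed as disjoint when they share no instance. The function name suggest_axioms and the sample data are assumptions.

```python
from itertools import combinations, permutations


def suggest_axioms(types):
    """types maps each instance to the set of classes it is asserted to belong to."""
    members = {}
    for instance, classes in types.items():
        for cls in classes:
            members.setdefault(cls, set()).add(instance)

    suggestions = []
    for a, b in permutations(sorted(members), 2):
        if members[a] < members[b]:  # proper subset: every A is a B, not vice versa
            suggestions.append(("SubClassOf", a, b))
    for a, b in combinations(sorted(members), 2):
        if members[a].isdisjoint(members[b]):
            suggestions.append(("DisjointClasses", a, b))
    return suggestions


# Hypothetical instance data, for illustration only.
data = {
    "leipzig": {"City", "Place"},
    "paris": {"City", "Place"},
    "elbe": {"River", "Place"},
}
axioms = suggest_axioms(data)
for axiom in axioms:
    print(axiom)
```

Suggestions of this kind are only as good as the data, which is why they go to a maintainer for review rather than being added automatically.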





WP 3 Task 3.4

  • Knowledge Base Repair
  • Partners: InfAI, NUIG
  • Fix inconsistent knowledge bases, unsatisfiable classes, (some)
       modelling errors, (some) reasoning performance problems

  • Draws on a lot of existing work in ontology debugging and
       extends it to knowledge bases in the LOD cloud

  • Related to quality measures in WP4
  • Result: ORE tool (together with Task 3.3)
  • Deliverables: Report on Modelling Errors/Problems (M6), 1st ORE
       Release (M28), 2nd ORE Release (M40)
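
A minimal sketch of one kind of error named above, the unsatisfiable class. It covers only the simplest case, a named class that inherits from two classes declared disjoint, and is not the method used by ORE; a full OWL reasoner detects many more cases. The function names and the sample hierarchy are assumptions.

```python
def superclasses(cls, subclass_of):
    """Return cls plus all its direct and inherited superclasses."""
    seen, stack = set(), [cls]
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(subclass_of.get(current, ()))
    return seen


def unsatisfiable_classes(subclass_of, disjoint_pairs):
    """Find classes that inherit from two classes declared disjoint."""
    problems = {}
    for cls in subclass_of:
        ancestors = superclasses(cls, subclass_of)
        for a, b in disjoint_pairs:
            if a in ancestors and b in ancestors:
                problems[cls] = (a, b)  # report the first conflicting pair
                break
    return problems


# Hypothetical schema, for illustration only.
subclass_of = {
    "Person": ["Agent"],
    "Organisation": ["Agent"],
    "Band": ["Person", "Organisation"],  # modelling error
}
disjoint_pairs = [("Person", "Organisation")]
problems = unsatisfiable_classes(subclass_of, disjoint_pairs)
print(problems)
```

Reporting the conflicting pair alongside the class gives the maintainer a starting point for deciding which axiom to remove.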




WP 3 Task 3.4 - Progress

  • Google Code project for ORE (Ontology Repair and Enrichment)
       tool started: http://code.google.com/p/ore/

  • Domain http://ore-tool.net/ with basic instructions
  • ORE 0.2 released (desktop version – web version in development
       at http://web.ore-tool.net)

  • ORE paper accepted at ISWC
  • Deliverable 3.4.1 completed (state of the art report on
       detectable errors in knowledge bases)

  • Preliminary work on algorithms for supporting debugging
       SPARQL endpoints and Linked Data





WP 3 Task 3.5


  • Web Linkage Validator
  • Partners: NUIG
  • Tightly coupled to Task 4.2 (Unsupervised Interlinking)
  • Creates linkage reports for knowledge base maintainers
  • Could suggest adding further properties, using more specific
       property values, and better specifying classes/properties for
       knowledge base entities

  • Deliverables: Initial Release (M18), LOD2 Stack Component
       Release (M28)





        Thanks for your attention!




LOD2: State of Play WP3A - Knowledge Base Creation, Enrichment and Repair
