Semi-Automatic Example-Driven
Linked Data Mapping Creation
Pieter Heyvaert
pheyvaer.heyvaert@ugent.be
1
Semantic Web technologies rely on Linked Data,
but not all data is accessible as Linked Data.
databases
XML files
Solutions to provide access exist,
but results are not always as desired because limited knowledge is used:
data schema
ontology
2
Goal: improve access to data as Linked Data
3
Overview
problem
current solutions
approach
discussion
4
Input data
id title author
0 Harry Potter and The Sorcerer’s Stone J.K. Rowling
1 Homo Deus Yuval Noah Harari
{
"authors": [{
"id": "jkr",
"name": "J.K. Rowling",
"country": "UK",
"birthdate": "1965-07-31"
},{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}]
}
6
Desired Linked Data
book:0 a schema:Book;
schema:title "Harry Potter and The Sorcerer’s Stone"@en;
schema:author author:jkr.
book:1 a schema:Book;
schema:title "Homo Deus"@en;
schema:author author:ynh.
author:jkr a foaf:Person;
foaf:name "J.K. Rowling";
foaf:country "UK";
schema:birthdate "1965-07-31"^^xsd:date.
author:ynh a foaf:Person;
foaf:name "Yuval Noah Harari";
foaf:country "Israel";
schema:birthdate "1976-04-24"^^xsd:date.
7
Apply rules to generate Linked Data
original
data
Linked
Data
rules
rules state how to generate RDF terms and triples using data and ontologies
8
Linked Data example available
9
book:1 a schema:Book;
schema:title "Homo Deus"@en;
schema:author author:ynh.
author:ynh a foaf:Person;
foaf:name "Yuval Noah Harari";
foaf:country "Israel";
schema:birthdate "1976-04-24"^^xsd:date.
Use example to create rules
sample example
rules
10
original
data
Linked
Data
Linked Data example aligns
with sample of original data
id title author
1 Homo Deus Yuval Noah Harari
{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}
11
book:1 a schema:Book;
schema:title "Homo Deus"@en;
schema:author author:ynh.
author:ynh a foaf:Person;
foaf:name "Yuval Noah Harari";
foaf:country "Israel";
schema:birthdate "1976-04-24"^^xsd:date.
Align with original data and create rules
book:1 a schema:Book;
schema:title "Homo Deus"@en;
schema:author author:ynh.
author:ynh a foaf:Person;
foaf:name "Yuval Noah Harari";
foaf:country "Israel";
schema:birthdate "1976-04-24"^^xsd:date.
id title author
1 Homo Deus Yuval Noah Harari
{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}
id
12
rule: IRI is “book” + value from column “id”
Align with original data and create rules
book:1 a schema:Book;
schema:title "Homo Deus"@en;
schema:author author:ynh.
author:ynh a foaf:Person;
foaf:name "Yuval Noah Harari";
foaf:country "Israel";
schema:birthdate "1976-04-24"^^xsd:date.
id title author
1 Homo Deus Yuval Noah Harari
{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}
title
13
rule: literal uses value from column “title”
Align with original data and create rules
book:1 a schema:Book;
schema:title "Homo Deus"@en;
schema:author author:ynh.
author:ynh a foaf:Person;
foaf:name "Yuval Noah Harari";
foaf:country "Israel";
schema:birthdate "1976-04-24"^^xsd:date.
id title author
1 Homo Deus Yuval Noah Harari
{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}
title property
14
rule: predicate is schema:title
Align with original data and create rules
book:1 a schema:Book;
schema:title "Homo Deus"@en;
schema:author author:ynh.
author:ynh a foaf:Person;
foaf:name "Yuval Noah Harari";
foaf:country "Israel";
schema:birthdate "1976-04-24"^^xsd:date.
id title author
1 Homo Deus Yuval Noah Harari
{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}
type
15
rule: type of a book is schema:Book
Align with original data and create rules
book:1 a schema:Book;
schema:title "Homo Deus"@en;
schema:author author:ynh.
author:ynh a foaf:Person;
foaf:name "Yuval Noah Harari";
foaf:country "Israel";
schema:birthdate "1976-04-24"^^xsd:date.
id title author
1 Homo Deus Yuval Noah Harari
{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}
other entity
16
rule: a book is related to its author
All rules
IRI is “book” + value from column “id”
Literal uses value from column “title”
Predicate is schema:title
Type of a book is schema:Book
A book is related to its author
17
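The five rules above can be sketched in Python as follows. This is only an illustration, not the rule language or tooling used in the talk: the function name `generate_book_triples`, the comma-separated layout of the CSV sample, the plain-tuple representation of triples, and the lookup of authors by name are all assumptions made for this sketch.

```python
import csv
import io
import json

# Sample inputs copied from the "Input data" slide (CSV layout is assumed).
CSV_TEXT = """id,title,author
0,Harry Potter and The Sorcerer’s Stone,J.K. Rowling
1,Homo Deus,Yuval Noah Harari
"""
AUTHORS_JSON = """{"authors": [
  {"id": "jkr", "name": "J.K. Rowling", "country": "UK", "birthdate": "1965-07-31"},
  {"id": "ynh", "name": "Yuval Noah Harari", "country": "Israel", "birthdate": "1976-04-24"}
]}"""


def generate_book_triples(csv_text, authors_json):
    """Apply the five book rules to every CSV row (hypothetical helper)."""
    authors = json.loads(authors_json)["authors"]
    # Assumption: a book row is linked to an author whose name matches.
    author_id_by_name = {a["name"]: a["id"] for a in authors}
    triples = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        subject = "book:" + row["id"]                   # IRI is "book" + id
        triples.append((subject, "a", "schema:Book"))   # type is schema:Book
        literal = '"' + row["title"] + '"@en'           # literal from column "title"
        triples.append((subject, "schema:title", literal))  # predicate is schema:title
        author_id = author_id_by_name.get(row["author"])
        if author_id is not None:                       # book is related to its author
            triples.append((subject, "schema:author", "author:" + author_id))
    return triples


for triple in generate_book_triples(CSV_TEXT, AUTHORS_JSON):
    print(" ".join(triple))
```

Rules for the author entities (name, country, birthdate) would follow the same pattern over the JSON source and are left out here.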
Apply rules to generate all Linked Data
sample example
rules
18
original
data
Linked
Data
Linked Data might not be as desired
Rules are prone to errors when created manually
Wrong use of ontology classes, properties, and datatypes
Wrong alignments with original data
Especially when dealing
with large and complex data sources
multiple data sources at the same time
19
Overview
problem
current solutions
approach
discussion
20
Solutions to reduce manual effort
when creating rules
Semi-automatic: users provide feedback
Automatic: no user interaction required
22
Current solutions use limited knowledge
Only work with
data schemas
data values
ontologies
Do not consider knowledge embedded in
query workload of Linked Data
Linked Data examples
23
Overview
problem
current solutions
approach
discussion
24
We propose a semi-automatic
example-driven approach to create rules.
26
Overview
problem
current solutions
approach
data source alignment
rule creation
optional adjustments
Linked Data generation
discussion
27
book:1 a schema:Book;
schema:title "Homo Deus"@en;
schema:author author:ynh.
Visualize Linked Data example
author:ynh a foaf:Person;
foaf:name "Yuval Noah Harari";
foaf:country "Israel";
schema:birthdate "1976-04-24"^^xsd:date.
schema:Book foaf:Person
Homo Deus
@en
Yuval Noah Harari Israel 1976-04-24
xsd:date
book:1 author:ynh
schema:author
schema:title
foaf:name schema:birthdate
foaf:country
28
Overview
problem
current solutions
approach
data source alignment
rule creation
optional adjustments
Linked Data generation
discussion
29
{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}
Align with data sources
schema:Book foaf:Person
Homo Deus
@en
Yuval Noah Harari Israel 1976-04-24
xsd:date
book:1 author:ynh
schema:author
schema:title
foaf:name schema:birthdate
foaf:country
id title author
1 Homo Deus Yuval Noah Harari
CSV
JSON
31
{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}
Align with data sources
schema:Book foaf:Person
Homo Deus
@en
Yuval Noah Harari Israel 1976-04-24
xsd:date
book:1 author:ynh
schema:author
schema:title
foaf:name schema:birthdate
foaf:country
id title author
1 Homo Deus Yuval Noah Harari
CSV
CSV
JSON
32
{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}
Align with data sources
schema:Book foaf:Person
Homo Deus
@en
Yuval Noah Harari Israel 1976-04-24
xsd:date
book:1 author:ynh
schema:author
schema:title
foaf:name schema:birthdate
foaf:country
id title author
1 Homo Deus Yuval Noah Harari
CSV
CSV
CSV
JSON
33
Align with data sources
schema:Book foaf:Person
Homo Deus
@en
Yuval Noah Harari Israel 1976-04-24
xsd:date
book:1 author:ynh
schema:author
schema:title
foaf:name schema:birthdate
foaf:country
JSON CSV
CSV CSV JSON JSON JSON
{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}
id title author
1 Homo Deus Yuval Noah Harari
CSV
JSON
34
Select best data source
schema:Book foaf:Person
Homo Deus
@en
Yuval Noah Harari Israel 1976-04-24
xsd:date
book:1 author:ynh
schema:author
schema:title
foaf:name schema:birthdate
foaf:country
JSON CSV
CSV CSV JSON JSON JSON
for each subgraph with an entity
35
Select best data source
schema:Book foaf:Person
Homo Deus
@en
Yuval Noah Harari Israel 1976-04-24
xsd:date
book:1 author:ynh
schema:author
schema:title
foaf:name schema:birthdate
foaf:country
JSON CSV
CSV CSV JSON JSON JSON
only CSV data source
CSV
36
Select best data source
schema:Book foaf:Person
Homo Deus
@en
Yuval Noah Harari Israel 1976-04-24
xsd:date
book:1 author:ynh
schema:author
schema:title
foaf:name schema:birthdate
foaf:country
JSON CSV
CSV CSV JSON JSON JSON
CSV data source matches 1 node
JSON data source matches all nodes
JSON
37
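One way to read the selection step is as a simple count: for each subgraph, pick the data source that matches the most nodes. The sketch below assumes exactly that scoring; the slides do not state how matches are weighed, and the names `count_matches` and `select_best_source` are hypothetical. Entity nodes are represented by the local part of their IRI ("1", "ynh"), which is also an assumption.

```python
def count_matches(node_values, source_values):
    """Count how many node values of a subgraph occur in a data source."""
    return sum(1 for value in node_values if value in source_values)


def select_best_source(node_values, sources):
    """Return the name of the source that matches the most nodes."""
    return max(sources, key=lambda name: count_matches(node_values, sources[name]))


# Values found in the sample of each data source.
sources = {
    "CSV": {"1", "Homo Deus", "Yuval Noah Harari"},
    "JSON": {"ynh", "Yuval Noah Harari", "Israel", "1976-04-24"},
}

# Node values per subgraph of the Linked Data example.
book_nodes = ["1", "Homo Deus"]
author_nodes = ["ynh", "Yuval Noah Harari", "Israel", "1976-04-24"]

print(select_best_source(book_nodes, sources))
print(select_best_source(author_nodes, sources))
```

With the sample data, only the CSV source matches the book subgraph, while the author subgraph matches one CSV value and all JSON values, which agrees with the choice shown on the slides.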
Overview
problem
current solutions
approach
data source alignment
rule creation
optional adjustments
Linked Data generation
discussion
38
Rule creation
For every entity
For every attribute
For all interlinked entities
39
Create rules for entity
schema:Book
book:1
CSV
IRI is “book” + id
type is schema:Book
40
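The entity rule can be derived by comparing the example IRI with the aligned record. The sketch below assumes the IRI ends with the value of exactly one field and treats the rest as a fixed prefix; `infer_iri_rule` is a hypothetical helper, not part of the approach as published.

```python
def infer_iri_rule(example_iri, record):
    """Return (prefix, column) such that prefix + record[column] == example_iri.

    Returns None when no field value is a suffix of the example IRI.
    """
    for column, value in record.items():
        value = str(value)
        if value and example_iri.endswith(value):
            return example_iri[: -len(value)], column
    return None


record = {"id": "1", "title": "Homo Deus", "author": "Yuval Noah Harari"}
prefix, column = infer_iri_rule("book:1", record)

# Applying the inferred rule to another row of the same source.
other_row = {"id": "0", "title": "Harry Potter and The Sorcerer’s Stone"}
print(prefix + other_row[column])
```

The type rule needs no inference: the class stated in the example (schema:Book) is copied as is.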
Create rules for attribute
schema:Book
book:1
CSV
use predicate schema:title
literal uses value from column title
language of the title is English
Homo Deus
@en
schema:title
CSV
41
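An attribute rule could be derived in a similar way, by looking for the field whose value equals the literal in the example. This is a minimal sketch under the assumption of exact string equality; `infer_literal_rule` and the dictionary layout of the rule are hypothetical.

```python
def infer_literal_rule(predicate, literal_value, language, record):
    """Find the field holding the example literal and describe the rule."""
    for column, value in record.items():
        if str(value) == literal_value:
            return {"predicate": predicate, "column": column, "language": language}
    return None


record = {"id": "1", "title": "Homo Deus", "author": "Yuval Noah Harari"}
rule = infer_literal_rule("schema:title", "Homo Deus", "en", record)
print(rule)
```

The predicate and the language tag are taken directly from the example triple; only the column has to be found in the data.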
Create rules for interlinked entities
schema:Book
book:1
CSV
use predicate schema:author
join condition: names match
foaf:Person
author:ynh
schema:author
JSON
42
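The join condition between two sources might be found by looking for a pair of fields with equal values in the aligned sample records. The sketch below assumes exact equality and returns the first matching pair; `infer_join_condition` is a hypothetical name.

```python
def infer_join_condition(parent_record, child_record):
    """Return (parent_field, child_field) for the first pair of equal values."""
    for parent_field, parent_value in parent_record.items():
        for child_field, child_value in child_record.items():
            if parent_value == child_value:
                return parent_field, child_field
    return None


book = {"id": "1", "title": "Homo Deus", "author": "Yuval Noah Harari"}
author = {
    "id": "ynh",
    "name": "Yuval Noah Harari",
    "country": "Israel",
    "birthdate": "1976-04-24",
}

condition = infer_join_condition(book, author)
print(condition)
```

For the sample, the CSV column "author" and the JSON field "name" hold the same value, which corresponds to the "names match" condition on the slide.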
Overview
problem
current solutions
approach
data source alignment
rule creation
optional adjustments
Linked Data generation
discussion
43
Optional adjustments
Fix errors if needed
Add additional rules to deal with exceptions
44
Overview
problem
current solutions
approach
data source alignment
rule creation
optional adjustments
Linked Data generation
discussion
45
Apply rules to generate all Linked Data
sample example
46
original
data
Linked
Data
rules
Overview
problem
current solutions
approach
data source alignment
rule creation
optional adjustments
Linked Data generation
discussion
47
Discussion
Advantages
Use knowledge embedded in Linked Data examples
Minimize errors and user interaction
Approach can be combined with other approaches
Disadvantages
Linked Data example is required
User action might still be required for special cases
48
Recap
Use cases can have a Linked Data example available.
The example contains knowledge to create rules.
We introduced an approach that uses this knowledge.
This approach can be combined with other approaches.
49


Semi-Automatic Example-Driven Linked Data Mapping Creation

  • 1. Semi-Automatic Example-Driven Linked Data Mapping Creation Pieter Heyvaert pheyvaer.heyvaert@ugent.be 1
  • 2. Semantic Web technologies rely on Linked Data, but not all data is accessible as Linked Data. databases XML files Solutions to provide access exist, but results are not always as desired because limited knowledge is used: data schema ontology 2
  • 3. Goal: improve access to data as Linked Data 3
  • 6. Input data id title author 0 Harry Potter and The Sorcerer’s Stone J.K. Rowling 1 Homo Deus Yuval Noah Harari { "authors": [{ "id": "jkr", "name": "J.K. Rowling", "country": "UK" "birthdate": "1965-07-31" },{ "id": "ynh", "name": "Yuval Noah Harari", "country": "Israel", "birthdate": "1976-04-24" }] } 6
  • 7. Desired Linked Data book:0 a schema:Book; schema:title "Harry Potter and The Sorcerer’s Stone"@en; schema:author author:jkr. book:1 a schema:Book; schema:title "Homo Deus"@en; schema:author author:ynh. author:jkr a foaf:Person; foaf:name "J.K. Rowling"; foaf:country "UK"; schema:birthdate "1965-07-21"^^xsd:date. author:ynh a foaf:Person; foaf:name "Yuval Noah Harari"; foaf:country "UK"; schema:birthdate "1976-04-24"^^xsd:date. 7
  • 8. Apply rules to generate Linked Data original data Linked Data rules rules state how to generate RDF terms and triples using data and ontologies 8
  • 9. Linked Data example available 9 book:1 a schema:Book; schema:title "Homo Deus"@en; schema:author author:ynh. author:ynh a foaf:Person; foaf:name "Yuval Noah Harari"; foaf:country "Israel"; schema:birthdate "1976-04-24"^^xsd:date.
  • 10. Use example to create rules sample example rules 10 original data Linked Data
  • 11. Linked Data example aligns with sample of original data id title author 1 Homo Deus Yuval Noah Harari { "id": "ynh", "name": "Yuval Noah Harari", "country": "Israel", "birthdate": "1976-04-24" } 11 book:1 a schema:Book; schema:title "Homo Deus"@en; schema:author author:ynh. author:ynh a foaf:Person; foaf:name "Yuval Noah Harari"; foaf:country "Israel"; schema:birthdate "1976-04-24"^^xsd:date.
  • 12. Alignment with original data and create rules book:1 a schema:Book; schema:title "Homo Deus"@en; schema:author author:ynh. author:ynh a foaf:Person; foaf:name "Yuval Noah Harari"; foaf:country "Israel"; schema:birthdate "1976-04-24"^^xsd:date. id title author 1 Homo Deus Yuval Noah Harari { "id": "ynh", "name": "Yuval Noah Harari", "country": "Israel", "birthdate": "1976-04-24" } id 12 rule: IRI is “book” + value from column “id”
  • 13. Alignment with original data and create rules book:1 a schema:Book; schema:title "Homo Deus"@en; schema:author author:ynh. author:ynh a foaf:Person; foaf:name "Yuval Noah Harari"; foaf:country "Israel"; schema:birthdate "1976-04-24"^^xsd:date. id title author 1 Homo Deus Yuval Noah Harari { "id": "ynh", "name": "Yuval Noah Harari", "country": "Israel", "birthdate": "1976-04-24" } title 13 rule: literal uses value from column “title”
  • 14. Alignment with original data and create rules book:1 a schema:Book; schema:title "Homo Deus"@en; schema:author author:ynh. author:ynh a foaf:Person; foaf:name "Yuval Noah Harari"; foaf:country "Israel"; schema:birthdate "1976-04-24"^^xsd:date. id title author 1 Homo Deus Yuval Noah Harari { "id": "ynh", "name": "Yuval Noah Harari", "country": "Israel", "birthdate": "1976-04-24" } titleproperty 14 rule: predicate is schema:title
  • 15. Alignment with original data and create rules book:1 a schema:Book; schema:title "Homo Deus"@en; schema:author author:ynh. author:ynh a foaf:Person; foaf:name "Yuval Noah Harari"; foaf:country "Israel"; schema:birthdate "1976-04-24"^^xsd:date. id title author 1 Homo Deus Yuval Noah Harari { "id": "ynh", "name": "Yuval Noah Harari", "country": "Israel", "birthdate": "1976-04-24" } type 15 rule: type of a book is schema:Book
  • 16. Alignment with original data and create rules book:1 a schema:Book; schema:title "Homo Deus"@en; schema:author author:ynh. author:ynh a foaf:Person; foaf:name "Yuval Noah Harari"; foaf:country "Israel"; schema:birthdate "1976-04-24"^^xsd:date. id title author 1 Homo Deus Yuval Noah Harari { "id": "ynh", "name": "Yuval Noah Harari", "country": "Israel", "birthdate": "1976-04-24" } other entity 16 rule: a book is related to its author
  • 17. All rules: (1) the IRI is “book” + the value from column “id”; (2) the literal uses the value from column “title”; (3) the predicate is schema:title; (4) the type of a book is schema:Book; (5) a book is related to its author.
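The five rules above can be sketched in code. This is only an illustration, not the tooling used in the talk: the `BOOK_RULES` dictionary, its keys, and the `apply_book_rules` function are hypothetical names, triples are modelled as plain tuples of strings, and the `@en` language tag is taken from the Linked Data example rather than from the five listed rules.

```python
# Hypothetical encoding of the slide's rules; all names here are illustrative.
BOOK_RULES = {
    "iri_prefix": "book:",                # rule 1: IRI is "book" + value from column "id"
    "iri_column": "id",
    "title_column": "title",              # rule 2: literal uses value from column "title"
    "title_predicate": "schema:title",    # rule 3: predicate is schema:title
    "type": "schema:Book",                # rule 4: type of a book is schema:Book
    "author_predicate": "schema:author",  # rule 5: a book is related to its author
    "title_language": "en",               # assumption: language tag copied from the example
}


def apply_book_rules(row, author_iri, rules=BOOK_RULES):
    """Return the triples that the rules generate for one CSV row."""
    subject = rules["iri_prefix"] + row[rules["iri_column"]]
    title = '"{}"@{}'.format(row[rules["title_column"]], rules["title_language"])
    return [
        (subject, "a", rules["type"]),
        (subject, rules["title_predicate"], title),
        (subject, rules["author_predicate"], author_iri),
    ]


# The sample row from the slides; the author IRI is passed in already resolved.
row = {"id": "1", "title": "Homo Deus", "author": "Yuval Noah Harari"}
triples = apply_book_rules(row, "author:ynh")
for triple in triples:
    print(" ".join(triple))
```

Keeping the rules as data rather than hard-coding them in the function is one possible design; it makes it easier to inspect or adjust a single rule without touching the generation logic.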
  • 18. Apply rules to generate all Linked Data. [Diagram: the sample and the example yield the rules; the rules are applied to the original data to produce Linked Data.]
  • 19. Linked Data might not be as desired. Rules are prone to errors when created manually: wrong use of ontology classes, properties, and datatypes; wrong alignments with original data. This holds especially when dealing with large and complex data sources, or with multiple data sources at the same time.
  • 22. Solutions to reduce manual effort when creating rules. Semi-automatic: users provide feedback. Automatic: no user interaction required.
  • 23. Current solutions use limited knowledge. They only work with data schemas, data values, and ontologies. They do not consider knowledge embedded in the query workload of Linked Data or in Linked Data examples.
  • 26. We propose a semi-automatic example-driven approach to create rules.
  • 27. Overview: problem; current solutions; approach (data source alignment, rule creation, optional adjustments, Linked Data generation); discussion.
  • 28. Visualize Linked Data example. book:1 a schema:Book; schema:title "Homo Deus"@en; schema:author author:ynh. author:ynh a foaf:Person; foaf:name "Yuval Noah Harari"; foaf:country "Israel"; schema:birthdate "1976-04-24"^^xsd:date. [Graph: book:1 (schema:Book) has schema:title "Homo Deus"@en and is linked via schema:author to author:ynh (foaf:Person), which has foaf:name "Yuval Noah Harari", foaf:country "Israel", and schema:birthdate "1976-04-24"^^xsd:date.]
  • 31. Align with data sources. [Graph of the Linked Data example as on slide 28.] CSV data source (columns id, title, author): 1, Homo Deus, Yuval Noah Harari. JSON data source: { "id": "ynh", "name": "Yuval Noah Harari", "country": "Israel", "birthdate": "1976-04-24" }.
  • 32. Align with data sources. Same graph and data sources as slide 31; one graph node is now labeled with its matching data source (CSV).
  • 33. Align with data sources. Same graph and data sources as slide 31; two graph nodes are now labeled with their matching data source (CSV).
  • 34. Align with data sources. Same graph and data sources as slide 31; every graph node is now labeled with its matching data sources: one node with both JSON and CSV, two nodes with CSV, and three nodes with JSON.
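One way to picture this alignment step is the sketch below. It assumes, purely for illustration, that a node aligns with a data source when its value occurs verbatim in that source and that IRIs such as book:1 are compared by their local name; the slides do not state the actual matching criteria. The helper names (`csv_values`, `json_values`, `node_key`, `align`) are hypothetical.

```python
import csv
import io
import json

# The sample data from the slides.
CSV_SAMPLE = "id,title,author\n1,Homo Deus,Yuval Noah Harari\n"
JSON_SAMPLE = (
    '{"id": "ynh", "name": "Yuval Noah Harari", '
    '"country": "Israel", "birthdate": "1976-04-24"}'
)


def csv_values(text):
    """Collect every cell value of a CSV document."""
    values = set()
    for row in csv.DictReader(io.StringIO(text)):
        values.update(row.values())
    return values


def json_values(text):
    """Collect every value of a flat JSON object, as strings."""
    return {str(value) for value in json.loads(text).values()}


def node_key(node):
    """Assumption: IRIs are matched on the part after the known prefix."""
    prefix, sep, local = node.partition(":")
    return local if sep and prefix in ("book", "author") else node


def align(nodes, sources):
    """Map each graph node to the names of the sources containing its value."""
    return {
        node: sorted(name for name, values in sources.items()
                     if node_key(node) in values)
        for node in nodes
    }


SOURCES = {"CSV": csv_values(CSV_SAMPLE), "JSON": json_values(JSON_SAMPLE)}
NODES = ["book:1", "Homo Deus", "author:ynh",
         "Yuval Noah Harari", "Israel", "1976-04-24"]
alignment = align(NODES, SOURCES)
for node, matched in alignment.items():
    print(node, "->", ", ".join(matched))
```

Exact string equality is the simplest possible matcher; real data would likely need normalization or fuzzy matching, which this sketch does not attempt.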
  • 35. Select best data source for each subgraph with an entity. [Graph of the Linked Data example with the CSV and JSON labels from slide 34.]
  • 36. Select best data source. Book subgraph: only the CSV data source matches, so CSV is selected.
  • 37. Select best data source. Author subgraph: the CSV data source matches 1 node, the JSON data source matches all nodes, so JSON is selected.
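The selection step can be sketched as a simple count per subgraph. The per-node labels below are written out by hand from the example, `select_best_source` is a hypothetical name, and "best" is assumed to mean "matches the most nodes", which is what the slide suggests but does not define formally.

```python
from collections import Counter

# Per-node source labels for each entity subgraph, written out by hand.
BOOK_SUBGRAPH = {
    "book:1": ["CSV"],
    "Homo Deus": ["CSV"],
}
AUTHOR_SUBGRAPH = {
    "author:ynh": ["JSON"],
    "Yuval Noah Harari": ["CSV", "JSON"],
    "Israel": ["JSON"],
    "1976-04-24": ["JSON"],
}


def select_best_source(subgraph):
    """Return the source matching the most nodes, plus the match counts.

    Assumption: ties are broken by whichever source was counted first.
    """
    counts = Counter(source
                     for sources in subgraph.values()
                     for source in sources)
    best_source = counts.most_common(1)[0][0]
    return best_source, dict(counts)


book_source, book_counts = select_best_source(BOOK_SUBGRAPH)
author_source, author_counts = select_best_source(AUTHOR_SUBGRAPH)
print("book subgraph:", book_source, book_counts)
print("author subgraph:", author_source, author_counts)
```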
  • 39. Rule creation: for every entity, for every attribute, and for all interlinked entities.
  • 40. Create rules for entity. Entity book:1 (schema:Book), data source CSV. Rules: the IRI is “book” + id; the type is schema:Book.
  • 41. Create rules for attribute. Entity book:1 (schema:Book) with attribute schema:title "Homo Deus"@en, data source CSV. Rules: use predicate schema:title; the literal uses the value from column title; the language of the title is English.
  • 42. Create rules for interlinked entities. book:1 (schema:Book, data source CSV) is linked via schema:author to author:ynh (foaf:Person, data source JSON). Rules: use predicate schema:author; join condition: names match.
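The join condition "names match" can be illustrated with the two books and two authors from the input data. In this sketch, `link_books_to_authors` is a hypothetical helper, the match is an exact comparison between the CSV column author and the JSON field name, and the author IRI pattern ("author:" + id) is inferred from the Linked Data example rather than stated as a rule on the slides.

```python
# Rows from the CSV data source and objects from the JSON data source.
books = [
    {"id": "0", "title": "Harry Potter and The Sorcerer's Stone",
     "author": "J.K. Rowling"},
    {"id": "1", "title": "Homo Deus", "author": "Yuval Noah Harari"},
]
authors = [
    {"id": "jkr", "name": "J.K. Rowling"},
    {"id": "ynh", "name": "Yuval Noah Harari"},
]


def link_books_to_authors(books, authors):
    """Create schema:author triples where the book's author equals the name."""
    # Index authors by name so each book needs only one lookup.
    iri_by_name = {author["name"]: "author:" + author["id"]
                   for author in authors}
    return [
        ("book:" + book["id"], "schema:author", iri_by_name[book["author"]])
        for book in books
        if book["author"] in iri_by_name  # books without a match get no link
    ]


links = link_books_to_authors(books, authors)
for link in links:
    print(" ".join(link))
```

Books whose author name has no counterpart in the JSON source are silently skipped here; whether to skip, warn, or create a placeholder entity is a design decision the slides leave open.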
  • 44. Optional adjustments: fix errors if needed; add additional rules to deal with exceptions.
  • 46. Apply rules to generate all Linked Data. [Diagram: the sample and the example yield the rules; the rules are applied to the original data to produce Linked Data.]
  • 48. Discussion. Advantages: use knowledge embedded in Linked Data examples; minimize errors and user interaction; the approach can be combined with other approaches. Disadvantages: a Linked Data example is required; user action might still be required for special cases.
  • 49. Recap. Use cases can have a Linked Data example available. The example contains knowledge to create rules. We introduced an approach that uses this knowledge. This approach can be combined with other approaches.