SlideShare une entreprise Scribd logo
1  sur  40
1Franck MICHEL
Franck Michel
Integrating Heterogeneous
Data Sources
in the Web of Data
2Franck MICHEL
More data sources  More opportunities
3Franck MICHEL
Example: study history of zoological knowledge
Archaeological excavationConservation biology*
*http://www.lynxeds.com/hmw/plate/family-delphinidae-ocean-dolphins
First Natural History Encycloedia, 1485.
4Franck MICHEL
Example: study history of zoological knowledge
Archaeological excavationConservation biology*
*http://www.lynxeds.com/hmw/plate/family-delphinidae-ocean-dolphins
Knowledge formalizations
Controlled vocabularies,
taxonomies
domain ontologies, …
5Franck MICHEL
LOD Cloud: 10K datasets, 150B Statements
Linking Open Data cloud diagram 2017. A. Abele, J.P. McCrae, P. Buitelaar, A. Jentzsch and R. Cyganiak. http://lod-cloud.net/
On the Web
In RDF
Under open
licences
Interlinked
6Franck MICHEL
Publishing legacy data in RDF raises tricky questions
Metadata
Data
Vocabularies?
Create links?
Raw data?
Translate
into RDF?
7Franck MICHEL
Describe the translation of heterogeneous data into RDF
Choose vocabularies to represent RDF data
Access the RDF data produced
The key importance of metadata
Agenda
8Franck MICHEL
Describe the translation of heterogeneous data into RDF
Choose vocabularies to represent RDF data
Access the RDF data produced
The key importance of metadata
Agenda
9Franck MICHEL
Data Sources Have Heterogeneous Data Models
Relational DB
ID NAME
GraphObject-Oriented
Native XML DBs
Documents
10Franck MICHEL
Describe the translation of heterogeneous data into RDF
HTML data with RDFa
CSV data
Relational data
NoSQL data
Choose vocabularies to represent RDF data
Access the RDF data produced
The key importance of metadata
Agenda
11Franck MICHEL
Describe the translation of heterogeneous data into RDF
HTML data with RDFa
CSV data
Relational data
NoSQL data
Choose vocabularies to represent RDF data
Access the RDF data produced
The key importance of metadata
Agenda
12Franck MICHEL
<body vocab="http://schema.org/">
<div resource="/jdev2017" typeof="Event">
<h2 property="title">JDEV 2017</h2>
<p>Date: <span property="startDate">2017-07-04</span></p>
...
<p>T2 - Ingénierie et web des données.
<a property="url“href="http://devlog.cnrs.fr/jdev2017/t2">More…</a>
</p>
</div>
</body> prefix sch: <http://schema.org/>
<http://devlog.cnrs.fr/jdev2017>
rdf:type sch:Event ;
sch:title "JDEV 2017";
sch:startDate "2015-10-20" ;
sch:url <http://devlog.cnrs.fr/jdev2017/t2> .
RDFa: RDF in HTML attributes
http://devlog.cnrs.fr/
https://www.w3.org/TR/rdfa-core/
13Franck MICHEL
Describe the translation of heterogeneous data into RDF
HTML data with RDFa
CSV data
Relational data
NoSQL data
Choose vocabularies to represent RDF data
Access the RDF data produced
The key importance of metadata
Agenda
14Franck MICHEL
CSVW: CSV on the Web
GET trees.csv
Content-Type: text/csv
Link: <http://example.org/trees.json>; rel="…"
ID, Street, Species,Trim Cycle, Inventory Date
1, Addison Av, Celtis australis, 2010/10/18
2, Emerson St, Liquidambar styraciflua, 2010/06/02
ID, Street, Species,Trim Cycle, Inventory Date
1, Addison Av, Celtis australis, 2010/10/18
2, Emerson St, Liquidambar styraciflua, 2010/06/02
{ "@context":["http://www.w3.org/ns/csvw", {"@language": "en"}],
"url": "trees.csv", "dc:title": "Trees",
"dc:license": { "@id": "http://.../../cc-by/"},
"dc:modified": {"@value": "2010-12-31", "@type": "xsd:date"},
"tableSchema": {
"columns": [
{ "name": "ID", "titles": ["ID", "Identifier"], "datatype":"string", "required": true },
{ "name": "Street", "titles": "On Street", "dc:description": "…", "datatype": "string" }, ...
],
"primaryKey": "ID", "aboutUrl": "#id-{ID}" }}
https://www.w3.org/2013/csvw/
15Franck MICHEL
Describe the translation of heterogeneous data into RDF
HTML data with RDFa
CSV data
Relational data
NoSQL data
Choose vocabularies to represent RDF data
Access the RDF data produced
The key importance of metadata
Agenda
16Franck MICHEL
Direct Mapping of a RDB to RDF
<PEOPLE/ID=7> rdf:type <PEOPLE> .
<PEOPLE/ID=7> <PEOPLE#FNAME> "Catherine" .
<PEOPLE/ID=7> <PEOPLE#WROTE> <BOOK/ID=22> .
<PEOPLE/ID=8> rdf:type <People> .
<PEOPLE/ID=8> <PEOPLE#FNAME> "Olivier" .
<PEOPLE/ID=8> <PEOPLE#WROTE> <BOOK/ID=22> .
Table: PEOPLE
ID FNAME WROTE (FK BOOK/ID)
7 Catherine 22
8 Olivier 22
… … …
https://www.w3.org/TR/2012/REC-rdb-direct-mapping-20120927/
17Franck MICHEL
Custom Mapping of a RDB to RDF
<http://unice.fr/staff/7> rdf:type ex:Teacher.
<http://unice.fr/staff/7> foaf:name "Catherine".
<http://unice.fr/staff/7> dc:contributor <http://unice.fr/book/22>.
<http://unice.fr/staff/8> rdf:type ex:Teacher.
<http://unice.fr/staff/8> foaf:name "Olivier".
<http://unice.fr/staff/8> dc:contributor <http://unice.fr/book/22>.
Existing vocabularies
Table: PEOPLE
ID FNAME WROTE (FK BOOK/ID)
7 Catherine 22
8 Olivier 22
… … …
18Franck MICHEL
Custom Mapping of a RDB to RDF with R2RML
<#MapPeople>
rr:logicalTable [ rr:tableName "PEOPLE" ];
rr:subjectMap [
rr:template "http://unice.fr/staff/{ID}";
rr:class ex:Teacher;
];
rr:predicateObjectMap [
rr:predicate foaf:name;
rr:objectMap [ rr:column "FNAME" ];
].
<http://unice.fr/staff/7> rdf:type ex:Teacher.
<http://unice.fr/staff/7> foaf:name "Catherine".
<http://unice.fr/staff/8> rdf:type ex:Teacher.
<http://unice.fr/staff/8> foaf:name "Olivier".
http://www.w3.org/TR/r2rml/
19Franck MICHEL
Describe the translation of heterogeneous data into RDF
HTML data with RDFa
CSV data
Relational data
NoSQL data
Choose vocabularies to represent RDF data
Access the RDF data produced
The key importance of metadata
Agenda
20Franck MICHEL
<http://example.org/member/106> foaf:mbox "john@foo.com".
<http://example.org/member/106> foaf:mbox "john@example.org".
<#MapMbox>
xrr:logicalSource [ xrr:query "db.people.find({'emails':{$ne: null}})" ];
rr:subjectMap [ rr:template "http://example.org/member/{$.id}" ];
rr:predicateObjectMap [
rr:predicate foaf:mbox;
rr:objectMap [ xrr:reference "$.emails.*"; rr:termType rr:Literal ]
].
Mapping of a NoSQL DBs to RDF with xR2RML
{ "id": 106,
"firstname": "John",
"emails": [ "john@foo.com", "john@example.org" ]
}
xR2RML
21Franck MICHEL
Many methods for many types of data sources
AstroGrid-D, SPARQL2XQuery, XSPARQL
XML
XLWrap, Linked CSV, CSVW, RML
CSV/TSV/Spreadsheets
D2RQ, R2O, Ultrawrap, Triplify, SM
R2RML: Morph-RDB, ontop, Virtuoso
Relational Databases
RML, TARQL, Apache Any23, DataLift,
SPARQL-Generate
Multiple formats
RDFa, Microformats
HTML
TARQL, JSON-LD, RML
JSON
xR2RML (MongoDB), ontop (MongoDB),
[Mugnier et al, 2016]
NoSQL
M.L. Mugnier, M.C. Rousset, and F. Ulliana. “Ontology-Mediated Queries for NOSQL Databases.” In Proc. AAAI. 2016.
22Franck MICHEL
Describe the translation of heterogeneous data into RDF
Choose vocabularies to represent RDF data
Access the RDF data produced
The key importance of metadata
Agenda
23Franck MICHEL
Direct mapping: create my own vocabulary
Can be derived from an existing schema
May seem easier: “I do whatever I want”
But no added semantics, need to link my vocabulary with
other ones
24Franck MICHEL
Custom Mapping: reuse existing vocabularies
Large variety of existing vocabularies
But may be difficult to find the appropriate one
Partial coverage of the domain
Granularity: too high (cumbersome), too low (useless)
Different points of view
Frequently, a mixed approach is used
25Franck MICHEL
Describe the translation of heterogeneous data into RDF
Choose vocabularies to represent RDF data
Access the RDF data produced
The key importance of metadata
Agenda
26Franck MICHEL
Two approaches to translate existing data sources in RDF
Graph
Materialization
(ETL like)
Virtual Graph
Query
rewriting
SPARQL
SPARQL
ID NAME
27Franck MICHEL
A large variety of approaches
Graph
Materialization
Query
Rewriting
Direct
Mapping
Custom
Mapping
RDFa XSPARQL
SPARQL2XQuery
AstroGrid-D
XLWrap
Linked CSV
CSVW
TARQL
DataLift
SPARQL-Generate
RML
JSON-LD
Virtuoso
R2RML
D2RQ
R2O
Ultrawrap ontop
R2RML xR2RML
Any23
Triplify
28Franck MICHEL
Describe the translation of heterogeneous data into RDF
Choose vocabularies to represent RDF data
Access the RDF data produced
The key importance of metadata
Agenda
29Franck MICHEL
Making datasets discoverable and reusable
requires
high-quality, comprehensive metadata
30Franck MICHEL
31Franck MICHEL
Metadata are the key to enable dataset reuse
Context Identification, authors, dates, license, version, reference articles
Access Format, structure, location (dwld), query method
Meaning What do the data represent? What concepts, entities, semantics?
Interpretation Units (cm or inches, left/right)…
Provenance
Acquired with which equipment, parameters, protocols?
Derived from which dataset? With which processing?
Dataset-level or entity-level provenance
Statistics Number of triples per property of class, links to other datasets…
…
32Franck MICHEL
Several works normalize metadata descritpions
and how to use metadata
 CSVW: CSV on the Web
 DCAT: Data Catalog Vocabulary
DCAT extensions, application profiles
W3C Dataset Exchange Working Group
 VoID: Vocabulary of Interlinked Datasets
 HCLS: Health Care & Life Sciences Dataset Profile
 …
33Franck MICHEL
Thank
you!
34Franck MICHEL
Backup slides
35Franck MICHEL
HCLS: Health Care & Life Sciences Dataset Profile
Consensus among stakeholders on
the description of datasets using RDF
*http://www.w3.org/TR/hcls-dataset/
RDF, RDFS, XSD
Citation Typing Ontology
Data Catalog (DCAT)
Dublin Core Metadata Types, Dublin Core Metadata Terms
Friend-of-a-Friend (FOAF)
Collection Description Frequency Vocabulary
Identifiers.org vocabulary
Lexvo.org - Lexical Vocabulary
Provenance Authoring and Versioning ontology (PAV)
PROV Ontology
Semantic science Integrated Ontology (SIO)
Vocabulary of Interlinked Datasets (VoID)
Used
vocabularies
36Franck MICHEL
Using a JSON-LD profile to translate JSON into RDF
<http://example.org/member/106> foaf:mbox "john@foo.com".
<http://example.org/member/106> foaf:mbox "john@example.org".
{ "id": 106,
"firstname": "John",
"emails": [ "john@foo.com",
"john@example.org" ]
}
{ "@context": {
"id": "@id",
"@base": "http://example.org/member/"
"emails": "http://xmlns.com/foaf/0.1/mbox"
}
}
https://www.w3.org/TR/json-ld/
37Franck MICHEL
 Various initial motivations
• Web of Data, Linked Data,
• OBDA,
• Ontology learning,
• Schema mapping…
 Historical products: D2RQ, Virtuoso…
 R2RML mapping language
• 2012 W3C recommendation
• Several implementations
 Several methods: direct mapping vs. domain-specific
Translation of RDBs to RDF
38Franck MICHEL
All You Need is LOV
Linked Open Vocabularies
 522 curated vocabularies
 Quality requirements
• URI stability and availability,
• Quality metadata and
documentation,
• Identifiable and trustable
publication body,
• Proper versioning policy,
• …
“Vocabularies provide the semantic glue
enabling Data to become meaningful Data.”
http://lov.okfn.org/dataset/lov/
39Franck MICHEL
Linked Data rules
1.Use URIs as names for things
2.Use HTTP URIs so that people
can look up those names
3.When someone looks up a URI, provide useful
information, using the standards (RDF*, SPARQL)
4.Include links to other URIs, so that they can discover
more things
40Franck MICHEL
Publish
Linked Open Data
on the Web

Contenu connexe

Tendances

SWT Lecture Session 9 - RDB2RDF direct mapping
SWT Lecture Session 9 - RDB2RDF direct mappingSWT Lecture Session 9 - RDB2RDF direct mapping
SWT Lecture Session 9 - RDB2RDF direct mapping
Mariano Rodriguez-Muro
 
Web of Data Usage Mining
Web of Data Usage MiningWeb of Data Usage Mining
Web of Data Usage Mining
Markus Luczak-Rösch
 
Tue acosta tut_providing_linkeddata
Tue acosta tut_providing_linkeddataTue acosta tut_providing_linkeddata
Tue acosta tut_providing_linkeddata
eswcsummerschool
 

Tendances (20)

SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)
SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)
SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)
 
SWT Lecture Session 3 - SPARQL
SWT Lecture Session 3 - SPARQLSWT Lecture Session 3 - SPARQL
SWT Lecture Session 3 - SPARQL
 
Web Data Management with RDF
Web Data Management with RDFWeb Data Management with RDF
Web Data Management with RDF
 
Uplift – Generating RDF datasets from non-RDF data with R2RML
Uplift – Generating RDF datasets from non-RDF data with R2RMLUplift – Generating RDF datasets from non-RDF data with R2RML
Uplift – Generating RDF datasets from non-RDF data with R2RML
 
SWT Lecture Session 8 - Rules
SWT Lecture Session 8 - RulesSWT Lecture Session 8 - Rules
SWT Lecture Session 8 - Rules
 
Triple Stores
Triple StoresTriple Stores
Triple Stores
 
SWT Lecture Session 10 R2RML Part 1
SWT Lecture Session 10 R2RML Part 1SWT Lecture Session 10 R2RML Part 1
SWT Lecture Session 10 R2RML Part 1
 
2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod Gmod
 
Workshop 5: Uptake of, and concepts in text and data mining
Workshop 5: Uptake of, and concepts in text and data miningWorkshop 5: Uptake of, and concepts in text and data mining
Workshop 5: Uptake of, and concepts in text and data mining
 
5 rdfs
5 rdfs5 rdfs
5 rdfs
 
SWT Lecture Session 9 - RDB2RDF direct mapping
SWT Lecture Session 9 - RDB2RDF direct mappingSWT Lecture Session 9 - RDB2RDF direct mapping
SWT Lecture Session 9 - RDB2RDF direct mapping
 
20141112 courtot big_datasemwebontologies
20141112 courtot big_datasemwebontologies20141112 courtot big_datasemwebontologies
20141112 courtot big_datasemwebontologies
 
Web of Data Usage Mining
Web of Data Usage MiningWeb of Data Usage Mining
Web of Data Usage Mining
 
SWT Lecture Session 11 - R2RML part 2
SWT Lecture Session 11 - R2RML part 2SWT Lecture Session 11 - R2RML part 2
SWT Lecture Session 11 - R2RML part 2
 
Contributing to the Smart City Through Linked Library Data
Contributing to the Smart City Through Linked Library DataContributing to the Smart City Through Linked Library Data
Contributing to the Smart City Through Linked Library Data
 
Linking Open Government Data at Scale
Linking Open Government Data at Scale Linking Open Government Data at Scale
Linking Open Government Data at Scale
 
Semantic HTML
Semantic HTMLSemantic HTML
Semantic HTML
 
Tue acosta tut_providing_linkeddata
Tue acosta tut_providing_linkeddataTue acosta tut_providing_linkeddata
Tue acosta tut_providing_linkeddata
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked Data
 
One day workshop Linked Data and Semantic Web
One day workshop Linked Data and Semantic WebOne day workshop Linked Data and Semantic Web
One day workshop Linked Data and Semantic Web
 

Similaire à Integrating Heterogeneous Data Sources in the Web of Data

Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Cory Lampert
 
NISO/DCMI Webinar: International Bibliographic Standards, Linked Data, and th...
NISO/DCMI Webinar: International Bibliographic Standards, Linked Data, and th...NISO/DCMI Webinar: International Bibliographic Standards, Linked Data, and th...
NISO/DCMI Webinar: International Bibliographic Standards, Linked Data, and th...
National Information Standards Organization (NISO)
 
Linked data for librarians
Linked data for librariansLinked data for librarians
Linked data for librarians
trevorthornton
 

Similaire à Integrating Heterogeneous Data Sources in the Web of Data (20)

Linked data 101: Getting Caught in the Semantic Web
Linked data 101: Getting Caught in the Semantic Web Linked data 101: Getting Caught in the Semantic Web
Linked data 101: Getting Caught in the Semantic Web
 
Enterprise knowledge graphs
Enterprise knowledge graphsEnterprise knowledge graphs
Enterprise knowledge graphs
 
Do it on your own - From 3 to 5 Star Linked Open Data with RMLio
Do it on your own - From 3 to 5 Star Linked Open Data with RMLioDo it on your own - From 3 to 5 Star Linked Open Data with RMLio
Do it on your own - From 3 to 5 Star Linked Open Data with RMLio
 
DLF 2015 Presentation, "RDF in the Real World."
DLF 2015 Presentation, "RDF in the Real World." DLF 2015 Presentation, "RDF in the Real World."
DLF 2015 Presentation, "RDF in the Real World."
 
Triplificating and linking XBRL financial data
Triplificating and linking XBRL financial dataTriplificating and linking XBRL financial data
Triplificating and linking XBRL financial data
 
Moving Library Metadata Toward Linked Data: Opportunities Provided by the eX...
Moving Library Metadata Toward Linked Data:  Opportunities Provided by the eX...Moving Library Metadata Toward Linked Data:  Opportunities Provided by the eX...
Moving Library Metadata Toward Linked Data: Opportunities Provided by the eX...
 
Knowledge Graph Introduction
Knowledge Graph IntroductionKnowledge Graph Introduction
Knowledge Graph Introduction
 
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
 
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
 
Sören Auer | Enterprise Knowledge Graphs
Sören Auer | Enterprise Knowledge GraphsSören Auer | Enterprise Knowledge Graphs
Sören Auer | Enterprise Knowledge Graphs
 
Make our Scientific Datasets Accessible and Interoperable on the Web
Make our Scientific Datasets Accessible and Interoperable on the WebMake our Scientific Datasets Accessible and Interoperable on the Web
Make our Scientific Datasets Accessible and Interoperable on the Web
 
NISO/DCMI Webinar: International Bibliographic Standards, Linked Data, and th...
NISO/DCMI Webinar: International Bibliographic Standards, Linked Data, and th...NISO/DCMI Webinar: International Bibliographic Standards, Linked Data, and th...
NISO/DCMI Webinar: International Bibliographic Standards, Linked Data, and th...
 
Linked data for librarians
Linked data for librariansLinked data for librarians
Linked data for librarians
 
when the link makes sense
when the link makes sensewhen the link makes sense
when the link makes sense
 
CSHALS 2010 W3C Semanic Web Tutorial
CSHALS 2010 W3C Semanic Web TutorialCSHALS 2010 W3C Semanic Web Tutorial
CSHALS 2010 W3C Semanic Web Tutorial
 
Lifting the Lid on Linked Data
Lifting the Lid on Linked DataLifting the Lid on Linked Data
Lifting the Lid on Linked Data
 
Introducción a la web semántica - Linkatu - irekia 2012
Introducción a la web semántica - Linkatu - irekia 2012Introducción a la web semántica - Linkatu - irekia 2012
Introducción a la web semántica - Linkatu - irekia 2012
 
Publishing data on the Semantic Web
Publishing data on the Semantic WebPublishing data on the Semantic Web
Publishing data on the Semantic Web
 
Hack U Barcelona 2011
Hack U Barcelona 2011Hack U Barcelona 2011
Hack U Barcelona 2011
 
Structured Dynamics' Semantic Technologies Product Stack
Structured Dynamics' Semantic Technologies Product StackStructured Dynamics' Semantic Technologies Product Stack
Structured Dynamics' Semantic Technologies Product Stack
 

Plus de Franck Michel

A Model to Represent Nomenclatural and Taxonomic Information as Linked Data. ...
A Model to Represent Nomenclatural and Taxonomic Information as Linked Data. ...A Model to Represent Nomenclatural and Taxonomic Information as Linked Data. ...
A Model to Represent Nomenclatural and Taxonomic Information as Linked Data. ...
Franck Michel
 
SPARQL Micro-Services: Lightweight Integration of Web APIs and Linked Data
SPARQL Micro-Services: Lightweight Integration of Web APIs and Linked DataSPARQL Micro-Services: Lightweight Integration of Web APIs and Linked Data
SPARQL Micro-Services: Lightweight Integration of Web APIs and Linked Data
Franck Michel
 
Construction d’un référentiel taxonomique commun pour des études sur l’histoi...
Construction d’un référentiel taxonomique commun pour des études sur l’histoi...Construction d’un référentiel taxonomique commun pour des études sur l’histoi...
Construction d’un référentiel taxonomique commun pour des études sur l’histoi...
Franck Michel
 

Plus de Franck Michel (13)

ISSA: Generic Pipeline, Knowledge Model and Visualization tools to Help Scien...
ISSA: Generic Pipeline, Knowledge Model and Visualization tools to Help Scien...ISSA: Generic Pipeline, Knowledge Model and Visualization tools to Help Scien...
ISSA: Generic Pipeline, Knowledge Model and Visualization tools to Help Scien...
 
Bioschemas: Marking up biodiversity websites to improve data discovery and we...
Bioschemas: Marking up biodiversity websites to improve data discovery and we...Bioschemas: Marking up biodiversity websites to improve data discovery and we...
Bioschemas: Marking up biodiversity websites to improve data discovery and we...
 
Unleash the Potential of your Website! 180,000 webpages from the French NHM m...
Unleash the Potential of your Website! 180,000 webpages from the French NHM m...Unleash the Potential of your Website! 180,000 webpages from the French NHM m...
Unleash the Potential of your Website! 180,000 webpages from the French NHM m...
 
Heterogeneous Data Aggregation and Querying at Web Scale Using Semantic align...
Heterogeneous Data Aggregation and Querying at Web Scale Using Semantic align...Heterogeneous Data Aggregation and Querying at Web Scale Using Semantic align...
Heterogeneous Data Aggregation and Querying at Web Scale Using Semantic align...
 
Describe and Publish data sets on the web: vocabularies, catalogues, data por...
Describe and Publish data sets on the web: vocabularies, catalogues, data por...Describe and Publish data sets on the web: vocabularies, catalogues, data por...
Describe and Publish data sets on the web: vocabularies, catalogues, data por...
 
Knowledge Engineering: Semantic web, web of data, linked data
Knowledge Engineering: Semantic web, web of data, linked dataKnowledge Engineering: Semantic web, web of data, linked data
Knowledge Engineering: Semantic web, web of data, linked data
 
Enabling Automatic Discovery and Querying of Web APIs at Web Scale using Link...
Enabling Automatic Discovery and Querying of Web APIs at Web Scale using Link...Enabling Automatic Discovery and Querying of Web APIs at Web Scale using Link...
Enabling Automatic Discovery and Querying of Web APIs at Web Scale using Link...
 
Modelling Biodiversity Linked Data: Pragmatism May Narrow Future Opportunities
Modelling Biodiversity Linked Data: Pragmatism May Narrow Future OpportunitiesModelling Biodiversity Linked Data: Pragmatism May Narrow Future Opportunities
Modelling Biodiversity Linked Data: Pragmatism May Narrow Future Opportunities
 
A Model to Represent Nomenclatural and Taxonomic Information as Linked Data. ...
A Model to Represent Nomenclatural and Taxonomic Information as Linked Data. ...A Model to Represent Nomenclatural and Taxonomic Information as Linked Data. ...
A Model to Represent Nomenclatural and Taxonomic Information as Linked Data. ...
 
SPARQL Micro-Services: Lightweight Integration of Web APIs and Linked Data
SPARQL Micro-Services: Lightweight Integration of Web APIs and Linked DataSPARQL Micro-Services: Lightweight Integration of Web APIs and Linked Data
SPARQL Micro-Services: Lightweight Integration of Web APIs and Linked Data
 
Construction d’un référentiel taxonomique commun pour des études sur l’histoi...
Construction d’un référentiel taxonomique commun pour des études sur l’histoi...Construction d’un référentiel taxonomique commun pour des études sur l’histoi...
Construction d’un référentiel taxonomique commun pour des études sur l’histoi...
 
A Mapping-based Method to Query MongoDB Documents with SPARQL
A Mapping-based Method to Query MongoDB Documents with SPARQLA Mapping-based Method to Query MongoDB Documents with SPARQL
A Mapping-based Method to Query MongoDB Documents with SPARQL
 
Towards a Shared Reference Thesaurus for Studies on History of Zoology, Archa...
Towards a Shared Reference Thesaurus for Studies on History of Zoology, Archa...Towards a Shared Reference Thesaurus for Studies on History of Zoology, Archa...
Towards a Shared Reference Thesaurus for Studies on History of Zoology, Archa...
 

Dernier

Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdfpdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
JOHNBEBONYAP1
 
VIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 Booking
dharasingh5698
 
💚😋 Bilaspur Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
💚😋 Bilaspur Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋💚😋 Bilaspur Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
💚😋 Bilaspur Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
nirzagarg
 
💚😋 Salem Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
💚😋 Salem Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋💚😋 Salem Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
💚😋 Salem Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
nirzagarg
 
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRLLucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
imonikaupta
 
📱Dehradun Call Girls Service 📱☎️ +91'905,3900,678 ☎️📱 Call Girls In Dehradun 📱
📱Dehradun Call Girls Service 📱☎️ +91'905,3900,678 ☎️📱 Call Girls In Dehradun 📱📱Dehradun Call Girls Service 📱☎️ +91'905,3900,678 ☎️📱 Call Girls In Dehradun 📱
📱Dehradun Call Girls Service 📱☎️ +91'905,3900,678 ☎️📱 Call Girls In Dehradun 📱
@Chandigarh #call #Girls 9053900678 @Call #Girls in @Punjab 9053900678
 

Dernier (20)

Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdfpdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
 
Sarola * Female Escorts Service in Pune | 8005736733 Independent Escorts & Da...
Sarola * Female Escorts Service in Pune | 8005736733 Independent Escorts & Da...Sarola * Female Escorts Service in Pune | 8005736733 Independent Escorts & Da...
Sarola * Female Escorts Service in Pune | 8005736733 Independent Escorts & Da...
 
20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdf20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdf
 
"Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency""Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency"
 
VIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 Booking
 
💚😋 Bilaspur Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
💚😋 Bilaspur Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋💚😋 Bilaspur Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
💚😋 Bilaspur Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
 
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
 
VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting High Prof...
VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting  High Prof...VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting  High Prof...
VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting High Prof...
 
Katraj ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For S...
Katraj ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For S...Katraj ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For S...
Katraj ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For S...
 
Nanded City ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready ...
Nanded City ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready ...Nanded City ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready ...
Nanded City ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready ...
 
Dubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls Dubai
Dubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls DubaiDubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls Dubai
Dubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls Dubai
 
💚😋 Salem Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
💚😋 Salem Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋💚😋 Salem Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
💚😋 Salem Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
 
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
 
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service AvailableCall Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
 
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
 
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRLLucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
 
📱Dehradun Call Girls Service 📱☎️ +91'905,3900,678 ☎️📱 Call Girls In Dehradun 📱
📱Dehradun Call Girls Service 📱☎️ +91'905,3900,678 ☎️📱 Call Girls In Dehradun 📱📱Dehradun Call Girls Service 📱☎️ +91'905,3900,678 ☎️📱 Call Girls In Dehradun 📱
📱Dehradun Call Girls Service 📱☎️ +91'905,3900,678 ☎️📱 Call Girls In Dehradun 📱
 
Wadgaon Sheri $ Call Girls Pune 10k @ I'm VIP Independent Escorts Girls 80057...
Wadgaon Sheri $ Call Girls Pune 10k @ I'm VIP Independent Escorts Girls 80057...Wadgaon Sheri $ Call Girls Pune 10k @ I'm VIP Independent Escorts Girls 80057...
Wadgaon Sheri $ Call Girls Pune 10k @ I'm VIP Independent Escorts Girls 80057...
 
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
 

Integrating Heterogeneous Data Sources in the Web of Data

  • 1. 1Franck MICHEL Franck Michel Integrating Heterogeneous Data Sources in the Web of Data
  • 2. 2Franck MICHEL More data sources  More opportunities
  • 3. 3Franck MICHEL Example: study history of zoological knowledge Archaeological excavationConservation biology* *http://www.lynxeds.com/hmw/plate/family-delphinidae-ocean-dolphins First Natural History Encycloedia, 1485.
  • 4. 4Franck MICHEL Example: study history of zoological knowledge Archaeological excavationConservation biology* *http://www.lynxeds.com/hmw/plate/family-delphinidae-ocean-dolphins Knowledge formalizations Controlled vocabularies, taxonomies domain ontologies, …
  • 5. 5Franck MICHEL LOD Cloud: 10K datasets, 150B Statements Linking Open Data cloud diagram 2017. A. Abele, J.P. McCrae, P. Buitelaar, A. Jentzsch and R. Cyganiak. http://lod-cloud.net/ On the Web In RDF Under open licences Interlinked
  • 6. 6Franck MICHEL Publishing legacy data in RDF raises tricky questions Metadata Data Vocabularies? Create links? Raw data? Translate into RDF?
  • 7. 7Franck MICHEL Describe the translation of heterogeneous data into RDF Choose vocabularies to represent RDF data Access the RDF data produced The key importance of metadata Agenda
  • 8. 8Franck MICHEL Describe the translation of heterogeneous data into RDF Choose vocabularies to represent RDF data Access the RDF data produced The key importance of metadata Agenda
  • 9. 9Franck MICHEL Data Sources Have Heterogeneous Data Models Relational DB ID NAME GraphObject-Oriented Native XML DBs Documents
  • 10. 10Franck MICHEL Describe the translation of heterogeneous data into RDF HTML data with RDFa CSV data Relational data NoSQL data Choose vocabularies to represent RDF data Access the RDF data produced The key importance of metadata Agenda
  • 11. 11Franck MICHEL Describe the translation of heterogeneous data into RDF HTML data with RDFa CSV data Relational data NoSQL data Choose vocabularies to represent RDF data Access the RDF data produced The key importance of metadata Agenda
  • 12. 12Franck MICHEL <body vocab="http://schema.org/"> <div resource="/jdev2017" typeof="Event"> <h2 property="title">JDEV 2017</h2> <p>Date: <span property="startDate">2017-07-04</span></p> ... <p>T2 - Ingénierie et web des données. <a property="url“href="http://devlog.cnrs.fr/jdev2017/t2">More…</a> </p> </div> </body> prefix sch: <http://schema.org/> <http://devlog.cnrs.fr/jdev2017> rdf:type sch:Event ; sch:title "JDEV 2017"; sch:startDate "2015-10-20" ; sch:url <http://devlog.cnrs.fr/jdev2017/t2> . RDFa: RDF in HTML attributes http://devlog.cnrs.fr/ https://www.w3.org/TR/rdfa-core/
  • 13. 13Franck MICHEL Describe the translation of heterogeneous data into RDF HTML data with RDFa CSV data Relational data NoSQL data Choose vocabularies to represent RDF data Access the RDF data produced The key importance of metadata Agenda
  • 14. 14Franck MICHEL CSVW: CSV on the Web GET trees.csv Content-Type: text/csv Link: <http://example.org/trees.json>; rel="…" ID, Street, Species,Trim Cycle, Inventory Date 1, Addison Av, Celtis australis, 2010/10/18 2, Emerson St, Liquidambar styraciflua, 2010/06/02 ID, Street, Species,Trim Cycle, Inventory Date 1, Addison Av, Celtis australis, 2010/10/18 2, Emerson St, Liquidambar styraciflua, 2010/06/02 { "@context":["http://www.w3.org/ns/csvw", {"@language": "en"}], "url": "trees.csv", "dc:title": "Trees", "dc:license": { "@id": "http://.../../cc-by/"}, "dc:modified": {"@value": "2010-12-31", "@type": "xsd:date"}, "tableSchema": { "columns": [ { "name": "ID", "titles": ["ID", "Identifier"], "datatype":"string", "required": true }, { "name": "Street", "titles": "On Street", "dc:description": "…", "datatype": "string" }, ... ], "primaryKey": "ID", "aboutUrl": "#id-{ID}" }} https://www.w3.org/2013/csvw/
  • 15. 15Franck MICHEL Describe the translation of heterogeneous data into RDF HTML data with RDFa CSV data Relational data NoSQL data Choose vocabularies to represent RDF data Access the RDF data produced The key importance of metadata Agenda
  • 16. 16Franck MICHEL Direct Mapping of a RDB to RDF <PEOPLE/ID=7> rdf:type <PEOPLE> . <PEOPLE/ID=7> <PEOPLE#FNAME> "Catherine" . <PEOPLE/ID=7> <PEOPLE#WROTE> <BOOK/ID=22> . <PEOPLE/ID=8> rdf:type <People> . <PEOPLE/ID=8> <PEOPLE#FNAME> "Olivier" . <PEOPLE/ID=8> <PEOPLE#WROTE> <BOOK/ID=22> . Table: PEOPLE ID FNAME WROTE (FK BOOK/ID) 7 Catherine 22 8 Olivier 22 … … … https://www.w3.org/TR/2012/REC-rdb-direct-mapping-20120927/
  • 17. 17Franck MICHEL Custom Mapping of a RDB to RDF <http://unice.fr/staff/7> rdf:type ex:Teacher. <http://unice.fr/staff/7> foaf:name "Catherine". <http://unice.fr/staff/7> dc:contributor <http://unice.fr/book/22>. <http://unice.fr/staff/8> rdf:type ex:Teacher. <http://unice.fr/staff/8> foaf:name "Olivier". <http://unice.fr/staff/8> dc:contributor <http://unice.fr/book/22>. Existing vocabularies Table: PEOPLE ID FNAME WROTE (FK BOOK/ID) 7 Catherine 22 8 Olivier 22 … … …
  • 18. 18Franck MICHEL Custom Mapping of a RDB to RDF with R2RML <#MapPeople> rr:logicalTable [ rr:tableName "PEOPLE" ]; rr:subjectMap [ rr:template "http://unice.fr/staff/{ID}"; rr:class ex:Teacher; ]; rr:predicateObjectMap [ rr:predicate foaf:name; rr:objectMap [ rr:column "FNAME" ]; ]. <http://unice.fr/staff/7> rdf:type ex:Teacher. <http://unice.fr/staff/7> foaf:name "Catherine". <http://unice.fr/staff/8> rdf:type ex:Teacher. <http://unice.fr/staff/8> foaf:name "Olivier". http://www.w3.org/TR/r2rml/
  • 19. 19Franck MICHEL Describe the translation of heterogeneous data into RDF HTML data with RDFa CSV data Relational data NoSQL data Choose vocabularies to represent RDF data Access the RDF data produced The key importance of metadata Agenda
  • 20. 20Franck MICHEL <http://example.org/member/106> foaf:mbox "john@foo.com". <http://example.org/member/106> foaf:mbox "john@example.org". <#MapMbox> xrr:logicalSource [ xrr:query "db.people.find({'emails':{$ne: null}})" ]; rr:subjectMap [ rr:template "http://example.org/member/{$.id}" ]; rr:predicateObjectMap [ rr:predicate foaf:mbox; rr:objectMap [ xrr:reference "$.emails.*"; rr:termType rr:Literal ] ]. Mapping of a NoSQL DBs to RDF with xR2RML { "id": 106, "firstname": "John", "emails": [ "john@foo.com", "john@example.org" ] } xR2RML
  • 21. 21Franck MICHEL Many methods for many types of data sources AstroGrid-D, SPARQL2XQuery, XSPARQL XML XLWrap, Linked CSV, CSVW, RML CSV/TSV/Spreadsheets D2RQ, R2O, Ultrawrap, Triplify, SM R2RML: Morph-RDB, ontop, Virtuoso Relational Databases RML, TARQL, Apache Any23, DataLift, SPARQL-Generate Multiple formats RDFa, Microformats HTML TARQL, JSON-LD, RML JSON xR2RML (MongoDB), ontop (MongoDB), [Mugnier et al, 2016] NoSQL M.L. Mugnier, M.C. Rousset, and F. Ulliana. “Ontology-Mediated Queries for NOSQL Databases.” In Proc. AAAI. 2016.
  • 22. 22Franck MICHEL Describe the translation of heterogeneous data into RDF Choose vocabularies to represent RDF data Access the RDF data produced The key importance of metadata Agenda
  • 23. 23Franck MICHEL Direct mapping: create my own vocabulary Can be derived from an existing schema May seem easier: “I do whatever I want” But no added semantics, need to link my vocabulary with other ones
  • 24. 24Franck MICHEL Custom Mapping: reuse existing vocabularies Large variety of existing vocabularies But may be difficult to find the appropriate one Partial coverage of the domain Granularity: too high (cumbersome), too low (useless) Different points of view Frequently, a mixed approach is used
  • 25. 25Franck MICHEL Describe the translation of heterogeneous data into RDF Choose vocabularies to represent RDF data Access the RDF data produced The key importance of metadata Agenda
  • 26. 26Franck MICHEL Two approaches to translate existing data sources in RDF Graph Materialization (ETL like) Virtual Graph Query rewriting SPARQL SPARQL ID NAME
  • 27. 27Franck MICHEL A large variety of approaches Graph Materialization Query Rewriting Direct Mapping Custom Mapping RDFa XSPARQL SPARQL2XQuery AstroGrid-D XLWrap Linked CSV CSVW TARQL DataLift SPARQL-Generate RML JSON-LD Virtuoso R2RML D2RQ R2O Ultrawrap ontop R2RML xR2RML Any23 Triplify
  • 28. 28Franck MICHEL Describe the translation of heterogeneous data into RDF Choose vocabularies to represent RDF data Access the RDF data produced The key importance of metadata Agenda
  • 29. 29Franck MICHEL Making datasets discoverable and reusable requires high-quality, comprehensive metadata
  • 31. 31Franck MICHEL Metadata are the key to enable dataset reuse Context Identification, authors, dates, license, version, reference articles Access Format, structure, location (dwld), query method Meaning What do the data represent? What concepts, entities, semantics? Interpretation Units (cm or inches, left/right)… Provenance Acquired with which equipment, parameters, protocols? Derived from which dataset? With which processing? Dataset-level or entity-level provenance Statistics Number of triples per property of class, links to other datasets… …
  • 32. 32Franck MICHEL Several works normalize metadata descritpions and how to use metadata  CSVW: CSV on the Web  DCAT: Data Catalog Vocabulary DCAT extensions, application profiles W3C Dataset Exchange Working Group  VoID: Vocabulary of Interlinked Datasets  HCLS: Health Care & Life Sciences Dataset Profile  …
  • 35. 35Franck MICHEL HCLS: Health Care & Life Sciences Dataset Profile Consensus among stakeholders on the description of datasets using RDF *http://www.w3.org/TR/hcls-dataset/ RDF, RDFS, XSD Citation Typing Ontology Data Catalog (DCAT) Dublin Core Metadata Types, Dublin Core Metadata Terms Friend-of-a-Friend (FOAF) Collection Description Frequency Vocabulary Identifiers.org vocabulary Lexvo.org - Lexical Vocabulary Provenance Authoring and Versioning ontology (PAV) PROV Ontology Semantic science Integrated Ontology (SIO) Vocabulary of Interlinked Datasets (VoID) Used vocabularies
  • 36. 36Franck MICHEL Using a JSON-LD profile to translate JSON into RDF <http://example.org/member/106> foaf:mbox "john@foo.com". <http://example.org/member/106> foaf:mbox "john@example.org". { "id": 106, "firstname": "John", "emails": [ "john@foo.com", "john@example.org" ] } { "@context": { "id": "@id", "@base": "http://example.org/member/" "emails": "http://xmlns.com/foaf/0.1/mbox" } } https://www.w3.org/TR/json-ld/
  • 37. 37Franck MICHEL  Various initial motivations • Web of Data, Linked Data, • OBDA, • Ontology learning, • Schema mapping…  Historical products: D2RQ, Virtuoso…  R2RML mapping language • 2012 W3C recommendation • Several implementations  Several methods: direct mapping vs. domain-specific Translation of RDBs to RDF
  • 38. 38Franck MICHEL All You Need is LOV Linked Open Vocabularies  522 curated vocabularies  Quality requirements • URI stability and availability, • Quality metadata and documentation, • Identifiable and trustable publication body, • Proper versioning policy, • … “Vocabularies provide the semantic glue enabling Data to become meaningful Data.” http://lov.okfn.org/dataset/lov/
  • 39. 39Franck MICHEL Linked Data rules 1.Use URIs as names for things 2.Use HTTP URIs so that people can look up those names 3.When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL) 4.Include links to other URIs, so that they can discover more things

Notes de l'éditeur

  1. Ce matin Manuel a introduit les LD, WoD, URI etc., Olivier a décrit plus en detail RDF, SPARQL. Now: comment alimenter ce WoD avec des données qui ne sont pas en RDF au depart.
  2. On est tous témoin de la multiplication des DS dispo sur le Web. Témoins et Acteurs. Social networks, collaborative wikis, scientific databases, crowd-sourced information. The availability of these DS spurs new ideas and opportunities for DI, but also comes with new challenges: need to capture and share the semantics of data sources.
  3. Transition: So to make sense of them, we need…
  4. SW technologies bring answers to these challenges RDF increasingly used as the pivot format for integrating heterogeneous DS. Transition: c’est précisément l’objet du LOD…
  5. Result of an RDF-based DI process. Starts with translating existing DS into RDF.
  6. INTERACT: Qui a déjà eu à faire de la conversion d’un format vers un autre pour DI ? Qui a déjà eu à faire ce genre de transformation ? Quels outils ?
  7. RDFa définit de nouvelles balises HTML (typeof, resource, property) pour ajouter des métadonnées qui permettent de générer du RDF. Pb: approche invasive
  8. Non invasif, contrairement à RDFa
  9. Create ad-hoc ontology: Class = « Table » Subject is « Table/ID » Property is « Table#column » Object is value of column
  10. Create ad-hoc ontology: Class = « Table » Subject is « Table/ID » Property is « Table#column » Object is value of column
  11. Point of view issue : - biologist vs. taxonomist: est-ce que “espèce” a bien le même sens pour les 2? - surgeon vs. anatomist: un chirurgien va sans hésitation donner la délimitation d’une zone du cerveau, alors que l’anatomiste va prudemment désigner le centre de la zone mais pas ses frontières…
  12. (…) Translate each individual DS into RDF using appropriate vocab = mediation In practice, this mediation follows two common approaches…
  13. Permettre la répétabilité des expériences. INTERACT: pouvez-vous me donner qq exemples de métadonnées ?
  14. VoID et DCAT à décrire
  15. (…) library metadata formats provided by the Library of Congress, BnF, DNB, etc.. (…) published and used by (…) large media corporations (BBC), national administrations (INSEE), EC, universities and research projects, (…) published by individuals and put on the community table, in the tradition and spirit of the open, collaborative Web.”
  16. Retour sur les fondamentaux du LD, les tables de la loi!
  17. On a vu: - Comment décrire nos données avec des métadonnées, - les technos du WS qui peuvent nous aider à publier données et métadonnées - comment créer/réutiliser les vocabulaires qui constituent la référence sémantique de mes données enfin, comment convertir des données existantes en RDF Maintenant, quelles sont les bonnes pratiques pour en faire des données ouvertes et liées…