SlideShare une entreprise Scribd logo
1  sur  19
Enriching Cultural Heritage
Data with DBpedia
Antoine Isaac | DBpedia Community Meeting 2016
Netherlands, Public Domain
1660 - 1625, Rijksmuseum
Anonymous
Arrival of a Portuguese ship
Title here
CC BY-SA
Europeana?
Europeana Essentials
CC BY-SA
Enriching Cultural Heritage Data with DBpedia
CC BY-SA
Europeana Collections homepage
Europeana| CC BY-SA
Title here
CC BY-SA
Title here
CC BY-SA
Europeana Essentials
CC BY-SA
Enriching Cultural Heritage Data with DBpedia
CC BY-SA
Europeana aggregation infrastructure
Europeana| CC BY-SA
Europeana?
Europeana has many data challenges
Enriching Cultural Heritage Data with DBpedia
CC BY-SA
We aggregate very heterogeneous metadata
• More than 48M objects
• 3,500 galleries, libraries, archives and museums
• 50 languages
• From all EU countries
• Level of quality varies greatly
Title here
CC BY-SA
Title here
CC BY-SA
Enriching Cultural Heritage Data with DBpedia
CC BY-SA
Linked Open Data
Europeana Linked Open Data video on Vimeo
Europeana | CC BY-SA
Europeana Linked Data Strategy
Our efforts and lines of work
Enriching Cultural Heritage Data with DBpedia
CC BY-SA
• The Europeana Data Model (EDM) offers a way to represent richer
(linked) data
• We apply an enrichment strategy to link source data to reference
data, including DBpedia
Will be discussed in Parallel Session 2:
• We encourage data providers to contribute links between objects
and (their own) vocabularies
• We encourage alignment activities between domain vocabularies
Title here
CC BY-SA
Title here
CC BY-SA
Europeana Essentials
CC BY-SA
The Europeana Data Model
Enriching Cultural Heritage Data with DBpedia
CC BY-SA
Clavecin, Bartolomeo Cristofori
Cite de la Musique,
MIMO - Musical Instruments Museums Online|CC BY-NC-SA
Europeana Data Model example
Europeana| CC BY-SA
Title here
CC BY-SA
Title here
CC BY-SA
Europeana Essentials
CC BY-SA
Create a “semantic layer” on top of cultural
heritage objects
Enriching Cultural Heritage Data with DBpedia
CC BY-SA
Include multilingual “value vocabularies” (e.g. thesauri represented SKOS)
from Europeana’s providers or from third-party data sources
Semantic enrichment, a solution for better
quality data?
Automatic and manual enrichment are more and more commonly used
in digital libraries to:
• normalise data
• “standardize data” by linking it to authority resources
• improve multilingual coverage in datasets
• contextualise resources
Enriching Cultural Heritage Data with DBpedia
CC BY-SA
The main components of semantic enrichment
CC BY-SA
source objects
whose metadata is
being enriched
set of resources used
to enrich the source
metadata
targets can be of
different types, from
simple uncontrolled
strings to resources
published as LOD
specify how the
enrichment between
the source and target
should be executed.
Source
Target
Rules
Enriching Cultural Heritage Data with DBpedia
Automatic enrichment process in Europeana
CC BY-SA
selection of metadata
fields in descriptions
selection of potential
rules to match
matching the values
of the metadata
fields to values of the
contextual resources
adding contextual
links
selection of values
from the contextual
resource
values go into the
search index
Analysis
Linking
Augmentation of
search index
Enriching Cultural Heritage Data with DBpedia
CC BY-SA
Enriching Cultural Heritage Data with DBpedia
Vocabularies we currently enrich metadata with
CC BY-SA
Enriching Cultural Heritage Data with DBpedia
Entity
Class
Target
vocabulary
Size Metadata Fields subject of Enrichment
Places GeoNames 140,097 dcterms:spatial, dc:coverage
Concepts DBpedia 5,284 dc:subject, dc:type
GEMET 280
Agents DBpedia 161,209 dc:creator, dc:contributor
Time Semium Time 2,566 dc:coverage, dcterms:temporal,
dc:date, edm:year
Why DBpedia?
CC BY-SA
Building an ecosystem of networked references
• It offers labels in about 124 languages through all its
language editions of which 48 match the languages that
Europeana supports
• It gives fairly complete and accurate descriptive metadata
about entities
• Works great as a “pivot” vocabulary, providing further links to
other vocabularies such as Wikidata and Freebase
Not everything is
perfect
France, Public Domain
1921, National Library of France
Agence de presse Meurisse
Colombes : championnats de France d’Athlétisme :
rivière, le speaker
Challenges of multilingual automatic enrichment
Evaluation of metadata enrichment practices in digital libraries: steps towards better data
enrichments
Poisonous India or the Importance of a Semantic and
Multilingual Enrichment Strategy
Marlies Olensky, Juliane Stiller, Evelyn Dröge, MTSR 2012
http://link.springer.com/chapter/10.1007%2F978-3-642-35233-
1_25
Comparative evaluation of enrichments
CC BY-SA
Enriching Cultural Heritage Data with DBpedia
We ran a quantitative evaluation on a sample set enriched by 7 different
tools (settings)
http://pro.europeana.eu/taskforce/evaluation-and-enrichments
Example of Recommendations that will be explored
CC BY-SA
Enriching Cultural Heritage Data with DBpedia
Define your enrichment goals
• Develop better criteria for evaluating enrichment
Choose the right service
• enrichment tool more aware of the semantics of the model
Monitor your enrichment process and re-assess
• target dataset could be richer: new terms, new languages,
more granular
Enrichment using a better reference for contextual entities?
You will hear about this in the next session ☺
Title here
CC BY-SA
Name of image | Creator
Providing organization|
Country, licence
Name of image | Creator
Providing organization| Country, licence
With slides from Valentine Charles, Juliane Stiller, Hugo
Manguinhas and Stefan Gradmann

Contenu connexe

Tendances

Tendances (20)

Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15
 
Semantic Interoperability at Europeana - MultilingualDSIs2018
Semantic Interoperability at Europeana - MultilingualDSIs2018Semantic Interoperability at Europeana - MultilingualDSIs2018
Semantic Interoperability at Europeana - MultilingualDSIs2018
 
W3C Library Linked Data Incubator Group - 2011
W3C Library Linked Data Incubator Group  - 2011W3C Library Linked Data Incubator Group  - 2011
W3C Library Linked Data Incubator Group - 2011
 
Multilingual challenges in Europeana
Multilingual challenges in EuropeanaMultilingual challenges in Europeana
Multilingual challenges in Europeana
 
Modelling and exchanging annotations
Modelling and exchanging annotationsModelling and exchanging annotations
Modelling and exchanging annotations
 
EIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open DataEIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open Data
 
Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018
 
Europeana DSI - LT-Accelerate 14
Europeana DSI -  LT-Accelerate 14Europeana DSI -  LT-Accelerate 14
Europeana DSI - LT-Accelerate 14
 
Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13
 
Culture Hack panel SXSW 2013
Culture Hack panel SXSW 2013Culture Hack panel SXSW 2013
Culture Hack panel SXSW 2013
 
Europeana, more than data aggregation?
Europeana, more than data aggregation?Europeana, more than data aggregation?
Europeana, more than data aggregation?
 
Challenges for the Language Technology Industry
Challenges for the Language Technology IndustryChallenges for the Language Technology Industry
Challenges for the Language Technology Industry
 
Europeana and the relevance of the DM2E results
Europeana and the relevance of the DM2E resultsEuropeana and the relevance of the DM2E results
Europeana and the relevance of the DM2E results
 
Linking data for Europeana
Linking data for EuropeanaLinking data for Europeana
Linking data for Europeana
 
Archaeology in Europeana’s publishing framework
Archaeology in Europeana’s publishing frameworkArchaeology in Europeana’s publishing framework
Archaeology in Europeana’s publishing framework
 
Europeana @ NISO Bibliographic Roadmap Meeting
Europeana @ NISO Bibliographic Roadmap MeetingEuropeana @ NISO Bibliographic Roadmap Meeting
Europeana @ NISO Bibliographic Roadmap Meeting
 
Europeana Research Panel DH Benelux 2017
Europeana Research Panel DH Benelux 2017Europeana Research Panel DH Benelux 2017
Europeana Research Panel DH Benelux 2017
 
Open Science, Open Data: towards a new transparent and reproducible ecosystem
Open Science, Open Data:   towards a new transparent and reproducible ecosystemOpen Science, Open Data:   towards a new transparent and reproducible ecosystem
Open Science, Open Data: towards a new transparent and reproducible ecosystem
 
Europeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) caseEuropeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) case
 
Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013
 

En vedette

En vedette (20)

Baal icsig-2012-Holliday
Baal icsig-2012-HollidayBaal icsig-2012-Holliday
Baal icsig-2012-Holliday
 
Europeana APIs
Europeana APIsEuropeana APIs
Europeana APIs
 
Pundit at 3rd DBpedia Community Meeting 2015
Pundit at 3rd DBpedia Community Meeting 2015Pundit at 3rd DBpedia Community Meeting 2015
Pundit at 3rd DBpedia Community Meeting 2015
 
Using DBpedia for Spotting and Disambiguating Entities
Using DBpedia for Spotting and Disambiguating EntitiesUsing DBpedia for Spotting and Disambiguating Entities
Using DBpedia for Spotting and Disambiguating Entities
 
20150209 improving the_d_bpedia_ontology_v2
20150209 improving the_d_bpedia_ontology_v220150209 improving the_d_bpedia_ontology_v2
20150209 improving the_d_bpedia_ontology_v2
 
Missingbot DBpedia Meeting Dublin 2015
Missingbot DBpedia Meeting Dublin 2015Missingbot DBpedia Meeting Dublin 2015
Missingbot DBpedia Meeting Dublin 2015
 
DBpedia as Gaeilge Chapter
DBpedia as Gaeilge ChapterDBpedia as Gaeilge Chapter
DBpedia as Gaeilge Chapter
 
D bpedia association meeting dublin wkg
D bpedia association meeting dublin wkgD bpedia association meeting dublin wkg
D bpedia association meeting dublin wkg
 
20140130 metadata vocabularies_and_cultural_heritage_final
20140130 metadata vocabularies_and_cultural_heritage_final20140130 metadata vocabularies_and_cultural_heritage_final
20140130 metadata vocabularies_and_cultural_heritage_final
 
Linking Implicit entities - DBpedia Meetup
Linking Implicit entities - DBpedia MeetupLinking Implicit entities - DBpedia Meetup
Linking Implicit entities - DBpedia Meetup
 
DBpedia in the Japanese LOD cloud
DBpedia in the Japanese LOD cloudDBpedia in the Japanese LOD cloud
DBpedia in the Japanese LOD cloud
 
DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016
 
DBpedia Citation Challenge. (Not only) Polish Citations in Wikipedia: analysi...
DBpedia Citation Challenge. (Not only) Polish Citations in Wikipedia: analysi...DBpedia Citation Challenge. (Not only) Polish Citations in Wikipedia: analysi...
DBpedia Citation Challenge. (Not only) Polish Citations in Wikipedia: analysi...
 
DBpedia+ / DBpedia meeting in Dublin
DBpedia+ / DBpedia meeting in DublinDBpedia+ / DBpedia meeting in Dublin
DBpedia+ / DBpedia meeting in Dublin
 
DBpedia i18n - Amsterdam Meeting (30/01/2014)
DBpedia i18n - Amsterdam Meeting (30/01/2014)DBpedia i18n - Amsterdam Meeting (30/01/2014)
DBpedia i18n - Amsterdam Meeting (30/01/2014)
 
8th DBpedia meeting / California 2016
8th DBpedia meeting /  California 20168th DBpedia meeting /  California 2016
8th DBpedia meeting / California 2016
 
Integration of Web Protégé into DBpedia
Integration of Web Protégé into DBpediaIntegration of Web Protégé into DBpedia
Integration of Web Protégé into DBpedia
 
Knowledge Graph Construction and the Role of DBPedia
Knowledge Graph Construction and the Role of DBPediaKnowledge Graph Construction and the Role of DBPedia
Knowledge Graph Construction and the Role of DBPedia
 
The Current State of the National Museum of the Philippines and its Role in P...
The Current State of the National Museum of the Philippines and its Role in P...The Current State of the National Museum of the Philippines and its Role in P...
The Current State of the National Museum of the Philippines and its Role in P...
 
Indigenous Peoples of the Philippines
Indigenous Peoples of the PhilippinesIndigenous Peoples of the Philippines
Indigenous Peoples of the Philippines
 

Similaire à Enriching Cultural Heritage Data with DBpedia

Similaire à Enriching Cultural Heritage Data with DBpedia (20)

The Europeana Community: Semantics and Cultural Heritage Data
The Europeana Community: Semantics and Cultural Heritage DataThe Europeana Community: Semantics and Cultural Heritage Data
The Europeana Community: Semantics and Cultural Heritage Data
 
Links, languages and semantics: linked data approaches in The European Libra...
Links, languages and semantics: linked data approaches in The European Libra...Links, languages and semantics: linked data approaches in The European Libra...
Links, languages and semantics: linked data approaches in The European Libra...
 
Multilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at EuropeanaMultilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at Europeana
 
Building a Framework for Semantic Cultural Heritage Data
Building a Framework for Semantic Cultural Heritage DataBuilding a Framework for Semantic Cultural Heritage Data
Building a Framework for Semantic Cultural Heritage Data
 
UKSG webinar: Introduction to metadata quality – the approach of Europeana Co...
UKSG webinar: Introduction to metadata quality – the approach of Europeana Co...UKSG webinar: Introduction to metadata quality – the approach of Europeana Co...
UKSG webinar: Introduction to metadata quality – the approach of Europeana Co...
 
Valentine Charles: Linking cultural heritage with KOS: the Europeana example
Valentine Charles: Linking cultural heritage with KOS: the Europeana example Valentine Charles: Linking cultural heritage with KOS: the Europeana example
Valentine Charles: Linking cultural heritage with KOS: the Europeana example
 
Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)
Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)
Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)
 
European databases in cultural heritage: making connections
European databases in cultural heritage: making connectionsEuropean databases in cultural heritage: making connections
European databases in cultural heritage: making connections
 
Building an ecosystem of networked references
Building an ecosystem of networked referencesBuilding an ecosystem of networked references
Building an ecosystem of networked references
 
When Semantics support Multilingual Access to Digital Cultural Heritage - the...
When Semantics support Multilingual Access to Digital Cultural Heritage - the...When Semantics support Multilingual Access to Digital Cultural Heritage - the...
When Semantics support Multilingual Access to Digital Cultural Heritage - the...
 
Eun lre brussels_winer20100616
Eun lre brussels_winer20100616Eun lre brussels_winer20100616
Eun lre brussels_winer20100616
 
The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018
 
Alexandria winer20100623
Alexandria winer20100623Alexandria winer20100623
Alexandria winer20100623
 
Data quality in cultural heritage (meta)data
Data quality in cultural heritage (meta)dataData quality in cultural heritage (meta)data
Data quality in cultural heritage (meta)data
 
Sharing Cultural Heritage Online with LoCloud: workshop
Sharing Cultural Heritage Online with LoCloud: workshopSharing Cultural Heritage Online with LoCloud: workshop
Sharing Cultural Heritage Online with LoCloud: workshop
 
Exploring comparative evaluation of semantic enrichment tools for cultural he...
Exploring comparative evaluation of semantic enrichment tools for cultural he...Exploring comparative evaluation of semantic enrichment tools for cultural he...
Exploring comparative evaluation of semantic enrichment tools for cultural he...
 
Data scale and diversity issues at Europeana
Data scale and diversity issues at EuropeanaData scale and diversity issues at Europeana
Data scale and diversity issues at Europeana
 
Linked Open Data Cloud
Linked Open Data CloudLinked Open Data Cloud
Linked Open Data Cloud
 
Tim Hill
Tim HillTim Hill
Tim Hill
 
Des nouvelles d’Europeana
Des nouvelles d’EuropeanaDes nouvelles d’Europeana
Des nouvelles d’Europeana
 

Plus de Antoine Isaac

Plus de Antoine Isaac (12)

Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021
 
Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021
 
Le Cadre de publication d'Europeana
Le Cadre de publication d'EuropeanaLe Cadre de publication d'Europeana
Le Cadre de publication d'Europeana
 
The Europeana Data Model Principles, community and innovation
The Europeana Data Model  Principles, community and innovationThe Europeana Data Model  Principles, community and innovation
The Europeana Data Model Principles, community and innovation
 
Metadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plansMetadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plans
 
IIIF and the Europeana mission
IIIF and the Europeana missionIIIF and the Europeana mission
IIIF and the Europeana mission
 
Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...
 
Europeana et IIIF
Europeana et IIIFEuropeana et IIIF
Europeana et IIIF
 
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data VocabulariesIsaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
 
Modelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WSModelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WS
 
Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...
 
Enrichment and Europeana
Enrichment and EuropeanaEnrichment and Europeana
Enrichment and Europeana
 

Dernier

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Dernier (20)

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 

Enriching Cultural Heritage Data with DBpedia

  • 1. Enriching Cultural Heritage Data with DBpedia Antoine Isaac | DBpedia Community Meeting 2016 Netherlands, Public Domain 1660 - 1625, Rijksmuseum Anonymous Arrival of a Portuguese ship
  • 2. Title here CC BY-SA Europeana? Europeana Essentials CC BY-SA Enriching Cultural Heritage Data with DBpedia CC BY-SA Europeana Collections homepage Europeana| CC BY-SA
  • 3. Title here CC BY-SA Title here CC BY-SA Europeana Essentials CC BY-SA Enriching Cultural Heritage Data with DBpedia CC BY-SA Europeana aggregation infrastructure Europeana| CC BY-SA Europeana?
  • 4. Europeana has many data challenges Enriching Cultural Heritage Data with DBpedia CC BY-SA We aggregate very heterogeneous metadata • More than 48M objects • 3,500 galleries, libraries, archives and museums • 50 languages • From all EU countries • Level of quality varies greatly
  • 5. Title here CC BY-SA Title here CC BY-SA Enriching Cultural Heritage Data with DBpedia CC BY-SA Linked Open Data Europeana Linked Open Data video on Vimeo Europeana | CC BY-SA
  • 6. Europeana Linked Data Strategy Our efforts and lines of work Enriching Cultural Heritage Data with DBpedia CC BY-SA • The Europeana Data Model (EDM) offers a way to represent richer (linked) data • We apply an enrichment strategy to link source data to reference data, including DBpedia Will be discussed in Parallel Session 2: • We encourage data providers to contribute links between objects and (their own) vocabularies • We encourage alignment activities between domain vocabularies
  • 7. Title here CC BY-SA Title here CC BY-SA Europeana Essentials CC BY-SA The Europeana Data Model Enriching Cultural Heritage Data with DBpedia CC BY-SA Clavecin, Bartolomeo Cristofori Cite de la Musique, MIMO - Musical Instruments Museums Online|CC BY-NC-SA Europeana Data Model example Europeana| CC BY-SA
  • 8. Title here CC BY-SA Title here CC BY-SA Europeana Essentials CC BY-SA Create a “semantic layer” on top of cultural heritage objects Enriching Cultural Heritage Data with DBpedia CC BY-SA Include multilingual “value vocabularies” (e.g. thesauri represented SKOS) from Europeana’s providers or from third-party data sources
  • 9. Semantic enrichment, a solution for better quality data? Automatic and manual enrichment are more and more commonly used in digital libraries to: • normalise data • “standardize data” by linking it to authority resources • improve multilingual coverage in datasets • contextualise resources Enriching Cultural Heritage Data with DBpedia CC BY-SA
  • 10. The main components of semantic enrichment CC BY-SA source objects whose metadata is being enriched set of resources used to enrich the source metadata targets can be of different types, from simple uncontrolled strings to resources published as LOD specify how the enrichment between the source and target should be executed. Source Target Rules Enriching Cultural Heritage Data with DBpedia
  • 11. Automatic enrichment process in Europeana CC BY-SA selection of metadata fields in descriptions selection of potential rules to match matching the values of the metadata fields to values of the contextual resources adding contextual links selection of values from the contextual resource values go into the search index Analysis Linking Augmentation of search index Enriching Cultural Heritage Data with DBpedia
  • 12. CC BY-SA Enriching Cultural Heritage Data with DBpedia
  • 13. Vocabularies we currently enrich metadata with CC BY-SA Enriching Cultural Heritage Data with DBpedia Entity Class Target vocabulary Size Metadata Fields subject of Enrichment Places GeoNames 140,097 dcterms:spatial, dc:coverage Concepts DBpedia 5,284 dc:subject, dc:type GEMET 280 Agents DBpedia 161,209 dc:creator, dc:contributor Time Semium Time 2,566 dc:coverage, dcterms:temporal, dc:date, edm:year
  • 14. Why DBpedia? CC BY-SA Building an ecosystem of networked references • It offers labels in about 124 languages through all its language editions of which 48 match the languages that Europeana supports • It gives fairly complete and accurate descriptive metadata about entities • Works great as a “pivot” vocabulary, providing further links to other vocabularies such as Wikidata and Freebase
  • 15. Not everything is perfect France, Public Domain 1921, National Library of France Agence de presse Meurisse Colombes : championnats de France d’Athlétisme : rivière, le speaker
  • 16. Challenges of multilingual automatic enrichment Evaluation of metadata enrichment practices in digital libraries: steps towards better data enrichments Poisonous India or the Importance of a Semantic and Multilingual Enrichment Strategy Marlies Olensky, Juliane Stiller, Evelyn Dröge, MTSR 2012 http://link.springer.com/chapter/10.1007%2F978-3-642-35233- 1_25
  • 17. Comparative evaluation of enrichments CC BY-SA Enriching Cultural Heritage Data with DBpedia We ran a quantitative evaluation on a sample set enriched by 7 different tools (settings) http://pro.europeana.eu/taskforce/evaluation-and-enrichments
  • 18. Example of Recommendations that will be explored CC BY-SA Enriching Cultural Heritage Data with DBpedia Define your enrichment goals • Develop better criteria for evaluating enrichment Choose the right service • enrichment tool more aware of the semantics of the model Monitor your enrichment process and re-assess • target dataset could be richer: new terms, new languages, more granular Enrichment using a better reference for contextual entities? You will hear about this in the next session ☺
  • 19. Title here CC BY-SA Name of image | Creator Providing organization| Country, licence Name of image | Creator Providing organization| Country, licence With slides from Valentine Charles, Juliane Stiller, Hugo Manguinhas and Stefan Gradmann

Notes de l'éditeur

  1. Europeana works with data experts around the world to ensure that the Europeana Data Model describes our cultural heritage material in the best possible way, and in a way that means it can link in with other systems. Europeana Data Model Information about digital cultural heritage comes in a variety of formats. Europeana has developed the Europeana Data Model to ensure that collections from any organisation are treated and displayed in the same ways in Europeana’s systems and services. Data partners often hold their data in a local or standard metadata format in their own systems. That data needs to be mapped, transformed and exported from those systems to EDM for use in Europeana’s systems. EDM has now become an industry standard for cultural heritage data. Since its original release, EDM has permeated the entire portfolio of Europeana products: we ingest, store, enrich and exchange data following a richer, more semantic (and more multilingual!) approach. This work continues, for example, as Europeana prepares to handle more data enrichment, including user annotations. EDM has also been extended to meet the data needs of specific domain aggregators, like Europeana Sounds, and address the requirements of new data services and enrichment in Europeana's main platform. EDM is now used by Europeana and several other cultural aggregators, such as DPLA and DDB. In true linked data fashion, EDM "profiles" can be developed without Europeana having to update the core model anymore. Exploiting the data expressed with these profiles across different systems still requires work. But it is no longer impossible to realise the vision where the design of data models is decentralised and tailored to specific applications, while the data created and exchanged with them still forms together a vast, semantically interoperable knowledge environment.   See more at: http://pro.europeana.eu/blogpost/the-europeana-data-model-a-living-model-5-years-on#sthash.9HetKLZU.dpuf
  2. Image url:http://www.europeana.eu/portal/record/90402/SK_A_3899.html Copyright url:http://creativecommons.org/publicdomain/mark/1.0/