SlideShare une entreprise Scribd logo
1  sur  36
Multilingual challenges and
ongoing work to tackle them at
Europeana
Antoine Isaac
with slides from Hugo Manguinhas, Valentine Charles, Juliane Stiller,
Mónica Marrero
CLUBS final workshop
Saarbrücken, 7 June 2019
What is Europeana?
● Europeana is the European Commission's digital
platform for cultural heritage.
● We provide access to digital collections from libraries,
museums and archives around Europe
Title here
CC BY-SACC BY-SA
A network of data partners
● Data providers: Cultural heritage institutions providing content and metadata
to Europeana
● "Intermediate” Aggregators: gathering
metadata and content for institutions
from a specific country, sector, or on a
specific domain (music, archaeology,
theater…) and making it available for
Europeana and other data consumers
Title here
CC BY-SACC BY-SA
57 million digitized objects, from
3,700 institutions in 44 countries
Title here
CC BY-SACC BY-SA
What’s multilingual in there?
We working for a multilingual audience, in Europe and
elsewhere
● Dynamic content - user interface
● Static content such as curated virtual exhibitions
● Search - returning cultural object for users’ queries
Title here
CC BY-SACC BY-SA
Title here
CC BY-SA
Multilinguality in Europeana search
Users from all around the
world querying in their
own language
Metadata
in 37
languages
Title here
CC BY-SACC BY-SA
The data Europeana holds about
cultural objects
● Descriptive and technical metadata
● Creator, title, rights…
● Some content for specific projects
● newspapers text and images
Title here
CC BY-SA
Massively multilingual
metadata
Europeana Essentials
CC BY-SACC BY-SA
57 million digitized objects, from 3,700 institutions in 44
countries
● Officially we get metadata in 37 languages
● But there are more languages used in individual
metadata fields
Title here
CC BY-SA
Europeana Essentials
CC BY-SACC BY-SA
Work by Péter Kiraly (Göttingen Research alliance)
http://144.76.218.178/europeana-
qa/languages.php?collectionId=all&field=aggregated
Title here
CC BY-SA
Europeana Essentials
CC BY-SACC BY-SA
Data analysis by Péter Kiraly (Göttingen Research alliance)
Title here
CC BY-SA
Massively multilingual
metadata
Europeana Essentials
CC BY-SACC BY-SA
● Officially we get metadata in 37 languages
● But there are more languages used in individual
metadata fields
• Over 400 language codes
• E.g., 6 values in x-aramaic-latn - not a valid code by the way
• But the most common case is lack of language information!
France, Public Domain
1932, National Library of France
Agence de presse Mondial Photo-Presse.
Tournoi royal de motos à Londres :
changement d'une roue de side-car en
marche
How to tackle these
issues?
1. Data Modeling for
richer, multilingual
data
Title here
CC BY-SACC BY-SA
Following the Linked Open Data principles
http://vimeo.com/36752317
Title here
CC BY-SA
Title here
CC BY-SA
Europeana Essentials
CC BY-SA
The Europana Data Model as handle
for multilingual data
CC BY-SA
Clavecin, Bartolomeo Cristofori
Cite de la Musique,
MIMO - Musical Instruments Museums Online|CC BY-NC-SA
Europeana Data Model example
Here, from the Musical Instrument Museums Online (MIMO) project
France, Public Domain
1932, National Library of France
Agence de presse Mondial Photo-Presse.
Tournoi royal de motos à Londres :
changement d'une roue de side-car en
marche
How to tackle these
issues?
2. Enriching
metadata
Title here
CC BY-SACC BY-SA
A Linked Data Strategy for Europeana
• The Europeana Data Model (EDM) gives a base for (linking to)
multilingual metadata
• data as resources (with URIs!), not only strings
• We encourage data providers to contribute their own links to
vocabularies
Title here
CC BY-SACC BY-SA
Related to a general effort on quality
We have set up a Data Quality Committee working on
recommendations for the Europeana community on:
○ Mandatory metadata elements
○ Metadata normalization
○ Multilingualism
…
http://pro.europeana.eu/get-involved/europeana-tech/data-quality-committee
Title here
CC BY-SACC BY-SA
A general effort on quality
https://pro.europeana.eu/post/publishing-framework
Title here
CC BY-SACC BY-SA
LOD Vocabularies currently recognized by Europeana in providers'
metadata:
Vocabulary URL
MIMO Concepts http://www.mimo-db.eu/
MIMO Instrument makers http://www.mimo-db.eu/
The Getty - Art & Architecture Thesaurus (AAT) http://vocab.getty.edu/
The Getty - Union List of Artist Names (ULAN) http://vocab.getty.edu/
Virtual International Authority File (VIAF) http://viaf.org/viaf/
Geonames http://sws.geonames.org/
IconClass http://iconclass.org/
Gemeinsame Normdatei (GND) http://d-nb.info/gnd
Israel Museum Jerusalem Concepts http://www.imj.org.il/imagine/thesaurus/objects/
Partage Plus concepts http://partage.vocnet.org/
data.europeana.eu WWI Concepts from Library of Congress
Subject Headings (LCSH) http://data.europeana.eu/concept/loc
Europeana Sounds Genres http://data.europeana.eu/concept/soundgenres/
EAGLE Material & Object Type http://www.eagle-network.eu/voc/
DISMARC Formats & Genres http://purl.org/dismarc/ns/
UDC http://udcdata.info/rdf/
UNESCO Thesaurus http://vocabularies.unesco.org/thesaurus/
NB: providers can send data from their local vocabularies too –
if they embed it in the EDM metadata they send to us!
Title here
CC BY-SACC BY-SA
A Linked Data Strategy for Europeana
CC BY-SA
• The Europeana Data Model (EDM) offers a base for linking
metadata
• Aims at providing data as resources (with URIs!), not only strings
• Enables the development of a multilingual data environment
• We encourage data providers to contribute their own links to
vocabularies
• In parallel, we apply automatic enrichment to link object
metadata to reference datasets
Title here
CC BY-SACC BY-SA
A Linked Data Strategy for Europeana –
Contextual Entities
CC BY-SA
We are building an "Entity Collection"
• Centralized point of reference and access to data about contextual
entities: places, agents (persons and organizations), concepts...
• Caching and curating data from the wider Linked Open Data cloud
• A sort of Europeana "knowledge graph"
Title here
CC BY-SACC BY-SA
Uses cases for the Entity Collection
CC BY-SA
Improve user experience on Europeana services
● Findability: users can search with and for people, places and subjects, not only objects. In many
more languages, and with less ambiguity
● Contextualization: users see contextual information about cultural heritage objects. Entity Pages
group and present all assertions about an entity
…
Entity Pages
Semantic auto-
completion
Data currently in the Entity Collection
CC BY-SA
• Places
a subset of Geonames, corresponding to places which are part of
European countries and of some specific feature classes.
• Agents
a subset of DBpedia corresponding to most of the instances of dbp:Artist
with some exceptions, and integrated from 49 DBpedia language editions.
• Concepts
a subset of DBpedia corresponding to a selection concepts matching the
needs from Europeana Collections (e.g., WWI battles).
Europeana Sounds music genres (obtained from Wikidata)
Photo Consortium's photography vocabulary
• Organizations
Extracted from Europeana's CRM and aligned to Wikidata when possible
216,302
resources
1,572
resources
165,005
resources
1,077
resources
Selecting data sources
CC BY-SA
• Availability and access: open license, published on the web as linked
data
• Granularity, size and coverage: multilingual data, helping to answer
key user needs for Europeana's CH collections. Too generic or large
datasets can create too much ambiguity for the simple processes we have
(e.g. enrichment)
• Quality: intrinsic aspects like correctness of representation (data
structures)
• Connectivity: good data sources are well-connected internally and
externally to other datasets
An example
DBpedia resource for “Mozart” in our data
CC BY-SA
Coreference links to 6 other
datasets
(e.g. Freebase, Wikidata)
Inter-linking information
Preferred labels for 48
languages
An enrichment example
Links to contextual entities
And what it allows
And what it allows
Title here
CC BY-SACC BY-SA
The Entity Collection’s contribution to multilingual
coverage
Entities effectively used to enrich Europeana Objects
Entities present in the Entity Collection
Title here
CC BY-SACC BY-SA
Multilingual enrichment is not easy!
Poisonous India or the Importance of a Semantic and Multilingual
Enrichment Strategy
Marlies Olensky, Juliane Stiller, Evelyn Dröge, MTSR 2012
http://link.springer.com/chapter/10.1007%2F978-3-642-35233-
1_25
Title here
CC BY-SACC BY-SA
We’re not really finished!
France, Public Domain
1914, National Library of France
Agence de presse Meurisse
Concours de cycles nautiques sur le lac
d’Enghien : Berregent piloté par Austerling
Work in progress – the Entity Collection
CC BY-SA
• We've made enough progress to release a first
version of the Entity Collection used in Europeana's
production services.
• But there are still challenges:
• Expand data coverage with new data sources
• For existing types of contextual entities
• For new types, e.g., events
• Better enrich Europeana object metadata
Work in progress - UI
Work in progress – Static content
Automatic translation?
CC BY-SA
• First experiments with the European Commission’s eTranslation
service, on curated (static) content
• So far it needs to be better quality, or applied to other
Europeana areas, like object display or search
Result ranked 1: French
Result ranked 2: Spanish
Result ranked 3: Polish
search
results
query
Search only in corresponding language
Approach 1
Query translation
Approach 2
Metadata/conten
t translation
Title here
CC BY-SACC BY-SA
Title here
CC BY-SA
Name of image | Creator
Providing organization|
Country, licence
Name of image | Creator
Providing organization| Country, licence
antoine.isaac@europeana.eu
@antoine_isaac

Contenu connexe

Tendances

Europeana, more than data aggregation?
Europeana, more than data aggregation?Europeana, more than data aggregation?
Europeana, more than data aggregation?Antoine Isaac
 
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015Antoine Isaac
 
EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015Antoine Isaac
 
Europeana @ NISO Bibliographic Roadmap Meeting
Europeana @ NISO Bibliographic Roadmap MeetingEuropeana @ NISO Bibliographic Roadmap Meeting
Europeana @ NISO Bibliographic Roadmap MeetingAntoine Isaac
 
Europeana DSI - LT-Accelerate 14
Europeana DSI -  LT-Accelerate 14Europeana DSI -  LT-Accelerate 14
Europeana DSI - LT-Accelerate 14Antoine Isaac
 
Europeana and the relevance of the DM2E results
Europeana and the relevance of the DM2E resultsEuropeana and the relevance of the DM2E results
Europeana and the relevance of the DM2E resultsAntoine Isaac
 
Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13Antoine Isaac
 
Europeana - American Art Collaborative LOD Meeting
Europeana - American Art Collaborative LOD MeetingEuropeana - American Art Collaborative LOD Meeting
Europeana - American Art Collaborative LOD MeetingAntoine Isaac
 
Enriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaEnriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaAntoine Isaac
 
Linking data for Europeana
Linking data for EuropeanaLinking data for Europeana
Linking data for EuropeanaAntoine Isaac
 
Semantic Web, Linked Data: the Europeana case(s)
Semantic Web, Linked Data: the Europeana case(s)Semantic Web, Linked Data: the Europeana case(s)
Semantic Web, Linked Data: the Europeana case(s)Antoine Isaac
 
Europeana vision - Web as Literature 2013
Europeana vision - Web as Literature 2013Europeana vision - Web as Literature 2013
Europeana vision - Web as Literature 2013Antoine Isaac
 
EDM - American Art Collaborative LOD Meeting
EDM - American Art Collaborative LOD MeetingEDM - American Art Collaborative LOD Meeting
EDM - American Art Collaborative LOD MeetingAntoine Isaac
 
EuropeanaConnect WP4 - Europeana Licensing Framework
EuropeanaConnect WP4 - Europeana Licensing Framework EuropeanaConnect WP4 - Europeana Licensing Framework
EuropeanaConnect WP4 - Europeana Licensing Framework EuropeanaConnect
 
Consolidating Openness : Developing Rijksmuseum Research Services
Consolidating Openness : Developing Rijksmuseum Research ServicesConsolidating Openness : Developing Rijksmuseum Research Services
Consolidating Openness : Developing Rijksmuseum Research ServicesSaskia Scheltjens
 
Europeana creative sound archives and social networks @ British and Irish S...
Europeana creative  sound archives and social networks  @ British and Irish S...Europeana creative  sound archives and social networks  @ British and Irish S...
Europeana creative sound archives and social networks @ British and Irish S...Lizzy Komen
 
European databases in cultural heritage: making connections
European databases in cultural heritage: making connectionsEuropean databases in cultural heritage: making connections
European databases in cultural heritage: making connectionsCARARE
 
Exploring Audiovisual Archives through Aligned Thesauri
Exploring Audiovisual Archives through Aligned Thesauri Exploring Audiovisual Archives through Aligned Thesauri
Exploring Audiovisual Archives through Aligned Thesauri Victor de Boer
 
Achieving Interoperability between the CARARE Schema for Monuments and Sites ...
Achieving Interoperability between the CARARE Schema for Monuments and Sites ...Achieving Interoperability between the CARARE Schema for Monuments and Sites ...
Achieving Interoperability between the CARARE Schema for Monuments and Sites ...Antoine Isaac
 
Francesca Schulze - Europeana Licensing 062013
Francesca Schulze - Europeana Licensing 062013Francesca Schulze - Europeana Licensing 062013
Francesca Schulze - Europeana Licensing 062013Europeana Licensing
 

Tendances (20)

Europeana, more than data aggregation?
Europeana, more than data aggregation?Europeana, more than data aggregation?
Europeana, more than data aggregation?
 
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015
 
EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015
 
Europeana @ NISO Bibliographic Roadmap Meeting
Europeana @ NISO Bibliographic Roadmap MeetingEuropeana @ NISO Bibliographic Roadmap Meeting
Europeana @ NISO Bibliographic Roadmap Meeting
 
Europeana DSI - LT-Accelerate 14
Europeana DSI -  LT-Accelerate 14Europeana DSI -  LT-Accelerate 14
Europeana DSI - LT-Accelerate 14
 
Europeana and the relevance of the DM2E results
Europeana and the relevance of the DM2E resultsEuropeana and the relevance of the DM2E results
Europeana and the relevance of the DM2E results
 
Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13
 
Europeana - American Art Collaborative LOD Meeting
Europeana - American Art Collaborative LOD MeetingEuropeana - American Art Collaborative LOD Meeting
Europeana - American Art Collaborative LOD Meeting
 
Enriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaEnriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpedia
 
Linking data for Europeana
Linking data for EuropeanaLinking data for Europeana
Linking data for Europeana
 
Semantic Web, Linked Data: the Europeana case(s)
Semantic Web, Linked Data: the Europeana case(s)Semantic Web, Linked Data: the Europeana case(s)
Semantic Web, Linked Data: the Europeana case(s)
 
Europeana vision - Web as Literature 2013
Europeana vision - Web as Literature 2013Europeana vision - Web as Literature 2013
Europeana vision - Web as Literature 2013
 
EDM - American Art Collaborative LOD Meeting
EDM - American Art Collaborative LOD MeetingEDM - American Art Collaborative LOD Meeting
EDM - American Art Collaborative LOD Meeting
 
EuropeanaConnect WP4 - Europeana Licensing Framework
EuropeanaConnect WP4 - Europeana Licensing Framework EuropeanaConnect WP4 - Europeana Licensing Framework
EuropeanaConnect WP4 - Europeana Licensing Framework
 
Consolidating Openness : Developing Rijksmuseum Research Services
Consolidating Openness : Developing Rijksmuseum Research ServicesConsolidating Openness : Developing Rijksmuseum Research Services
Consolidating Openness : Developing Rijksmuseum Research Services
 
Europeana creative sound archives and social networks @ British and Irish S...
Europeana creative  sound archives and social networks  @ British and Irish S...Europeana creative  sound archives and social networks  @ British and Irish S...
Europeana creative sound archives and social networks @ British and Irish S...
 
European databases in cultural heritage: making connections
European databases in cultural heritage: making connectionsEuropean databases in cultural heritage: making connections
European databases in cultural heritage: making connections
 
Exploring Audiovisual Archives through Aligned Thesauri
Exploring Audiovisual Archives through Aligned Thesauri Exploring Audiovisual Archives through Aligned Thesauri
Exploring Audiovisual Archives through Aligned Thesauri
 
Achieving Interoperability between the CARARE Schema for Monuments and Sites ...
Achieving Interoperability between the CARARE Schema for Monuments and Sites ...Achieving Interoperability between the CARARE Schema for Monuments and Sites ...
Achieving Interoperability between the CARARE Schema for Monuments and Sites ...
 
Francesca Schulze - Europeana Licensing 062013
Francesca Schulze - Europeana Licensing 062013Francesca Schulze - Europeana Licensing 062013
Francesca Schulze - Europeana Licensing 062013
 

Similaire à Multilingual challenges and ongoing work to tackle them at Europeana

Everything you need to know about Europeana
Everything you need to know about EuropeanaEverything you need to know about Europeana
Everything you need to know about EuropeanaDouglas McCarthy
 
Linked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approachLinked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approachValentine Charles
 
Metadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plansMetadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plansAntoine Isaac
 
Building a Framework for Semantic Cultural Heritage Data
Building a Framework for Semantic Cultural Heritage DataBuilding a Framework for Semantic Cultural Heritage Data
Building a Framework for Semantic Cultural Heritage DataValentine Charles
 
Everything you need to know about Europeana, Budapest, 1 June 2018
Everything you need to know about Europeana, Budapest, 1 June 2018Everything you need to know about Europeana, Budapest, 1 June 2018
Everything you need to know about Europeana, Budapest, 1 June 2018Douglas McCarthy
 
Next Generation Research with Europeana: the Humanities and Cultural Heritage...
Next Generation Research with Europeana: the Humanities and Cultural Heritage...Next Generation Research with Europeana: the Humanities and Cultural Heritage...
Next Generation Research with Europeana: the Humanities and Cultural Heritage...Nuno Freire
 
Culture Hack panel SXSW 2013
Culture Hack panel SXSW 2013Culture Hack panel SXSW 2013
Culture Hack panel SXSW 2013Antoine Isaac
 
Des nouvelles d’Europeana
Des nouvelles d’EuropeanaDes nouvelles d’Europeana
Des nouvelles d’EuropeanaDouglas McCarthy
 
The Europeana Community: Semantics and Cultural Heritage Data
The Europeana Community: Semantics and Cultural Heritage DataThe Europeana Community: Semantics and Cultural Heritage Data
The Europeana Community: Semantics and Cultural Heritage DataNuno Freire
 
Europeana at Ten: insights from our first decade
Europeana at Ten: insights from our first decadeEuropeana at Ten: insights from our first decade
Europeana at Ten: insights from our first decadeDouglas McCarthy
 
LDBC 19 November 2013
LDBC 19 November 2013  LDBC 19 November 2013
LDBC 19 November 2013 Europeana
 
Challenges for the Language Technology Industry
Challenges for the Language Technology IndustryChallenges for the Language Technology Industry
Challenges for the Language Technology IndustryAntoine Isaac
 
IIIF and the Europeana mission
IIIF and the Europeana missionIIIF and the Europeana mission
IIIF and the Europeana missionAntoine Isaac
 
Rio Info 2009 - Europeana - Bram van der Werf
Rio Info 2009 - Europeana - Bram van der WerfRio Info 2009 - Europeana - Bram van der Werf
Rio Info 2009 - Europeana - Bram van der WerfRio Info
 
Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...
Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...
Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...Nuno Freire
 
53 million objects! Now what?
53 million objects! Now what?53 million objects! Now what?
53 million objects! Now what?David Haskiya
 
From Enriched Museum Collections to Social Web and TV: Seminar at BBC
From Enriched Museum Collections to Social Web and TV: Seminar at BBCFrom Enriched Museum Collections to Social Web and TV: Seminar at BBC
From Enriched Museum Collections to Social Web and TV: Seminar at BBCLora Aroyo
 
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...DeVonne Parks, CEM
 
When Semantics support Multilingual Access to Digital Cultural Heritage - the...
When Semantics support Multilingual Access to Digital Cultural Heritage - the...When Semantics support Multilingual Access to Digital Cultural Heritage - the...
When Semantics support Multilingual Access to Digital Cultural Heritage - the...Valentine Charles
 

Similaire à Multilingual challenges and ongoing work to tackle them at Europeana (20)

Everything you need to know about Europeana
Everything you need to know about EuropeanaEverything you need to know about Europeana
Everything you need to know about Europeana
 
Linked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approachLinked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approach
 
Metadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plansMetadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plans
 
Building a Framework for Semantic Cultural Heritage Data
Building a Framework for Semantic Cultural Heritage DataBuilding a Framework for Semantic Cultural Heritage Data
Building a Framework for Semantic Cultural Heritage Data
 
Everything you need to know about Europeana, Budapest, 1 June 2018
Everything you need to know about Europeana, Budapest, 1 June 2018Everything you need to know about Europeana, Budapest, 1 June 2018
Everything you need to know about Europeana, Budapest, 1 June 2018
 
Next Generation Research with Europeana: the Humanities and Cultural Heritage...
Next Generation Research with Europeana: the Humanities and Cultural Heritage...Next Generation Research with Europeana: the Humanities and Cultural Heritage...
Next Generation Research with Europeana: the Humanities and Cultural Heritage...
 
Culture Hack panel SXSW 2013
Culture Hack panel SXSW 2013Culture Hack panel SXSW 2013
Culture Hack panel SXSW 2013
 
Europeana datainaction nov2012
Europeana datainaction nov2012Europeana datainaction nov2012
Europeana datainaction nov2012
 
Des nouvelles d’Europeana
Des nouvelles d’EuropeanaDes nouvelles d’Europeana
Des nouvelles d’Europeana
 
The Europeana Community: Semantics and Cultural Heritage Data
The Europeana Community: Semantics and Cultural Heritage DataThe Europeana Community: Semantics and Cultural Heritage Data
The Europeana Community: Semantics and Cultural Heritage Data
 
Europeana at Ten: insights from our first decade
Europeana at Ten: insights from our first decadeEuropeana at Ten: insights from our first decade
Europeana at Ten: insights from our first decade
 
LDBC 19 November 2013
LDBC 19 November 2013  LDBC 19 November 2013
LDBC 19 November 2013
 
Challenges for the Language Technology Industry
Challenges for the Language Technology IndustryChallenges for the Language Technology Industry
Challenges for the Language Technology Industry
 
IIIF and the Europeana mission
IIIF and the Europeana missionIIIF and the Europeana mission
IIIF and the Europeana mission
 
Rio Info 2009 - Europeana - Bram van der Werf
Rio Info 2009 - Europeana - Bram van der WerfRio Info 2009 - Europeana - Bram van der Werf
Rio Info 2009 - Europeana - Bram van der Werf
 
Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...
Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...
Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...
 
53 million objects! Now what?
53 million objects! Now what?53 million objects! Now what?
53 million objects! Now what?
 
From Enriched Museum Collections to Social Web and TV: Seminar at BBC
From Enriched Museum Collections to Social Web and TV: Seminar at BBCFrom Enriched Museum Collections to Social Web and TV: Seminar at BBC
From Enriched Museum Collections to Social Web and TV: Seminar at BBC
 
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
 
When Semantics support Multilingual Access to Digital Cultural Heritage - the...
When Semantics support Multilingual Access to Digital Cultural Heritage - the...When Semantics support Multilingual Access to Digital Cultural Heritage - the...
When Semantics support Multilingual Access to Digital Cultural Heritage - the...
 

Plus de Antoine Isaac

Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021Antoine Isaac
 
Le Cadre de publication d'Europeana
Le Cadre de publication d'EuropeanaLe Cadre de publication d'Europeana
Le Cadre de publication d'EuropeanaAntoine Isaac
 
Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...Antoine Isaac
 
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data VocabulariesIsaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data VocabulariesAntoine Isaac
 
Modelling and exchanging annotations
Modelling and exchanging annotationsModelling and exchanging annotations
Modelling and exchanging annotationsAntoine Isaac
 
Modelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WSModelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WSAntoine Isaac
 
Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...Antoine Isaac
 
Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15Antoine Isaac
 
AAC Education Session
AAC Education Session AAC Education Session
AAC Education Session Antoine Isaac
 
EIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open DataEIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open DataAntoine Isaac
 
Enrichment and Europeana
Enrichment and EuropeanaEnrichment and Europeana
Enrichment and EuropeanaAntoine Isaac
 
A portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data caseA portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data caseAntoine Isaac
 

Plus de Antoine Isaac (14)

Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021
 
Le Cadre de publication d'Europeana
Le Cadre de publication d'EuropeanaLe Cadre de publication d'Europeana
Le Cadre de publication d'Europeana
 
Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...
 
Europeana et IIIF
Europeana et IIIFEuropeana et IIIF
Europeana et IIIF
 
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data VocabulariesIsaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
 
Europeana APIs
Europeana APIsEuropeana APIs
Europeana APIs
 
Modelling and exchanging annotations
Modelling and exchanging annotationsModelling and exchanging annotations
Modelling and exchanging annotations
 
Modelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WSModelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WS
 
Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...
 
Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15
 
AAC Education Session
AAC Education Session AAC Education Session
AAC Education Session
 
EIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open DataEIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open Data
 
Enrichment and Europeana
Enrichment and EuropeanaEnrichment and Europeana
Enrichment and Europeana
 
A portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data caseA portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data case
 

Dernier

Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 

Dernier (20)

Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 

Multilingual challenges and ongoing work to tackle them at Europeana

  • 1. Multilingual challenges and ongoing work to tackle them at Europeana Antoine Isaac with slides from Hugo Manguinhas, Valentine Charles, Juliane Stiller, Mónica Marrero CLUBS final workshop Saarbrücken, 7 June 2019
  • 2. What is Europeana? ● Europeana is the European Commission's digital platform for cultural heritage. ● We provide access to digital collections from libraries, museums and archives around Europe
  • 3. Title here CC BY-SACC BY-SA A network of data partners ● Data providers: Cultural heritage institutions providing content and metadata to Europeana ● "Intermediate” Aggregators: gathering metadata and content for institutions from a specific country, sector, or on a specific domain (music, archaeology, theater…) and making it available for Europeana and other data consumers
  • 4. Title here CC BY-SACC BY-SA 57 million digitized objects, from 3,700 institutions in 44 countries
  • 5. Title here CC BY-SACC BY-SA What’s multilingual in there? We working for a multilingual audience, in Europe and elsewhere ● Dynamic content - user interface ● Static content such as curated virtual exhibitions ● Search - returning cultural object for users’ queries
  • 6. Title here CC BY-SACC BY-SA Title here CC BY-SA Multilinguality in Europeana search Users from all around the world querying in their own language Metadata in 37 languages
  • 7. Title here CC BY-SACC BY-SA The data Europeana holds about cultural objects ● Descriptive and technical metadata ● Creator, title, rights… ● Some content for specific projects ● newspapers text and images
  • 8. Title here CC BY-SA Massively multilingual metadata Europeana Essentials CC BY-SACC BY-SA 57 million digitized objects, from 3,700 institutions in 44 countries ● Officially we get metadata in 37 languages ● But there are more languages used in individual metadata fields
  • 9. Title here CC BY-SA Europeana Essentials CC BY-SACC BY-SA Work by Péter Kiraly (Göttingen Research alliance) http://144.76.218.178/europeana- qa/languages.php?collectionId=all&field=aggregated
  • 10. Title here CC BY-SA Europeana Essentials CC BY-SACC BY-SA Data analysis by Péter Kiraly (Göttingen Research alliance)
  • 11. Title here CC BY-SA Massively multilingual metadata Europeana Essentials CC BY-SACC BY-SA ● Officially we get metadata in 37 languages ● But there are more languages used in individual metadata fields • Over 400 language codes • E.g., 6 values in x-aramaic-latn - not a valid code by the way • But the most common case is lack of language information!
  • 12. France, Public Domain 1932, National Library of France Agence de presse Mondial Photo-Presse. Tournoi royal de motos à Londres : changement d'une roue de side-car en marche How to tackle these issues? 1. Data Modeling for richer, multilingual data
  • 13. Title here CC BY-SACC BY-SA Following the Linked Open Data principles http://vimeo.com/36752317
  • 14. Title here CC BY-SA Title here CC BY-SA Europeana Essentials CC BY-SA The Europana Data Model as handle for multilingual data CC BY-SA Clavecin, Bartolomeo Cristofori Cite de la Musique, MIMO - Musical Instruments Museums Online|CC BY-NC-SA Europeana Data Model example Here, from the Musical Instrument Museums Online (MIMO) project
  • 15. France, Public Domain 1932, National Library of France Agence de presse Mondial Photo-Presse. Tournoi royal de motos à Londres : changement d'une roue de side-car en marche How to tackle these issues? 2. Enriching metadata
  • 16. Title here CC BY-SACC BY-SA A Linked Data Strategy for Europeana • The Europeana Data Model (EDM) gives a base for (linking to) multilingual metadata • data as resources (with URIs!), not only strings • We encourage data providers to contribute their own links to vocabularies
  • 17. Title here CC BY-SACC BY-SA Related to a general effort on quality We have set up a Data Quality Committee working on recommendations for the Europeana community on: ○ Mandatory metadata elements ○ Metadata normalization ○ Multilingualism … http://pro.europeana.eu/get-involved/europeana-tech/data-quality-committee
  • 18. Title here CC BY-SACC BY-SA A general effort on quality https://pro.europeana.eu/post/publishing-framework
  • 19. Title here CC BY-SACC BY-SA LOD Vocabularies currently recognized by Europeana in providers' metadata: Vocabulary URL MIMO Concepts http://www.mimo-db.eu/ MIMO Instrument makers http://www.mimo-db.eu/ The Getty - Art & Architecture Thesaurus (AAT) http://vocab.getty.edu/ The Getty - Union List of Artist Names (ULAN) http://vocab.getty.edu/ Virtual International Authority File (VIAF) http://viaf.org/viaf/ Geonames http://sws.geonames.org/ IconClass http://iconclass.org/ Gemeinsame Normdatei (GND) http://d-nb.info/gnd Israel Museum Jerusalem Concepts http://www.imj.org.il/imagine/thesaurus/objects/ Partage Plus concepts http://partage.vocnet.org/ data.europeana.eu WWI Concepts from Library of Congress Subject Headings (LCSH) http://data.europeana.eu/concept/loc Europeana Sounds Genres http://data.europeana.eu/concept/soundgenres/ EAGLE Material & Object Type http://www.eagle-network.eu/voc/ DISMARC Formats & Genres http://purl.org/dismarc/ns/ UDC http://udcdata.info/rdf/ UNESCO Thesaurus http://vocabularies.unesco.org/thesaurus/ NB: providers can send data from their local vocabularies too – if they embed it in the EDM metadata they send to us!
  • 20. Title here CC BY-SACC BY-SA A Linked Data Strategy for Europeana CC BY-SA • The Europeana Data Model (EDM) offers a base for linking metadata • Aims at providing data as resources (with URIs!), not only strings • Enables the development of a multilingual data environment • We encourage data providers to contribute their own links to vocabularies • In parallel, we apply automatic enrichment to link object metadata to reference datasets
  • 21. Title here CC BY-SACC BY-SA A Linked Data Strategy for Europeana – Contextual Entities CC BY-SA We are building an "Entity Collection" • Centralized point of reference and access to data about contextual entities: places, agents (persons and organizations), concepts... • Caching and curating data from the wider Linked Open Data cloud • A sort of Europeana "knowledge graph"
  • 22. Title here CC BY-SACC BY-SA Uses cases for the Entity Collection CC BY-SA Improve user experience on Europeana services ● Findability: users can search with and for people, places and subjects, not only objects. In many more languages, and with less ambiguity ● Contextualization: users see contextual information about cultural heritage objects. Entity Pages group and present all assertions about an entity … Entity Pages Semantic auto- completion
  • 23. Data currently in the Entity Collection CC BY-SA • Places a subset of Geonames, corresponding to places which are part of European countries and of some specific feature classes. • Agents a subset of DBpedia corresponding to most of the instances of dbp:Artist with some exceptions, and integrated from 49 DBpedia language editions. • Concepts a subset of DBpedia corresponding to a selection concepts matching the needs from Europeana Collections (e.g., WWI battles). Europeana Sounds music genres (obtained from Wikidata) Photo Consortium's photography vocabulary • Organizations Extracted from Europeana's CRM and aligned to Wikidata when possible 216,302 resources 1,572 resources 165,005 resources 1,077 resources
  • 24. Selecting data sources CC BY-SA • Availability and access: open license, published on the web as linked data • Granularity, size and coverage: multilingual data, helping to answer key user needs for Europeana's CH collections. Too generic or large datasets can create too much ambiguity for the simple processes we have (e.g. enrichment) • Quality: intrinsic aspects like correctness of representation (data structures) • Connectivity: good data sources are well-connected internally and externally to other datasets
  • 25. An example DBpedia resource for “Mozart” in our data CC BY-SA Coreference links to 6 other datasets (e.g. Freebase, Wikidata) Inter-linking information Preferred labels for 48 languages
  • 26. An enrichment example Links to contextual entities
  • 27. And what it allows
  • 28. And what it allows
  • 29. Title here CC BY-SACC BY-SA The Entity Collection’s contribution to multilingual coverage Entities effectively used to enrich Europeana Objects Entities present in the Entity Collection
  • 30. Title here CC BY-SACC BY-SA Multilingual enrichment is not easy! Poisonous India or the Importance of a Semantic and Multilingual Enrichment Strategy Marlies Olensky, Juliane Stiller, Evelyn Dröge, MTSR 2012 http://link.springer.com/chapter/10.1007%2F978-3-642-35233- 1_25
  • 31. Title here CC BY-SACC BY-SA We’re not really finished! France, Public Domain 1914, National Library of France Agence de presse Meurisse Concours de cycles nautiques sur le lac d’Enghien : Berregent piloté par Austerling
  • 32. Work in progress – the Entity Collection CC BY-SA • We've made enough progress to release a first version of the Entity Collection used in Europeana's production services. • But there are still challenges: • Expand data coverage with new data sources • For existing types of contextual entities • For new types, e.g., events • Better enrich Europeana object metadata
  • 34. Work in progress – Static content
  • 35. Automatic translation? CC BY-SA • First experiments with the European Commission’s eTranslation service, on curated (static) content • So far it needs to be better quality, or applied to other Europeana areas, like object display or search Result ranked 1: French Result ranked 2: Spanish Result ranked 3: Polish search results query Search only in corresponding language Approach 1 Query translation Approach 2 Metadata/conten t translation
  • 36. Title here CC BY-SACC BY-SA Title here CC BY-SA Name of image | Creator Providing organization| Country, licence Name of image | Creator Providing organization| Country, licence antoine.isaac@europeana.eu @antoine_isaac