SlideShare une entreprise Scribd logo
1  sur  13
Antoine Isaac
Information and networking days
H2020 / Connecting Europe Facility, Jan 15-16, 2014
Europe’s platform to access cultural heritage
Currently
30M objects
Built on descriptive metadata
from a broad, heterogeneous network
Audiovisual
collections
National Aggregators
Regional Aggregators
Archives
Thematic collections
Libraries
Musées
Lausannois
Culture.frThe
European
Library
APEX
European Film
Gateway Europeana Fashion
2,300 galleries, museums, archives and libraries
Accessing items from 36 countries
top 16
Portal interface in 31 languages
Metadata in 33 languages
Serving Europe’s citizens
5M visits on Europeana.eu
7M Facebook impressions
API use…
Content (digital objects on the site of the provider)
Metadata (descriptive object information)
Public Domain
Creative Commons
Licenses
Rights reserved
Orphan work
Facilitating re-use on the legal side
CC
Facilitating re-use on the language side?
Our network needs automatic translation tools
to address information needs all over Europe
Gathering/linking existing multilingual data
Related projects applying NLP tools
E.g., The PATHS project has developed techniques to enrich English
and Spanish collections
1)Identification of key entities
2)Detection of (typed) similarities between objects, using metadata
3)“Background links” to external resources such as Wikipedia
4)Classification of object against a hierarchy of topic
Applying these techniques to other languages would require work
1)requires language-specific tools (PoS tagging, lemmatization)
2)is straightforward to apply to new languages
3)requires language-specific tools
4)depends on (3) and on translation of some topics
http://www.paths-project.eu/eng/Resources/Semantic-Enrichment-of-Cultural-Heritage-content-in-PATHS
Language challenges for Digital Libraries
 Typical queries are very short
Average < 2 terms
 Identification of query language is not easy, even manually
39% of queries may belong to several languages
 Plenty of named entities
60% of queries are for persons & places
Not only is it hard for queries: the same issues apply
to the descriptive metadata
Studies by Humboldt University on Europeana and The European Library
http://www.clef-initiative.eu/documents/71612/86374/CLEF2010wn-LogCLEF-StillerEt2010.pdf
Language processing issues at the scale of
Europe
Thank you!
Antoine Isaac
antoine.isaac@europeana.eu
@EuropeanaEU
Europeana’s vision and mission
 We believe in making cultural heritage openly
accessible in a digital way, to promote the
exchange of ideas and information
 We want to be a catalyst for change in the world of
cultural heritage

Contenu connexe

Tendances

Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13Antoine Isaac
 
Linking data for Europeana
Linking data for EuropeanaLinking data for Europeana
Linking data for EuropeanaAntoine Isaac
 
A portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data caseA portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data caseAntoine Isaac
 
Culture Hack panel SXSW 2013
Culture Hack panel SXSW 2013Culture Hack panel SXSW 2013
Culture Hack panel SXSW 2013Antoine Isaac
 
Europeana, more than data aggregation?
Europeana, more than data aggregation?Europeana, more than data aggregation?
Europeana, more than data aggregation?Antoine Isaac
 
Europeana @ NISO Bibliographic Roadmap Meeting
Europeana @ NISO Bibliographic Roadmap MeetingEuropeana @ NISO Bibliographic Roadmap Meeting
Europeana @ NISO Bibliographic Roadmap MeetingAntoine Isaac
 
AAC Education Session
AAC Education Session AAC Education Session
AAC Education Session Antoine Isaac
 
EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015Antoine Isaac
 
W3C Library Linked Data Incubator Group - 2011
W3C Library Linked Data Incubator Group  - 2011W3C Library Linked Data Incubator Group  - 2011
W3C Library Linked Data Incubator Group - 2011Antoine Isaac
 
Enriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaEnriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaAntoine Isaac
 
Linked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approachLinked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approachValentine Charles
 
Archaeology in Europeana’s publishing framework
Archaeology in Europeana’s publishing frameworkArchaeology in Europeana’s publishing framework
Archaeology in Europeana’s publishing frameworkCARARE
 
EIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open DataEIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open DataAntoine Isaac
 
Semantic Interoperability at Europeana - MultilingualDSIs2018
Semantic Interoperability at Europeana - MultilingualDSIs2018Semantic Interoperability at Europeana - MultilingualDSIs2018
Semantic Interoperability at Europeana - MultilingualDSIs2018Antoine Isaac
 
Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013Antoine Isaac
 
Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Antoine Isaac
 
Olaf Janssen on benefits of collaboration between Europeana and archives duri...
Olaf Janssen on benefits of collaboration between Europeana and archives duri...Olaf Janssen on benefits of collaboration between Europeana and archives duri...
Olaf Janssen on benefits of collaboration between Europeana and archives duri...Olaf Janssen
 
Europeana bergen may2010_dovwiner
Europeana bergen may2010_dovwinerEuropeana bergen may2010_dovwiner
Europeana bergen may2010_dovwinerDov Winer
 
Exploiting vocabularies and Linked Data: in practice
Exploiting vocabularies and Linked Data: in practiceExploiting vocabularies and Linked Data: in practice
Exploiting vocabularies and Linked Data: in practiceCARARE
 
EDM - American Art Collaborative LOD Meeting
EDM - American Art Collaborative LOD MeetingEDM - American Art Collaborative LOD Meeting
EDM - American Art Collaborative LOD MeetingAntoine Isaac
 

Tendances (20)

Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13
 
Linking data for Europeana
Linking data for EuropeanaLinking data for Europeana
Linking data for Europeana
 
A portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data caseA portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data case
 
Culture Hack panel SXSW 2013
Culture Hack panel SXSW 2013Culture Hack panel SXSW 2013
Culture Hack panel SXSW 2013
 
Europeana, more than data aggregation?
Europeana, more than data aggregation?Europeana, more than data aggregation?
Europeana, more than data aggregation?
 
Europeana @ NISO Bibliographic Roadmap Meeting
Europeana @ NISO Bibliographic Roadmap MeetingEuropeana @ NISO Bibliographic Roadmap Meeting
Europeana @ NISO Bibliographic Roadmap Meeting
 
AAC Education Session
AAC Education Session AAC Education Session
AAC Education Session
 
EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015
 
W3C Library Linked Data Incubator Group - 2011
W3C Library Linked Data Incubator Group  - 2011W3C Library Linked Data Incubator Group  - 2011
W3C Library Linked Data Incubator Group - 2011
 
Enriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaEnriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpedia
 
Linked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approachLinked Data for EuropeanaCultural Heritage: the Europeana approach
Linked Data for EuropeanaCultural Heritage: the Europeana approach
 
Archaeology in Europeana’s publishing framework
Archaeology in Europeana’s publishing frameworkArchaeology in Europeana’s publishing framework
Archaeology in Europeana’s publishing framework
 
EIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open DataEIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open Data
 
Semantic Interoperability at Europeana - MultilingualDSIs2018
Semantic Interoperability at Europeana - MultilingualDSIs2018Semantic Interoperability at Europeana - MultilingualDSIs2018
Semantic Interoperability at Europeana - MultilingualDSIs2018
 
Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013
 
Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018
 
Olaf Janssen on benefits of collaboration between Europeana and archives duri...
Olaf Janssen on benefits of collaboration between Europeana and archives duri...Olaf Janssen on benefits of collaboration between Europeana and archives duri...
Olaf Janssen on benefits of collaboration between Europeana and archives duri...
 
Europeana bergen may2010_dovwiner
Europeana bergen may2010_dovwinerEuropeana bergen may2010_dovwiner
Europeana bergen may2010_dovwiner
 
Exploiting vocabularies and Linked Data: in practice
Exploiting vocabularies and Linked Data: in practiceExploiting vocabularies and Linked Data: in practice
Exploiting vocabularies and Linked Data: in practice
 
EDM - American Art Collaborative LOD Meeting
EDM - American Art Collaborative LOD MeetingEDM - American Art Collaborative LOD Meeting
EDM - American Art Collaborative LOD Meeting
 

En vedette

Implementing the Media Fragments URI Specification
Implementing the Media Fragments URI SpecificationImplementing the Media Fragments URI Specification
Implementing the Media Fragments URI SpecificationRaphael Troncy
 
Networked books and networked reading
Networked books and networked readingNetworked books and networked reading
Networked books and networked readingCamille Hartsell
 
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...Raphael Troncy
 
Finding media illustrating events
Finding media illustrating eventsFinding media illustrating events
Finding media illustrating eventsRaphael Troncy
 
Europeana and RDF data validation
Europeana and RDF data validationEuropeana and RDF data validation
Europeana and RDF data validationAntoine Isaac
 
ALA2009_Andy Weissberg (Bowker)
ALA2009_Andy Weissberg (Bowker)ALA2009_Andy Weissberg (Bowker)
ALA2009_Andy Weissberg (Bowker)bisg
 
Progress Report on Government Linked Data Worldwide
Progress Report on Government Linked Data WorldwideProgress Report on Government Linked Data Worldwide
Progress Report on Government Linked Data Worldwide3 Round Stones
 

En vedette (20)

Implementing the Media Fragments URI Specification
Implementing the Media Fragments URI SpecificationImplementing the Media Fragments URI Specification
Implementing the Media Fragments URI Specification
 
Networked books and networked reading
Networked books and networked readingNetworked books and networked reading
Networked books and networked reading
 
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...
 
NISO Annual Report of 2012 Activities
NISO Annual Report of 2012 ActivitiesNISO Annual Report of 2012 Activities
NISO Annual Report of 2012 Activities
 
Finding media illustrating events
Finding media illustrating eventsFinding media illustrating events
Finding media illustrating events
 
Europeana and RDF data validation
Europeana and RDF data validationEuropeana and RDF data validation
Europeana and RDF data validation
 
NISO's Standards Update & Annual Membership Meeting
NISO's Standards Update & Annual Membership MeetingNISO's Standards Update & Annual Membership Meeting
NISO's Standards Update & Annual Membership Meeting
 
ALA2009_Andy Weissberg (Bowker)
ALA2009_Andy Weissberg (Bowker)ALA2009_Andy Weissberg (Bowker)
ALA2009_Andy Weissberg (Bowker)
 
Progress Report on Government Linked Data Worldwide
Progress Report on Government Linked Data WorldwideProgress Report on Government Linked Data Worldwide
Progress Report on Government Linked Data Worldwide
 
Uncork Your Licenses!
Uncork Your Licenses!Uncork Your Licenses!
Uncork Your Licenses!
 
Expanding the Possible: What’s New and Upcoming in Standards and Technologies...
Expanding the Possible: What’s New and Upcoming in Standards and Technologies...Expanding the Possible: What’s New and Upcoming in Standards and Technologies...
Expanding the Possible: What’s New and Upcoming in Standards and Technologies...
 
Carpenter Library Assessment Conference Presentation
Carpenter Library Assessment Conference PresentationCarpenter Library Assessment Conference Presentation
Carpenter Library Assessment Conference Presentation
 
Uncork Your Licenses!
Uncork Your Licenses! Uncork Your Licenses!
Uncork Your Licenses!
 
Lagace Presentation on the NISO Open Access Metadata and Indicators Project a...
Lagace Presentation on the NISO Open Access Metadata and Indicators Project a...Lagace Presentation on the NISO Open Access Metadata and Indicators Project a...
Lagace Presentation on the NISO Open Access Metadata and Indicators Project a...
 
Uncork Your Licenses
Uncork Your LicensesUncork Your Licenses
Uncork Your Licenses
 
Intro D2D Paper ER&L Feb 2015
Intro D2D Paper ER&L Feb 2015Intro D2D Paper ER&L Feb 2015
Intro D2D Paper ER&L Feb 2015
 
Carpenter Update on NISO Altmetrics Initiative at CNI Fall meeting in Washing...
Carpenter Update on NISO Altmetrics Initiative at CNI Fall meeting in Washing...Carpenter Update on NISO Altmetrics Initiative at CNI Fall meeting in Washing...
Carpenter Update on NISO Altmetrics Initiative at CNI Fall meeting in Washing...
 
ER&L SUSHI ALI Feb 2015
ER&L SUSHI ALI Feb 2015ER&L SUSHI ALI Feb 2015
ER&L SUSHI ALI Feb 2015
 
The Infrastructure for Alternative Metrics
The Infrastructure for Alternative MetricsThe Infrastructure for Alternative Metrics
The Infrastructure for Alternative Metrics
 
NISO's Altmetrics Initiative
NISO's Altmetrics InitiativeNISO's Altmetrics Initiative
NISO's Altmetrics Initiative
 

Similaire à Multilingual challenges in Europeana

Rio Info 2009 - Europeana - Bram van der Werf
Rio Info 2009 - Europeana - Bram van der WerfRio Info 2009 - Europeana - Bram van der Werf
Rio Info 2009 - Europeana - Bram van der WerfRio Info
 
Rob Davies : How we got here
Rob Davies : How we got hereRob Davies : How we got here
Rob Davies : How we got hereAccessITplus
 
When Semantics support Multilingual Access to Digital Cultural Heritage - the...
When Semantics support Multilingual Access to Digital Cultural Heritage - the...When Semantics support Multilingual Access to Digital Cultural Heritage - the...
When Semantics support Multilingual Access to Digital Cultural Heritage - the...Valentine Charles
 
Europeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) caseEuropeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) caseAntoine Isaac
 
Natural Europe presentation in Minerva Conference workshop
Natural Europe presentation in Minerva Conference workshopNatural Europe presentation in Minerva Conference workshop
Natural Europe presentation in Minerva Conference workshopNikos Palavitsinis, PhD
 
The European (Digital) Library - Overview and Outlook
The European (Digital) Library - Overview and OutlookThe European (Digital) Library - Overview and Outlook
The European (Digital) Library - Overview and OutlookOlaf Janssen
 
EuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital HeritageEuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital HeritageMax Kaiser
 
Europeana Cloud: The Essential Facts
Europeana Cloud: The Essential FactsEuropeana Cloud: The Essential Facts
Europeana Cloud: The Essential FactsLIBER Europe
 
Europeana 2019 - Connect Communities
Europeana 2019 - Connect CommunitiesEuropeana 2019 - Connect Communities
Europeana 2019 - Connect CommunitiesEuropeana
 
Digital cultural heritage works and object description within the scope of Eu...
Digital cultural heritage works and object description within the scope of Eu...Digital cultural heritage works and object description within the scope of Eu...
Digital cultural heritage works and object description within the scope of Eu...Tolga Çakmak
 
77. newsletter d andrea2012
77. newsletter d andrea201277. newsletter d andrea2012
77. newsletter d andrea2012Andrea D'Andrea
 
CARARE workshop: Europeana research
CARARE workshop: Europeana researchCARARE workshop: Europeana research
CARARE workshop: Europeana researchEuropeana
 
A deer for Europe
A deer for EuropeA deer for Europe
A deer for Europeashtreex
 
Europeana essentials June 2013
Europeana essentials June 2013Europeana essentials June 2013
Europeana essentials June 2013Europeana
 
Europeana Essentials - Latest
Europeana Essentials - LatestEuropeana Essentials - Latest
Europeana Essentials - LatestEuropeana
 
Why join Europeana?
Why join Europeana?Why join Europeana?
Why join Europeana?Europeana
 

Similaire à Multilingual challenges in Europeana (20)

Museums and Europeana
Museums and EuropeanaMuseums and Europeana
Museums and Europeana
 
03 isaac dm2-e14-full
03 isaac dm2-e14-full03 isaac dm2-e14-full
03 isaac dm2-e14-full
 
Rio Info 2009 - Europeana - Bram van der Werf
Rio Info 2009 - Europeana - Bram van der WerfRio Info 2009 - Europeana - Bram van der Werf
Rio Info 2009 - Europeana - Bram van der Werf
 
Europeana in a Research Context
Europeana in a Research ContextEuropeana in a Research Context
Europeana in a Research Context
 
Rob Davies : How we got here
Rob Davies : How we got hereRob Davies : How we got here
Rob Davies : How we got here
 
Europeana and Researchers
Europeana and ResearchersEuropeana and Researchers
Europeana and Researchers
 
When Semantics support Multilingual Access to Digital Cultural Heritage - the...
When Semantics support Multilingual Access to Digital Cultural Heritage - the...When Semantics support Multilingual Access to Digital Cultural Heritage - the...
When Semantics support Multilingual Access to Digital Cultural Heritage - the...
 
Europeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) caseEuropeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) case
 
Natural Europe presentation in Minerva Conference workshop
Natural Europe presentation in Minerva Conference workshopNatural Europe presentation in Minerva Conference workshop
Natural Europe presentation in Minerva Conference workshop
 
The European (Digital) Library - Overview and Outlook
The European (Digital) Library - Overview and OutlookThe European (Digital) Library - Overview and Outlook
The European (Digital) Library - Overview and Outlook
 
EuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital HeritageEuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital Heritage
 
Europeana Cloud: The Essential Facts
Europeana Cloud: The Essential FactsEuropeana Cloud: The Essential Facts
Europeana Cloud: The Essential Facts
 
Europeana 2019 - Connect Communities
Europeana 2019 - Connect CommunitiesEuropeana 2019 - Connect Communities
Europeana 2019 - Connect Communities
 
Digital cultural heritage works and object description within the scope of Eu...
Digital cultural heritage works and object description within the scope of Eu...Digital cultural heritage works and object description within the scope of Eu...
Digital cultural heritage works and object description within the scope of Eu...
 
77. newsletter d andrea2012
77. newsletter d andrea201277. newsletter d andrea2012
77. newsletter d andrea2012
 
CARARE workshop: Europeana research
CARARE workshop: Europeana researchCARARE workshop: Europeana research
CARARE workshop: Europeana research
 
A deer for Europe
A deer for EuropeA deer for Europe
A deer for Europe
 
Europeana essentials June 2013
Europeana essentials June 2013Europeana essentials June 2013
Europeana essentials June 2013
 
Europeana Essentials - Latest
Europeana Essentials - LatestEuropeana Essentials - Latest
Europeana Essentials - Latest
 
Why join Europeana?
Why join Europeana?Why join Europeana?
Why join Europeana?
 

Plus de Antoine Isaac

Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021Antoine Isaac
 
Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021Antoine Isaac
 
Le Cadre de publication d'Europeana
Le Cadre de publication d'EuropeanaLe Cadre de publication d'Europeana
Le Cadre de publication d'EuropeanaAntoine Isaac
 
The Europeana Data Model Principles, community and innovation
The Europeana Data Model  Principles, community and innovationThe Europeana Data Model  Principles, community and innovation
The Europeana Data Model Principles, community and innovationAntoine Isaac
 
Metadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plansMetadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plansAntoine Isaac
 
IIIF and the Europeana mission
IIIF and the Europeana missionIIIF and the Europeana mission
IIIF and the Europeana missionAntoine Isaac
 
Multilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at EuropeanaMultilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at EuropeanaAntoine Isaac
 
Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...Antoine Isaac
 
The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018Antoine Isaac
 
Data scale and diversity issues at Europeana
Data scale and diversity issues at EuropeanaData scale and diversity issues at Europeana
Data scale and diversity issues at EuropeanaAntoine Isaac
 
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data VocabulariesIsaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data VocabulariesAntoine Isaac
 
Modelling and exchanging annotations
Modelling and exchanging annotationsModelling and exchanging annotations
Modelling and exchanging annotationsAntoine Isaac
 
Modelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WSModelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WSAntoine Isaac
 
Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...Antoine Isaac
 
Enrichment and Europeana
Enrichment and EuropeanaEnrichment and Europeana
Enrichment and EuropeanaAntoine Isaac
 

Plus de Antoine Isaac (17)

Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021
 
Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021
 
Le Cadre de publication d'Europeana
Le Cadre de publication d'EuropeanaLe Cadre de publication d'Europeana
Le Cadre de publication d'Europeana
 
The Europeana Data Model Principles, community and innovation
The Europeana Data Model  Principles, community and innovationThe Europeana Data Model  Principles, community and innovation
The Europeana Data Model Principles, community and innovation
 
Metadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plansMetadata aggregation of IIIF Resources at Europeana: status and plans
Metadata aggregation of IIIF Resources at Europeana: status and plans
 
IIIF and the Europeana mission
IIIF and the Europeana missionIIIF and the Europeana mission
IIIF and the Europeana mission
 
Multilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at EuropeanaMultilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at Europeana
 
Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...Lightweight rights modeling and linked data publication for online cultural h...
Lightweight rights modeling and linked data publication for online cultural h...
 
The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018The Europeana Data Model - TPDL2018
The Europeana Data Model - TPDL2018
 
Europeana et IIIF
Europeana et IIIFEuropeana et IIIF
Europeana et IIIF
 
Data scale and diversity issues at Europeana
Data scale and diversity issues at EuropeanaData scale and diversity issues at Europeana
Data scale and diversity issues at Europeana
 
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data VocabulariesIsaac - W3C Data on the Web Best Practices - Data Vocabularies
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
 
Europeana APIs
Europeana APIsEuropeana APIs
Europeana APIs
 
Modelling and exchanging annotations
Modelling and exchanging annotationsModelling and exchanging annotations
Modelling and exchanging annotations
 
Modelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WSModelling annotations for Europeana and related projects - DARIAH-EU WS
Modelling annotations for Europeana and related projects - DARIAH-EU WS
 
Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...Classification schemes, thesauri and other Knowledge Organization Systems - a...
Classification schemes, thesauri and other Knowledge Organization Systems - a...
 
Enrichment and Europeana
Enrichment and EuropeanaEnrichment and Europeana
Enrichment and Europeana
 

Dernier

SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 

Dernier (20)

SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 

Multilingual challenges in Europeana

  • 1. Antoine Isaac Information and networking days H2020 / Connecting Europe Facility, Jan 15-16, 2014
  • 2. Europe’s platform to access cultural heritage Currently 30M objects
  • 3. Built on descriptive metadata from a broad, heterogeneous network Audiovisual collections National Aggregators Regional Aggregators Archives Thematic collections Libraries Musées Lausannois Culture.frThe European Library APEX European Film Gateway Europeana Fashion 2,300 galleries, museums, archives and libraries
  • 4. Accessing items from 36 countries top 16 Portal interface in 31 languages Metadata in 33 languages
  • 5. Serving Europe’s citizens 5M visits on Europeana.eu 7M Facebook impressions API use…
  • 6. Content (digital objects on the site of the provider) Metadata (descriptive object information) Public Domain Creative Commons Licenses Rights reserved Orphan work Facilitating re-use on the legal side CC
  • 7. Facilitating re-use on the language side? Our network needs automatic translation tools to address information needs all over Europe
  • 9. Related projects applying NLP tools E.g., The PATHS project has developed techniques to enrich English and Spanish collections 1)Identification of key entities 2)Detection of (typed) similarities between objects, using metadata 3)“Background links” to external resources such as Wikipedia 4)Classification of object against a hierarchy of topic Applying these techniques to other languages would require work 1)requires language-specific tools (PoS tagging, lemmatization) 2)is straightforward to apply to new languages 3)requires language-specific tools 4)depends on (3) and on translation of some topics http://www.paths-project.eu/eng/Resources/Semantic-Enrichment-of-Cultural-Heritage-content-in-PATHS
  • 10. Language challenges for Digital Libraries  Typical queries are very short Average < 2 terms  Identification of query language is not easy, even manually 39% of queries may belong to several languages  Plenty of named entities 60% of queries are for persons & places Not only is it hard for queries: the same issues apply to the descriptive metadata Studies by Humboldt University on Europeana and The European Library http://www.clef-initiative.eu/documents/71612/86374/CLEF2010wn-LogCLEF-StillerEt2010.pdf
  • 11. Language processing issues at the scale of Europe
  • 13. Europeana’s vision and mission  We believe in making cultural heritage openly accessible in a digital way, to promote the exchange of ideas and information  We want to be a catalyst for change in the world of cultural heritage

Notes de l'éditeur

  1. Les Miserables: Victor Hugo’s handwritten manuscripts: http://www.europeana.eu/portal/record/9200103/5372912AF66AB529E188218BC1F747E75EB1A18F.html BnF, public domain Matisse ‘53 in the form of a double helix’ http://www.europeana.eu/portal/record/9200104/F8D60AB9136C8A59B59DF1CFEC278A6CABA8B0C6.htmlThe Wellcome Library (CC-BY-NC-ND) ‘söprűtánc’ – Hungarian traditional dance http://www.europeana.eu/portal/record/08901/E1A7B01BE4AED87FD239672F4F3941F52262D6B2.html Hungarian Academy of Sciences Institute for Musicology, public domain ‘Neurologico reggae’ Music album http://www.europeana.eu/portal/record/08901/ADC241BCBF8470988DBA6EEAFCF13F14D88E5534.html DISMARC – EuropeanaConnect Paid Access ‘Castle of Kavala’ 3D exploration of a Greek castle http://www.europeana.eu/portal/record/2020703/05607B24D15BD516EE2B765F74CDA39C7427F7FB.html Cultural and Educational Technology Institute - Research Centre Athen CARARE CC-BY-NC-ND
  2. All partners send us descriptions of their assets, which we aggregate in a single service
  3. Germany 15.44% France 10.97% Netherlands 9.67% Sweden 9.44% Spain 9.98% UK 6.98% Norway 6.60% Italy 5.4% Ireland 4.04% Poland 4.02% Europe 3.95% Finland 2.95% Austria 2.05% Belgium 1.61% Hungary 1.26%
  4. http://www.clef-initiative.eu/documents/71612/86374/CLEF2010wn-LogCLEF-StillerEt2010.pdf
  5. Users from everywhere Data from everywhere Tools from everywhere http://europeana.eu/portal/record/2022347/B7C7D15C23C28EFD3FA25147ED3A580757CFBB04.html http://europeana.eu/portal/record/9200103/ark__12148_btv1b6921004c.html