SlideShare une entreprise Scribd logo
1  sur  22
Connecting European
archaeology datasets:
prospects and challenges
Kate Fernie, 2Culture Associates
Big Data in Archaeology: Practicalities and Possibilities
27-28 March 2019
• CARARE
• A brief history
• Datasets and their diversity
• Metadata and schemas
• Challenges
• Possibilities
Introduction
CARARE
Connecting Archaeology and Architecture in Europe
• Began as an EU-funded best practice network in 2010
• Established as a membership association in 2016
• Objective: Advancing professional practice and fostering appreciation
of the digital archaeological and architectural heritage
• Areas:
• Good practices, advice and guidance
• Services to enable data sharing
• CARARE metadata schema
• Promoting re-use
http://www.carare.eu/
Steps on the way to CARARE
• A shared vision
• International collaborations on
heritage data (CIDOC, Arena,
Acquarelle, DARIAH, INSPIRE,
Europeana, etc.)
• Digitisation and use of digital
technologies
• GIS
• Technical infrastructures
A brief history
Who is collecting archaeological and architectural heritage data?
• State agencies
• inventories of protected sites, monuments and buildings
• conservation records, field investigations, surveys
• Museums – finds and excavation archives
• Research Institutions & researchers
• Libraries
Datasets
Image: Swedish National Heritage Board
CARARE and related projects have aggregated over 6 million digital
objects from 20+ countries for Europeana.eu
Many different types of object
• Inventory records, reports, photographs, drawings, books, videos, objects,
aerial photos, GIS datasets, 3D datasets, models, reconstructions, and more
Many different ways of recording objects
• Heritage agencies, museums, archives, libraries, researchers all have
different ways of describing objects
Many different languages, vocabularies, time periods and map systems
Rather diverse
Tournoi royal de motos à Londres changement
d'une roue de side-car en marche, 1932
Agence de presse Mondial Photo-Presse.
We work with
the metadata
that’s provided
CARARE defined a metadata model for metadata aggregation
• Standards based: CIDOC core standards, MIDAS Heritage, LIDO and EDM
• Distinguishes between “heritage assets” (monument, building, painting, book,
image, film, 3D) and digital representations found online
• Allows for events (field activities, lab work) and collections
• Supports objects that are composed of other objects (complexes and
hierarchies)
• Is rich where the domain calls for it (e.g. time, space, monument character)
The schema meets a need to mediate between native data (exports) and enable
their transformation into a common format
Combining datasets
Let’s see an example
MINT
• Metadata mapping (from
native to target schema)
• Preview
• Statistics
• Transformation (to target
schema(s))
Rijksdienst voor het Cultureel Erfgoed:
Rijsmonmumenten
Making connections
Heritage asset
Has
representation
Images: Instituto Universitario de Investigación en Arqueología Ibérica
“Hornos de Peal, Jaén”
Has
representation
is related
Relationships between the main CARARE classes:
• Heritage asset, digital resources and events
Has Met
Enriching metadata during mapping
Heritage asset
Images: Instituto Universitario de Investigación en Arqueología Ibérica
“Hornos de Peal, Jaén”
<car:heritageAssetType>http://vocab.getty.edu/aat/300054328</car:heritageAssetType>
<car:heritageAssetType>http://vocab.getty.edu/aat/300000810</car:heritageAssetType>
<car:heritageAssetType>http://vocab.getty.edu/aat/300305500</car:heritageAssetType>
Adding constants: LOD
AAT concepts
<car:heritageAssetType lang="es">Necrópolis</car:heritageAssetType>
Languages identification
Mapping the metadata gives an opportunity to
make some simple enrichments, by adding:
• Language of the metadata
• Name of the provider
• Country of provider
There’s a difference between doing a schema mapping and a mapping to
transform real data.
Data issues can include:
• Data that doesn’t conform entirely to the scope of an element
• Multiple values within a single element (separators)
• Data inserted in mandatory elements (n/a)
• Lack of unique values
A good mapping can address some of these issues, e.g. by splitting
multiple subject concepts into separate elements.
(issues can be fixed at source, but this can be time consuming with datasets that
include hundreds of thousands of records).
Quality issues
Transformation: some semantic gains
Through transformation to a
common schema, we achieve
interoperability between
disparate datasets
 Enabling cross searches
(what, when, where, who)
 Open licencing of the
metadata and APIs enables
reuse in various applications
http://eculturemap.eculturelab.eu/eCulture14m/Map.html?
• Metadata mapping is rarely easy
• Metadata models are complex with subtle difference in world view
• Statistical metrics can show that recording practices diverge and other
quality issues
• Native metadata is designed to serve specific purposes
• Local context, audiences and questions
• Merging metadata from various organisations in different
countries/languages poses special challenges
Some challenges
Aggregators like CARARE enable transformation of metadata into a
common model and have some services to enable further work
• Language labelling
• Adding Linked Open Data
• Automatic enrichment
• Crowdsourcing
Aggregating and enriching
MORe
One of the big challenges in searching across datasets in Europe is
dealing with data in different languages
Linguistic resources and translation tools are increasingly available, but to
work they need first to identify which language is involved
 Language labels are often missing
 Language identification and labelling microservices
Interfaces, displays and search services can adapt to users’ preferred
language and in this way return results which are relevant but which have
been catalogued in unfamiliar languages.
Why add language information to data?
CARARE microservices include:
• Natural language processing techniques to enable subject concepts
and names to be extracted from text
• Geocoding services to add coordinates for named places
• Vocabulary matching services
• Geo conversion, inversion and normalization services
Automated enrichment
Location case study
• Location is important for archaeology but place information is often
missing, especially for content from library, archive and museum
collections
• Automated extraction techniques can identify place names in data, but
place names are not unique
• The process requires quality control
• Crowd sourcing is one way of harnessing the knowledge of individuals
to check the results of automated enrichment and place objects
correctly on the map
• One such service was developed by the LoCloud project
Crowd sourcing
Map tools
The content aggregated by CARARE is in Europeana
Take a look: www.europeana.eu
Is it big data?
• Volume – 2-4 million assets aggregated by CARARE
• Includes the national heritage inventories for several
countries, which are individually quite large datasets
• Europeana includes another 1 million+ assets relevant for
archaeology aggregated by other projects
• Includes museum and library collections, film archives,
newspaper reports
• Quite big?
• New research would be great!
kfernie27@gmail.com
Any questions?
www.carare.eu

Contenu connexe

Tendances

Introduction to CARARE
Introduction to CARAREIntroduction to CARARE
Introduction to CARARECARARE
 
3D in the CARARE Project. Providing Europeana with 3D Content for the Archaeo...
3D in the CARARE Project. Providing Europeana with 3D Content for the Archaeo...3D in the CARARE Project. Providing Europeana with 3D Content for the Archaeo...
3D in the CARARE Project. Providing Europeana with 3D Content for the Archaeo...CARARE
 
3D reconstructions for story telling and understanding
3D reconstructions for story telling and understanding3D reconstructions for story telling and understanding
3D reconstructions for story telling and understandingCARARE
 
CARARE: Connecting Archaeology and Architecture in Europeana
CARARE: Connecting Archaeology and Architecture in EuropeanaCARARE: Connecting Archaeology and Architecture in Europeana
CARARE: Connecting Archaeology and Architecture in EuropeanaCARARE
 
Geographic Information in the Carare and Athena Projects
Geographic Information in the Carare and Athena ProjectsGeographic Information in the Carare and Athena Projects
Geographic Information in the Carare and Athena ProjectsCARARE
 
Metadata, the CARARE aggregation service and 3D ICONS
Metadata, the CARARE aggregation service and 3D ICONSMetadata, the CARARE aggregation service and 3D ICONS
Metadata, the CARARE aggregation service and 3D ICONS3D ICONS Project
 
Sorin Hermon, 'Towards an integrated repository for research and management o...
Sorin Hermon, 'Towards an integrated repository for research and management o...Sorin Hermon, 'Towards an integrated repository for research and management o...
Sorin Hermon, 'Towards an integrated repository for research and management o...3D ICONS Project
 
'Towards an integrated repository for research and management of 3D archaeolo...
'Towards an integrated repository for research and management of 3D archaeolo...'Towards an integrated repository for research and management of 3D archaeolo...
'Towards an integrated repository for research and management of 3D archaeolo...CARARE
 
The Mint Mapping tool
The Mint Mapping toolThe Mint Mapping tool
The Mint Mapping toollocloud
 
Improving Access and Exploitation of 3D Cultural Heritage Data | Anthony Corns
Improving Access and Exploitation of 3D Cultural Heritage Data | Anthony CornsImproving Access and Exploitation of 3D Cultural Heritage Data | Anthony Corns
Improving Access and Exploitation of 3D Cultural Heritage Data | Anthony CornsFARO
 
Europeana Research Panel DH Benelux 2017
Europeana Research Panel DH Benelux 2017Europeana Research Panel DH Benelux 2017
Europeana Research Panel DH Benelux 2017Europeana
 
Potential usage of 3D data and IPR issues, presented by Sheena Basset
Potential usage of 3D data and IPR issues, presented by Sheena BassetPotential usage of 3D data and IPR issues, presented by Sheena Basset
Potential usage of 3D data and IPR issues, presented by Sheena Basset3D ICONS Project
 
Digital Cultural Heritage and the new EU Framework Programme
Digital Cultural Heritage and the new EU Framework ProgrammeDigital Cultural Heritage and the new EU Framework Programme
Digital Cultural Heritage and the new EU Framework Programmelocloud
 
The last mile of 3DIcons: making available 3D contents and their metadata thr...
The last mile of 3DIcons: making available 3D contents and their metadata thr...The last mile of 3DIcons: making available 3D contents and their metadata thr...
The last mile of 3DIcons: making available 3D contents and their metadata thr...3D ICONS Project
 
DYAS: The Greek Research Infrastructure Network for the Humanities
DYAS: The Greek Research Infrastructure Network for the HumanitiesDYAS: The Greek Research Infrastructure Network for the Humanities
DYAS: The Greek Research Infrastructure Network for the Humanitiesariadnenetwork
 
Local content in a Europeana cloud for small & medium content providers
Local content in a Europeana cloud for small & medium content providersLocal content in a Europeana cloud for small & medium content providers
Local content in a Europeana cloud for small & medium content providerslocloud
 
LoCloud: Local Cultural Heritage Online and in the Cloud
LoCloud: Local Cultural Heritage Online and in the CloudLoCloud: Local Cultural Heritage Online and in the Cloud
LoCloud: Local Cultural Heritage Online and in the Cloudlocloud
 
Metadata for 3D models, Sheena Bassett
Metadata for 3D models, Sheena BassettMetadata for 3D models, Sheena Bassett
Metadata for 3D models, Sheena Bassett3D ICONS Project
 

Tendances (20)

Introduction to CARARE
Introduction to CARAREIntroduction to CARARE
Introduction to CARARE
 
3D in the CARARE Project. Providing Europeana with 3D Content for the Archaeo...
3D in the CARARE Project. Providing Europeana with 3D Content for the Archaeo...3D in the CARARE Project. Providing Europeana with 3D Content for the Archaeo...
3D in the CARARE Project. Providing Europeana with 3D Content for the Archaeo...
 
3D reconstructions for story telling and understanding
3D reconstructions for story telling and understanding3D reconstructions for story telling and understanding
3D reconstructions for story telling and understanding
 
CARARE: Connecting Archaeology and Architecture in Europeana
CARARE: Connecting Archaeology and Architecture in EuropeanaCARARE: Connecting Archaeology and Architecture in Europeana
CARARE: Connecting Archaeology and Architecture in Europeana
 
Geographic Information in the Carare and Athena Projects
Geographic Information in the Carare and Athena ProjectsGeographic Information in the Carare and Athena Projects
Geographic Information in the Carare and Athena Projects
 
Metadata, the CARARE aggregation service and 3D ICONS
Metadata, the CARARE aggregation service and 3D ICONSMetadata, the CARARE aggregation service and 3D ICONS
Metadata, the CARARE aggregation service and 3D ICONS
 
Sorin Hermon, 'Towards an integrated repository for research and management o...
Sorin Hermon, 'Towards an integrated repository for research and management o...Sorin Hermon, 'Towards an integrated repository for research and management o...
Sorin Hermon, 'Towards an integrated repository for research and management o...
 
Ariadne Services
Ariadne ServicesAriadne Services
Ariadne Services
 
'Towards an integrated repository for research and management of 3D archaeolo...
'Towards an integrated repository for research and management of 3D archaeolo...'Towards an integrated repository for research and management of 3D archaeolo...
'Towards an integrated repository for research and management of 3D archaeolo...
 
The Mint Mapping tool
The Mint Mapping toolThe Mint Mapping tool
The Mint Mapping tool
 
Improving Access and Exploitation of 3D Cultural Heritage Data | Anthony Corns
Improving Access and Exploitation of 3D Cultural Heritage Data | Anthony CornsImproving Access and Exploitation of 3D Cultural Heritage Data | Anthony Corns
Improving Access and Exploitation of 3D Cultural Heritage Data | Anthony Corns
 
Europeana Research Panel DH Benelux 2017
Europeana Research Panel DH Benelux 2017Europeana Research Panel DH Benelux 2017
Europeana Research Panel DH Benelux 2017
 
Potential usage of 3D data and IPR issues, presented by Sheena Basset
Potential usage of 3D data and IPR issues, presented by Sheena BassetPotential usage of 3D data and IPR issues, presented by Sheena Basset
Potential usage of 3D data and IPR issues, presented by Sheena Basset
 
Digital Cultural Heritage and the new EU Framework Programme
Digital Cultural Heritage and the new EU Framework ProgrammeDigital Cultural Heritage and the new EU Framework Programme
Digital Cultural Heritage and the new EU Framework Programme
 
The last mile of 3DIcons: making available 3D contents and their metadata thr...
The last mile of 3DIcons: making available 3D contents and their metadata thr...The last mile of 3DIcons: making available 3D contents and their metadata thr...
The last mile of 3DIcons: making available 3D contents and their metadata thr...
 
DYAS: The Greek Research Infrastructure Network for the Humanities
DYAS: The Greek Research Infrastructure Network for the HumanitiesDYAS: The Greek Research Infrastructure Network for the Humanities
DYAS: The Greek Research Infrastructure Network for the Humanities
 
Local content in a Europeana cloud for small & medium content providers
Local content in a Europeana cloud for small & medium content providersLocal content in a Europeana cloud for small & medium content providers
Local content in a Europeana cloud for small & medium content providers
 
LoCloud: Local Cultural Heritage Online and in the Cloud
LoCloud: Local Cultural Heritage Online and in the CloudLoCloud: Local Cultural Heritage Online and in the Cloud
LoCloud: Local Cultural Heritage Online and in the Cloud
 
Metadata for 3D models, Sheena Bassett
Metadata for 3D models, Sheena BassettMetadata for 3D models, Sheena Bassett
Metadata for 3D models, Sheena Bassett
 
Introduction to 3D ICONS
Introduction to 3D ICONSIntroduction to 3D ICONS
Introduction to 3D ICONS
 

Similaire à Connecting European Archaeology datasets: prospects and challenges

Fondly Collisions: Archival hierarchy and the Europeana Data Model
Fondly Collisions: Archival hierarchy and the Europeana Data Model   Fondly Collisions: Archival hierarchy and the Europeana Data Model
Fondly Collisions: Archival hierarchy and the Europeana Data Model Valentine Charles
 
LoCloud - Local content in a Europeana cloud
LoCloud - Local content in a Europeana cloudLoCloud - Local content in a Europeana cloud
LoCloud - Local content in a Europeana cloudEuropeana
 
Achille Felicetti "Introduction to the Ariadne winter school and to the ARIAD...
Achille Felicetti "Introduction to the Ariadne winter school and to the ARIAD...Achille Felicetti "Introduction to the Ariadne winter school and to the ARIAD...
Achille Felicetti "Introduction to the Ariadne winter school and to the ARIAD...ariadnenetwork
 
ARIADNE Registry - towards interoperability
ARIADNE Registry - towards interoperabilityARIADNE Registry - towards interoperability
ARIADNE Registry - towards interoperabilityariadnenetwork
 
LoCloud: cloud-based services for local cultural heritage
LoCloud: cloud-based services for local cultural heritageLoCloud: cloud-based services for local cultural heritage
LoCloud: cloud-based services for local cultural heritagelocloud
 
Digital Archiving at the Meertens Institute
Digital Archiving at the Meertens InstituteDigital Archiving at the Meertens Institute
Digital Archiving at the Meertens Institutejuntez
 
LoCloud: Local Content in a Europeana Cloud
LoCloud: Local Content in a Europeana CloudLoCloud: Local Content in a Europeana Cloud
LoCloud: Local Content in a Europeana Cloudlocloud
 
Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Antoine Isaac
 
Data quality in cultural heritage (meta)data
Data quality in cultural heritage (meta)dataData quality in cultural heritage (meta)data
Data quality in cultural heritage (meta)dataValentine Charles
 
Open Data Masterclass - Europeana and LOD
Open Data Masterclass - Europeana and LODOpen Data Masterclass - Europeana and LOD
Open Data Masterclass - Europeana and LODAntoine Isaac
 
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎Libcorpio
 
Introduction to LoCloud
Introduction to LoCloud Introduction to LoCloud
Introduction to LoCloud locloud
 
20141030 LinDA Workshop echallenges2014 - State of the art in open data infra...
20141030 LinDA Workshop echallenges2014 - State of the art in open data infra...20141030 LinDA Workshop echallenges2014 - State of the art in open data infra...
20141030 LinDA Workshop echallenges2014 - State of the art in open data infra...LinDa_FP7
 
The Europeana Community: Semantics and Cultural Heritage Data
The Europeana Community: Semantics and Cultural Heritage DataThe Europeana Community: Semantics and Cultural Heritage Data
The Europeana Community: Semantics and Cultural Heritage DataNuno Freire
 
Introduction to Metadata for IDAH Fellows
Introduction to Metadata for IDAH FellowsIntroduction to Metadata for IDAH Fellows
Introduction to Metadata for IDAH FellowsJenn Riley
 
Evaluation of Schema.org for Aggregation of Cultural Heritage Metadata
Evaluation of Schema.org for Aggregation of Cultural Heritage MetadataEvaluation of Schema.org for Aggregation of Cultural Heritage Metadata
Evaluation of Schema.org for Aggregation of Cultural Heritage MetadataNuno Freire
 
Easter JISC metadata May25 DT
Easter JISC metadata May25 DTEaster JISC metadata May25 DT
Easter JISC metadata May25 DTdstudhope
 
Workshop: Concluding Remarks
Workshop: Concluding RemarksWorkshop: Concluding Remarks
Workshop: Concluding Remarkslocloud
 

Similaire à Connecting European Archaeology datasets: prospects and challenges (20)

Fondly Collisions: Archival hierarchy and the Europeana Data Model
Fondly Collisions: Archival hierarchy and the Europeana Data Model   Fondly Collisions: Archival hierarchy and the Europeana Data Model
Fondly Collisions: Archival hierarchy and the Europeana Data Model
 
LoCloud - Local content in a Europeana cloud
LoCloud - Local content in a Europeana cloudLoCloud - Local content in a Europeana cloud
LoCloud - Local content in a Europeana cloud
 
Achille Felicetti "Introduction to the Ariadne winter school and to the ARIAD...
Achille Felicetti "Introduction to the Ariadne winter school and to the ARIAD...Achille Felicetti "Introduction to the Ariadne winter school and to the ARIAD...
Achille Felicetti "Introduction to the Ariadne winter school and to the ARIAD...
 
ARIADNE Registry - towards interoperability
ARIADNE Registry - towards interoperabilityARIADNE Registry - towards interoperability
ARIADNE Registry - towards interoperability
 
LoCloud: cloud-based services for local cultural heritage
LoCloud: cloud-based services for local cultural heritageLoCloud: cloud-based services for local cultural heritage
LoCloud: cloud-based services for local cultural heritage
 
Digital Archiving at the Meertens Institute
Digital Archiving at the Meertens InstituteDigital Archiving at the Meertens Institute
Digital Archiving at the Meertens Institute
 
LoCloud: Local Content in a Europeana Cloud
LoCloud: Local Content in a Europeana CloudLoCloud: Local Content in a Europeana Cloud
LoCloud: Local Content in a Europeana Cloud
 
Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018
 
Data quality in cultural heritage (meta)data
Data quality in cultural heritage (meta)dataData quality in cultural heritage (meta)data
Data quality in cultural heritage (meta)data
 
Open Data Masterclass - Europeana and LOD
Open Data Masterclass - Europeana and LODOpen Data Masterclass - Europeana and LOD
Open Data Masterclass - Europeana and LOD
 
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
 
Introduction to LoCloud
Introduction to LoCloud Introduction to LoCloud
Introduction to LoCloud
 
20141030 LinDA Workshop echallenges2014 - State of the art in open data infra...
20141030 LinDA Workshop echallenges2014 - State of the art in open data infra...20141030 LinDA Workshop echallenges2014 - State of the art in open data infra...
20141030 LinDA Workshop echallenges2014 - State of the art in open data infra...
 
The Europeana Community: Semantics and Cultural Heritage Data
The Europeana Community: Semantics and Cultural Heritage DataThe Europeana Community: Semantics and Cultural Heritage Data
The Europeana Community: Semantics and Cultural Heritage Data
 
Introduction to Metadata for IDAH Fellows
Introduction to Metadata for IDAH FellowsIntroduction to Metadata for IDAH Fellows
Introduction to Metadata for IDAH Fellows
 
Evaluation of Schema.org for Aggregation of Cultural Heritage Metadata
Evaluation of Schema.org for Aggregation of Cultural Heritage MetadataEvaluation of Schema.org for Aggregation of Cultural Heritage Metadata
Evaluation of Schema.org for Aggregation of Cultural Heritage Metadata
 
Easter JISC metadata May25 DT
Easter JISC metadata May25 DTEaster JISC metadata May25 DT
Easter JISC metadata May25 DT
 
Corrado -- Establishing the Landscape
Corrado -- Establishing the LandscapeCorrado -- Establishing the Landscape
Corrado -- Establishing the Landscape
 
DLCS
DLCSDLCS
DLCS
 
Workshop: Concluding Remarks
Workshop: Concluding RemarksWorkshop: Concluding Remarks
Workshop: Concluding Remarks
 

Plus de CARARE

Europeana 3D
Europeana 3D Europeana 3D
Europeana 3D CARARE
 
Speaking one language: how vocabularies can help organise information
Speaking one language: how vocabularies can help organise informationSpeaking one language: how vocabularies can help organise information
Speaking one language: how vocabularies can help organise informationCARARE
 
Exploiting vocabularies and Linked Data: in practice
Exploiting vocabularies and Linked Data: in practiceExploiting vocabularies and Linked Data: in practice
Exploiting vocabularies and Linked Data: in practiceCARARE
 
3D content in Europeana: the challenges of providing access
3D content in Europeana: the challenges of providing access3D content in Europeana: the challenges of providing access
3D content in Europeana: the challenges of providing accessCARARE
 
Towards data FAIRness
Towards data FAIRnessTowards data FAIRness
Towards data FAIRnessCARARE
 
Archaeology in Europeana’s publishing framework
Archaeology in Europeana’s publishing frameworkArchaeology in Europeana’s publishing framework
Archaeology in Europeana’s publishing frameworkCARARE
 
Archaeology in Europeana quality assurance, enrichment and publishing
Archaeology in Europeana quality assurance, enrichment and publishingArchaeology in Europeana quality assurance, enrichment and publishing
Archaeology in Europeana quality assurance, enrichment and publishingCARARE
 
Carare Membership
Carare MembershipCarare Membership
Carare MembershipCARARE
 
How and why people today engage with the archaeological heritage and scholarl...
How and why people today engage with the archaeological heritage and scholarl...How and why people today engage with the archaeological heritage and scholarl...
How and why people today engage with the archaeological heritage and scholarl...CARARE
 
An introduction to the PARTHENOS guidelines to FAIRify data management and ma...
An introduction to the PARTHENOS guidelines to FAIRify data management and ma...An introduction to the PARTHENOS guidelines to FAIRify data management and ma...
An introduction to the PARTHENOS guidelines to FAIRify data management and ma...CARARE
 
The everyday reality behind the iron curtain
The everyday reality behind the iron curtainThe everyday reality behind the iron curtain
The everyday reality behind the iron curtainCARARE
 
Inspiration from the past
Inspiration from the pastInspiration from the past
Inspiration from the pastCARARE
 
Archaeology in the europeana publishing framework
Archaeology in the europeana publishing frameworkArchaeology in the europeana publishing framework
Archaeology in the europeana publishing frameworkCARARE
 
Sharing New perspectives: overview presentation
Sharing New perspectives: overview presentationSharing New perspectives: overview presentation
Sharing New perspectives: overview presentationCARARE
 
Linking Europe to the Nile: connecting sites, monuments, museums and historic...
Linking Europe to the Nile: connecting sites, monuments, museums and historic...Linking Europe to the Nile: connecting sites, monuments, museums and historic...
Linking Europe to the Nile: connecting sites, monuments, museums and historic...CARARE
 
An archaeological approach to epigraphy: new data on the electoral programata...
An archaeological approach to epigraphy: new data on the electoral programata...An archaeological approach to epigraphy: new data on the electoral programata...
An archaeological approach to epigraphy: new data on the electoral programata...CARARE
 
Updating the Iberians in Europeana, Alberto Sánchez, José A. Tuñón, Carmen Ru...
Updating the Iberians in Europeana, Alberto Sánchez, José A. Tuñón, Carmen Ru...Updating the Iberians in Europeana, Alberto Sánchez, José A. Tuñón, Carmen Ru...
Updating the Iberians in Europeana, Alberto Sánchez, José A. Tuñón, Carmen Ru...CARARE
 
Europeana Collections: Archaeology in Europeana, Nienke van Schaverbeke
Europeana Collections: Archaeology in Europeana, Nienke van SchaverbekeEuropeana Collections: Archaeology in Europeana, Nienke van Schaverbeke
Europeana Collections: Archaeology in Europeana, Nienke van SchaverbekeCARARE
 
HBIM Leinster House, Laser Scan Survey Modelling and Conservation documentati...
HBIM Leinster House, Laser Scan Survey Modelling and Conservation documentati...HBIM Leinster House, Laser Scan Survey Modelling and Conservation documentati...
HBIM Leinster House, Laser Scan Survey Modelling and Conservation documentati...CARARE
 
A presentation of SOCH: Swedish Open Cultural Heritage, Marcus Smith
A presentation of SOCH: Swedish Open Cultural Heritage, Marcus SmithA presentation of SOCH: Swedish Open Cultural Heritage, Marcus Smith
A presentation of SOCH: Swedish Open Cultural Heritage, Marcus SmithCARARE
 

Plus de CARARE (20)

Europeana 3D
Europeana 3D Europeana 3D
Europeana 3D
 
Speaking one language: how vocabularies can help organise information
Speaking one language: how vocabularies can help organise informationSpeaking one language: how vocabularies can help organise information
Speaking one language: how vocabularies can help organise information
 
Exploiting vocabularies and Linked Data: in practice
Exploiting vocabularies and Linked Data: in practiceExploiting vocabularies and Linked Data: in practice
Exploiting vocabularies and Linked Data: in practice
 
3D content in Europeana: the challenges of providing access
3D content in Europeana: the challenges of providing access3D content in Europeana: the challenges of providing access
3D content in Europeana: the challenges of providing access
 
Towards data FAIRness
Towards data FAIRnessTowards data FAIRness
Towards data FAIRness
 
Archaeology in Europeana’s publishing framework
Archaeology in Europeana’s publishing frameworkArchaeology in Europeana’s publishing framework
Archaeology in Europeana’s publishing framework
 
Archaeology in Europeana quality assurance, enrichment and publishing
Archaeology in Europeana quality assurance, enrichment and publishingArchaeology in Europeana quality assurance, enrichment and publishing
Archaeology in Europeana quality assurance, enrichment and publishing
 
Carare Membership
Carare MembershipCarare Membership
Carare Membership
 
How and why people today engage with the archaeological heritage and scholarl...
How and why people today engage with the archaeological heritage and scholarl...How and why people today engage with the archaeological heritage and scholarl...
How and why people today engage with the archaeological heritage and scholarl...
 
An introduction to the PARTHENOS guidelines to FAIRify data management and ma...
An introduction to the PARTHENOS guidelines to FAIRify data management and ma...An introduction to the PARTHENOS guidelines to FAIRify data management and ma...
An introduction to the PARTHENOS guidelines to FAIRify data management and ma...
 
The everyday reality behind the iron curtain
The everyday reality behind the iron curtainThe everyday reality behind the iron curtain
The everyday reality behind the iron curtain
 
Inspiration from the past
Inspiration from the pastInspiration from the past
Inspiration from the past
 
Archaeology in the europeana publishing framework
Archaeology in the europeana publishing frameworkArchaeology in the europeana publishing framework
Archaeology in the europeana publishing framework
 
Sharing New perspectives: overview presentation
Sharing New perspectives: overview presentationSharing New perspectives: overview presentation
Sharing New perspectives: overview presentation
 
Linking Europe to the Nile: connecting sites, monuments, museums and historic...
Linking Europe to the Nile: connecting sites, monuments, museums and historic...Linking Europe to the Nile: connecting sites, monuments, museums and historic...
Linking Europe to the Nile: connecting sites, monuments, museums and historic...
 
An archaeological approach to epigraphy: new data on the electoral programata...
An archaeological approach to epigraphy: new data on the electoral programata...An archaeological approach to epigraphy: new data on the electoral programata...
An archaeological approach to epigraphy: new data on the electoral programata...
 
Updating the Iberians in Europeana, Alberto Sánchez, José A. Tuñón, Carmen Ru...
Updating the Iberians in Europeana, Alberto Sánchez, José A. Tuñón, Carmen Ru...Updating the Iberians in Europeana, Alberto Sánchez, José A. Tuñón, Carmen Ru...
Updating the Iberians in Europeana, Alberto Sánchez, José A. Tuñón, Carmen Ru...
 
Europeana Collections: Archaeology in Europeana, Nienke van Schaverbeke
Europeana Collections: Archaeology in Europeana, Nienke van SchaverbekeEuropeana Collections: Archaeology in Europeana, Nienke van Schaverbeke
Europeana Collections: Archaeology in Europeana, Nienke van Schaverbeke
 
HBIM Leinster House, Laser Scan Survey Modelling and Conservation documentati...
HBIM Leinster House, Laser Scan Survey Modelling and Conservation documentati...HBIM Leinster House, Laser Scan Survey Modelling and Conservation documentati...
HBIM Leinster House, Laser Scan Survey Modelling and Conservation documentati...
 
A presentation of SOCH: Swedish Open Cultural Heritage, Marcus Smith
A presentation of SOCH: Swedish Open Cultural Heritage, Marcus SmithA presentation of SOCH: Swedish Open Cultural Heritage, Marcus Smith
A presentation of SOCH: Swedish Open Cultural Heritage, Marcus Smith
 

Dernier

Call girls Service in Ajman 0505086370 Ajman call girls
Call girls Service in Ajman 0505086370 Ajman call girlsCall girls Service in Ajman 0505086370 Ajman call girls
Call girls Service in Ajman 0505086370 Ajman call girlsMonica Sydney
 
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdfpdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdfJOHNBEBONYAP1
 
Trump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts SweatshirtTrump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts Sweatshirtrahman018755
 
20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdf20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdfMatthew Sinclair
 
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girlsRussian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girlsMonica Sydney
 
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...meghakumariji156
 
Indian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi EscortsIndian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi EscortsMonica Sydney
 
原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查
原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查
原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查ydyuyu
 
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样ayvbos
 
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdfMatthew Sinclair
 
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查ydyuyu
 
Local Call Girls in Seoni 9332606886 HOT & SEXY Models beautiful and charmin...
Local Call Girls in Seoni  9332606886 HOT & SEXY Models beautiful and charmin...Local Call Girls in Seoni  9332606886 HOT & SEXY Models beautiful and charmin...
Local Call Girls in Seoni 9332606886 HOT & SEXY Models beautiful and charmin...kumargunjan9515
 
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...APNIC
 
一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理F
 
Mira Road Housewife Call Girls 07506202331, Nalasopara Call Girls
Mira Road Housewife Call Girls 07506202331, Nalasopara Call GirlsMira Road Housewife Call Girls 07506202331, Nalasopara Call Girls
Mira Road Housewife Call Girls 07506202331, Nalasopara Call GirlsPriya Reddy
 
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查ydyuyu
 
Ballia Escorts Service Girl ^ 9332606886, WhatsApp Anytime Ballia
Ballia Escorts Service Girl ^ 9332606886, WhatsApp Anytime BalliaBallia Escorts Service Girl ^ 9332606886, WhatsApp Anytime Ballia
Ballia Escorts Service Girl ^ 9332606886, WhatsApp Anytime Balliameghakumariji156
 
Best SEO Services Company in Dallas | Best SEO Agency Dallas
Best SEO Services Company in Dallas | Best SEO Agency DallasBest SEO Services Company in Dallas | Best SEO Agency Dallas
Best SEO Services Company in Dallas | Best SEO Agency DallasDigicorns Technologies
 
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdfMatthew Sinclair
 
Meaning of On page SEO & its process in detail.
Meaning of On page SEO & its process in detail.Meaning of On page SEO & its process in detail.
Meaning of On page SEO & its process in detail.krishnachandrapal52
 

Dernier (20)

Call girls Service in Ajman 0505086370 Ajman call girls
Call girls Service in Ajman 0505086370 Ajman call girlsCall girls Service in Ajman 0505086370 Ajman call girls
Call girls Service in Ajman 0505086370 Ajman call girls
 
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdfpdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
 
Trump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts SweatshirtTrump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts Sweatshirt
 
20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdf20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdf
 
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girlsRussian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
 
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...
 
Indian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi EscortsIndian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
 
原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查
原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查
原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查
 
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
 
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
 
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
 
Local Call Girls in Seoni 9332606886 HOT & SEXY Models beautiful and charmin...
Local Call Girls in Seoni  9332606886 HOT & SEXY Models beautiful and charmin...Local Call Girls in Seoni  9332606886 HOT & SEXY Models beautiful and charmin...
Local Call Girls in Seoni 9332606886 HOT & SEXY Models beautiful and charmin...
 
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
 
一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理
 
Mira Road Housewife Call Girls 07506202331, Nalasopara Call Girls
Mira Road Housewife Call Girls 07506202331, Nalasopara Call GirlsMira Road Housewife Call Girls 07506202331, Nalasopara Call Girls
Mira Road Housewife Call Girls 07506202331, Nalasopara Call Girls
 
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
 
Ballia Escorts Service Girl ^ 9332606886, WhatsApp Anytime Ballia
Ballia Escorts Service Girl ^ 9332606886, WhatsApp Anytime BalliaBallia Escorts Service Girl ^ 9332606886, WhatsApp Anytime Ballia
Ballia Escorts Service Girl ^ 9332606886, WhatsApp Anytime Ballia
 
Best SEO Services Company in Dallas | Best SEO Agency Dallas
Best SEO Services Company in Dallas | Best SEO Agency DallasBest SEO Services Company in Dallas | Best SEO Agency Dallas
Best SEO Services Company in Dallas | Best SEO Agency Dallas
 
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
 
Meaning of On page SEO & its process in detail.
Meaning of On page SEO & its process in detail.Meaning of On page SEO & its process in detail.
Meaning of On page SEO & its process in detail.
 

Connecting European Archaeology datasets: prospects and challenges

  • 1. Connecting European archaeology datasets: prospects and challenges Kate Fernie, 2Culture Associates Big Data in Archaeology: Practicalities and Possibilities 27-28 March 2019
  • 2. • CARARE • A brief history • Datasets and their diversity • Metadata and schemas • Challenges • Possibilities Introduction
  • 3. CARARE Connecting Archaeology and Architecture in Europe • Began as an EU-funded best practice network in 2010 • Established as a membership association in 2016 • Objective: Advancing professional practice and fostering appreciation of the digital archaeological and architectural heritage • Areas: • Good practices, advice and guidance • Services to enable data sharing • CARARE metadata schema • Promoting re-use http://www.carare.eu/
  • 4. Steps on the way to CARARE • A shared vision • International collaborations on heritage data (CIDOC, Arena, Acquarelle, DARIAH, INSPIRE, Europeana, etc.) • Digitisation and use of digital technologies • GIS • Technical infrastructures A brief history
  • 5. Who is collecting archaeological and architectural heritage data? • State agencies • inventories of protected sites, monuments and buildings • conservation records, field investigations, surveys • Museums – finds and excavation archives • Research Institutions & researchers • Libraries Datasets Image: Swedish National Heritage Board
  • 6. CARARE and related projects have aggregated over 6 million digital objects from 20+ countries for Europeana.eu Many different types of object • Inventory records, reports, photographs, drawings, books, videos, objects, aerial photos, GIS datasets, 3D datasets, models, reconstructions, and more Many different ways of recording objects • Heritage agencies, museums, archives, libraries, researchers all have different ways of describing objects Many different languages, vocabularies, time periods and map systems Rather diverse
  • 7. Tournoi royal de motos à Londres changement d'une roue de side-car en marche, 1932 Agence de presse Mondial Photo-Presse. We work with the metadata that’s provided
  • 8. CARARE defined a metadata model for metadata aggregation • Standards based: CIDOC core standards, MIDAS Heritage, LIDO and EDM • Distinguishes between “heritage assets” (monument, building, painting, book, image, film, 3D) and digital representations found online • Allows for events (field activities, lab work) and collections • Supports objects that are composed of other objects (complexes and hierarchies) • Is rich where the domain calls for it (e.g. time, space, monument character) The schema meets a need to mediate between native data (exports) and enable their transformation into a common format Combining datasets
  • 9. Let’s see an example MINT • Metadata mapping (from native to target schema) • Preview • Statistics • Transformation (to target schema(s)) Rijksdienst voor het Cultureel Erfgoed: Rijsmonmumenten
  • 10. Making connections Heritage asset Has representation Images: Instituto Universitario de Investigación en Arqueología Ibérica “Hornos de Peal, Jaén” Has representation is related Relationships between the main CARARE classes: • Heritage asset, digital resources and events Has Met
  • 11. Enriching metadata during mapping Heritage asset Images: Instituto Universitario de Investigación en Arqueología Ibérica “Hornos de Peal, Jaén” <car:heritageAssetType>http://vocab.getty.edu/aat/300054328</car:heritageAssetType> <car:heritageAssetType>http://vocab.getty.edu/aat/300000810</car:heritageAssetType> <car:heritageAssetType>http://vocab.getty.edu/aat/300305500</car:heritageAssetType> Adding constants: LOD AAT concepts <car:heritageAssetType lang="es">Necrópolis</car:heritageAssetType> Languages identification Mapping the metadata gives an opportunity to make some simple enrichments, by adding: • Language of the metadata • Name of the provider • Country of provider
  • 12. There’s a difference between doing a schema mapping and a mapping to transform real data. Data issues can include: • Data that doesn’t conform entirely to the scope of an element • Multiple values within a single element (separators) • Data inserted in mandatory elements (n/a) • Lack of unique values A good mapping can address some of these issues, e.g. by splitting multiple subject concepts into separate elements. (issues can be fixed at source, but this can be time consuming with datasets that include hundreds of thousands of records). Quality issues
  • 13. Transformation: some semantic gains Through transformation to a common schema, we achieve interoperability between disparate datasets  Enabling cross searches (what, when, where, who)  Open licencing of the metadata and APIs enables reuse in various applications http://eculturemap.eculturelab.eu/eCulture14m/Map.html?
  • 14. • Metadata mapping is rarely easy • Metadata models are complex with subtle difference in world view • Statistical metrics can show that recording practices diverge and other quality issues • Native metadata is designed to serve specific purposes • Local context, audiences and questions • Merging metadata from various organisations in different countries/languages poses special challenges Some challenges
  • 15. Aggregators like CARARE enable transformation of metadata into a common model and have some services to enable further work • Language labelling • Adding Linked Open Data • Automatic enrichment • Crowdsourcing Aggregating and enriching MORe
  • 16. One of the big challenges in searching across datasets in Europe is dealing with data in different languages Linguistic resources and translation tools are increasingly available, but to work they need first to identify which language is involved  Language labels are often missing  Language identification and labelling microservices Interfaces, displays and search services can adapt to users’ preferred language and in this way return results which are relevant but which have been catalogued in unfamiliar languages. Why add language information to data?
  • 17. CARARE microservices include: • Natural language processing techniques to enable subject concepts and names to be extracted from text • Geocoding services to add coordinates for named places • Vocabulary matching services • Geo conversion, inversion and normalization services Automated enrichment
  • 18. Location case study • Location is important for archaeology but place information is often missing, especially for content from library, archive and museum collections • Automated extraction techniques can identify place names in data, but place names are not unique • The process requires quality control • Crowd sourcing is one way of harnessing the knowledge of individuals to check the results of automated enrichment and place objects correctly on the map • One such service was developed by the LoCloud project Crowd sourcing
  • 20. The content aggregated by CARARE is in Europeana Take a look: www.europeana.eu
  • 21. Is it big data? • Volume – 2-4 million assets aggregated by CARARE • Includes the national heritage inventories for several countries, which are individually quite large datasets • Europeana includes another 1 million+ assets relevant for archaeology aggregated by other projects • Includes museum and library collections, film archives, newspaper reports • Quite big? • New research would be great!