3. Agenda
The European Library
Datasets life-cycle and workflows
Content ingestion questionnaire
Ingestions tools
Aggregation and delivery to Europeana
Europeana Data Model (EDM)
Full-text index
Questions
4. 48 National Libraries
~ 40 Research and University Libraries
~ 115 Million Bibliographic Records
> 16 Million Digital Objects
> 25 Million Pages of Full-text
The European Library
7. The European Library
Data access point for researchers
Combination of bibliographic records and
metadata for digital objects
Aggregator for Europeana Cloud
9. The European Library
Ingestion Workflow
Content ingestion questionnaire
Scheduling of ingestion
Datasets ready for harvesting
Create case in CRM: case # to provider
Harvesting metadata
Enhance metadata (VIAF, Geonames, MACS,...)
Indexing in acceptance portal
E-mail to provider to accept dataset
Live index = live portal
Delivery to Europeana
Enhancing and publishing in Europeana
10. Content Ingestion Questionnaire
Web-form
Personal Information
(about the person filling the web-form)
Name & surname
Job title
E-mail address
Skype address
Information about Organization
Organization name
Country
Website
Type of institution
11. Content Ingestion Questionnaire
Harvesting Details
Which protocol will be used to transfer data?
OAI-PMH
File
Z39,50
FTP
HTTP
Harvesting time and dates preferences
How often dataset(s) will need to be updated?
Weekly
Monthly
Quarterly
Annually
On demand
12. Content Ingestion Questionnaire
Information about dataset(s)
Number of dataset(s) to be ingested
Number of records to be expected
Number of digital objects to be expected
Contact person(s) per dataset(s)
Editorial: for collection description
Technical: for collection ingestion
13. Content Ingestion Questionnaire
Information about Metadata
Metadata standard(s) available to describe objects
Marc21
MarcXchange
Unimarc
ESE
EDM
METS
MODS
OAI_DC
TEI
Number of formats available per dataset
14. Content Ingestion Questionnaire
Information about Metadata
Are the metadata ready?
If yes, for which dataset(s)
If not, when will they be ready?
Type of digital objects per dataset(s)
TEXT
IMAGE
AUDIO
VIDEO
15. Content Ingestion Questionnaire
Information about Content
Will content be delivered in addition to
metadata?
If yes, for which dataset(s)?
If yes, in which format(s)?
Has the content been digitized?
If yes, for which dataset(s)?
If not, when will the content be available?
16. Content Ingestion Questionnaire
Information about Authority
Will authority files be delivered?
If yes, for which dataset(s)?
If yes, in which format(s)?
Are controlled vocabularies utilized?
If yes, which kind?
• Classification
• Thesauri
• Subject Headings
• Other
If yes, for which dataset(s)?
Will full-text be delivered?
If yes, for which dataset(s)?
If yes, in which format(s)?
20. In SugarCRM
Organizations, contacts, datasets, project and
more
SugarCRM is utilized for
Collections control
Ingestion plans
Automated reports
Cases per specific datasets
SugarCRM
Customer Relation Management
tool
23. Dataset in Acceptance Portal
Acceptance Portal
Test environment
Providers to validate data
Reports via UIM workflows
Link Validation
Field Validation
25. When Dataset in Acceptance Portal
Create an account on
http://www.theeuropeanlibrary.org/
Use credential to log-in in acceptance
http://www.tel.ulcc.ac.uk/acceptance/
Validate data using tabs for
Default
XML
27. Dataset(s) in Live Index and Portal
When a provider accepts dataset(s)
E-mail
Dataset(s) ready for live index
Dataset(s) ready for Europeana
Dataset(s) indexed into the live portal
It takes ~ 24 hrs for dataset(s) to be
searchable into the live portal
28. Dataset(s) Live in Europeana
When a provider accepts dataset(s)
Dataset(s) delivered to Europeana
Europeana publishes live once a month
Delivery deadline ~ 21 of each month
Dataset(s) searchable in Europeana by
following month
Dataset(s) published live in Europeana
E-mail to provider with link to dataset(s)
into Europeana portal
29. EDM – Europeana Data Model
Europeana Libraries project
EDM for library data
Europeana Cloud Project
EDM for museum and archive metadata &
content
Delivery in EDM to Europeana
VIAF: Virtual International Authority File GeoNames: geographical database MACS: to retrieve records with subjects in multiple languages LCSH: Library of Congress Subject Headings