Representation and Absence in Digital
Resources: The Case of Europeana
Newspapers
Alastair Dunning, The European Library, @alastairdunning
Clemens Neudecker, National Library of the Netherlands,
@cneudecker
DH2014, Lausanne
Source: Europeana Strategic Plan, 2015-2020, currently unpublished. See also Enumerate Project, enumerate.eu
The estimated total cost of digitising
the collections of Europe’s
museums, archives and libraries,
including the audiovisual material
they hold is approximately €100bn,
or €10bn per annum for the next 10
years, factoring in
a cumulative efficiency gain of 0.5%
per annum.
The Research & Development
Budget for the Joint Strike Fighter
programme is estimated at
€40.34bn.
It would cost between 10% and 40%
of the Joint Strike Fighter R&D
budget to digitise every eligible title
in Europe’s libraries
Source: Nick Poole, Collections Trust,
http://nickpoole.org.uk/wp-content/uploads/2011/12/digiti_report.pdf
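The comparison above is plain arithmetic and can be checked in a few lines. This is only a sketch using the figures quoted on the slides; the constant and function names are illustrative and do not come from the source.

```python
# Figures quoted on the slides, in billions of euros.
TOTAL_DIGITISATION_COST_BN = 100.0  # all museum, archive and library collections
JSF_RD_BUDGET_BN = 40.34            # Joint Strike Fighter R&D budget


def share_of_jsf_budget(fraction: float) -> float:
    """Return the given fraction of the JSF R&D budget, in EUR bn."""
    return JSF_RD_BUDGET_BN * fraction


# Range quoted for digitising every eligible library title: 10% to 40%.
library_low_bn = share_of_jsf_budget(0.10)
library_high_bn = share_of_jsf_budget(0.40)

# How many JSF R&D budgets the full EUR 100bn programme would amount to.
jsf_multiples = TOTAL_DIGITISATION_COST_BN / JSF_RD_BUDGET_BN

print(f"Library titles: EUR {library_low_bn:.2f}bn to EUR {library_high_bn:.2f}bn")
print(f"Full programme: about {jsf_multiples:.1f} times the JSF R&D budget")
```

On these figures, digitising every eligible library title would cost roughly €4bn to €16bn.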
Currently:
2
million
pages of full text
By 2015:
10
million
pages of
full text
Search by keyword, and
organise by language,
date, source library, title
Link: http://www.theeuropeanlibrary.org/tel4/newspapers
Currently:
Metadata records
relating to
1.12m
issues
By 2015:
Metadata records
relating to up to
4m issues -
Browse by date or map
Link: http://www.theeuropeanlibrary.org/tel4/newspapers
Full text from the following libraries
• Bibliothèque nationale de France / National Library of France
• Koninklijke Bibliotheek / National Library of the Netherlands
• Landesbibliothek Dr. Friedrich Teßmann / Teßmann Library
• Eesti Rahvusraamatukogu / Estonian National Library
• Kansalliskirjasto / National Library of Finland
• Latvijas Nacionala Biblioteka / National Library of Latvia
• Biblioteka Narodowa / National Library of Poland
• Milli Kutuphane Baskanligi / National Library of Turkey
• Österreichische Nationalbibliothek / Austrian National Library
• Staatsbibliothek zu Berlin / Berlin State Library
• Staats- und Universitätsbibliothek Hamburg / Hamburg State and University Library
• Univerzitet u Beogradu / University Library of Belgrade
Searching by title
Issue-level records from the following libraries
• National Library of Wales
• St. Cyril and Methodius National Library / The National Library of Bulgaria
• National Library of the Czech Republic
• National and University Library in Zagreb
• Koninklijke Bibliotheek van België / Bibliothèque royale de Belgique
• Narodna in univerzitetna knjižnica / National and University Library of Slovenia
• National Library of Portugal
• National Library of Romania
• Landsbókasafn Íslands - Háskólabókasafn / National and University Library of Iceland
• National Library of Spain
• Bibliothèque nationale de Luxembourg / National Library of Luxembourg
Finding matching results in
single or multiple issues
Highlighting search terms
So far, okay. Similar functionality to other national and
regional digital libraries of newspapers
See other archives via:
https://www.google.com/maps/ms?msid=217164746645697066594.0004c3d764fcb71ed2314&msa=0
But what was the user response to an aggregation
of European newspaper libraries ?
Results of Usability Testing: http://www.europeana-newspapers.eu/wp-content/uploads/2014/05/The-European-Library-Newspaper-Archive-Usability-testing-Report-April-2014.pdf
Source: http://www.nytimes.com/2007/03/10/business/yourmoney/11archive.html
“Many saying they would be
keen to return to the site as
the content expands.”
“Ability to search over geographic map was
highly valued”
Plenty of quibbles about design
- positions of advanced options
- re-order list of results
- manipulating facets
Much greater expectations of functionality once logged in
For example,
Saved searches
New content notification
“Much of the value of the site to participants was provided by the
images of the documents.
Participants expected to be able to save a 'local' copy once they
had located content of relevance.
As no download facility is provided, this led to some frustration
and undermined the overall potential value of the site for some
participants.”
Timetable for rest of project
Now – Prototype version of interface shared with project
Throughout 2014 - Ongoing creation of OCR, and other
related technical work (OLR, Named Entities)
Throughout 2014 – Live version of website improved /
usability testing / added content
Autumn 2014 - Final project conference
Late 2014 - Newspaper browser completed with content and
tools from project
More information at
http://www.europeana-newspapers.eu/
Interface at
http://www.theeuropeanlibrary.org/tel4/newspapers/
Things the users didn’t say
(but we thought they would)
Why can’t I edit the text ?
(Our sample was researchers; maybe other communities are more
interested in crowdsourcing?)
Note: If time permits, The European Library will develop a
crowdsourcing feature
Source: Europeana Strategic Plan, 2015-2020, currently unpublished. See also Enumerate Project, enumerate.eu
Number of digitised pages in interface: c.2m
Number of digitised pages in European libraries: c.130m
Number of physical pages in European libraries: 1.5bn+
Source: European Newspaper Survey Report
http://www.europeana-newspapers.eu/wp-content/uploads/2012/04/D4.1-Europeana-newspapers-survey-report.pdf
Quantities of newspapers – a) in project b) digitised in total c) in
physical libraries
The project digital library is only a fraction of the newspaper
archive of the continent, indeed the world
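To put "only a fraction" into numbers, the three page counts quoted earlier can be turned into percentages. A minimal sketch, assuming the rounded figures from the slide; since the physical total is given as "1.5bn+", the shares measured against it are upper bounds. All names are illustrative.

```python
# Approximate page counts quoted on the slide.
PAGES_IN_INTERFACE = 2_000_000
PAGES_DIGITISED_IN_LIBRARIES = 130_000_000
PHYSICAL_PAGES_IN_LIBRARIES = 1_500_000_000  # "1.5bn+", so treat as a lower bound


def percentage(part: int, whole: int) -> float:
    """Return part as a percentage of whole."""
    return 100.0 * part / whole


interface_vs_digitised = percentage(PAGES_IN_INTERFACE, PAGES_DIGITISED_IN_LIBRARIES)
interface_vs_physical = percentage(PAGES_IN_INTERFACE, PHYSICAL_PAGES_IN_LIBRARIES)
digitised_vs_physical = percentage(PAGES_DIGITISED_IN_LIBRARIES, PHYSICAL_PAGES_IN_LIBRARIES)

print(f"Interface / digitised pages: {interface_vs_digitised:.2f}%")
print(f"Interface / physical pages:  {interface_vs_physical:.2f}%")
print(f"Digitised / physical pages:  {digitised_vs_physical:.2f}%")
```

On these figures the interface holds about 1.5% of what libraries have digitised and about 0.13% of what they hold on paper; less than 9% of the physical pages have been digitised at all.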
As libraries, how should we represent that
absence to users ?
Should such absence be represented in the
interface itself ?
Vast
white
spaces in
the list of
results ?
….. Difficult to represent
‘archival gaps’ when seen in
the context of how little has
been digitised - creates a
needle in the haystack ….
Standardised information for
every digital resource for
representing collections,
extent of content, licensing
and re-use conditions
Standardised information? For
every digital resource
produced in the world ?
Are you kidding ?
Charts and graphs external to the interface ?
Graphs are the most obvious way of adding context
but still very reliant on the library producing such
charts
How to derive a representative
(random) sample from a digital
collection?
Source: http://dilbert.com/strips/comic/2001-10-25/
Pieter Francois, winner of BL
Labs competition 2013:
“How representative are the
historical texts humanities
scholars study of the overall
body of ‘surviving’ texts that
are held in the various
library collections?”
labs.bl.uk/Sample+
Generator
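One common way to draw a uniform random sample from a collection too large to load at once is reservoir sampling. The sketch below is an illustration of that general technique, not the method used by the BL Labs Sample Generator; the function name and the page identifiers are hypothetical.

```python
import random
from typing import Iterable, List, Optional, TypeVar

T = TypeVar("T")


def reservoir_sample(items: Iterable[T], k: int, seed: Optional[int] = None) -> List[T]:
    """Draw k items uniformly at random from a stream of unknown length.

    Uses reservoir sampling (Algorithm R): one pass, O(k) memory.
    If the stream has fewer than k items, all of them are returned.
    """
    if k < 0:
        raise ValueError("k must be non-negative")
    rng = random.Random(seed)  # a fixed seed makes the sample reproducible
    sample: List[T] = []
    for index, item in enumerate(items):
        if index < k:
            # Fill the reservoir with the first k items.
            sample.append(item)
        else:
            # Keep the new item with probability k / (index + 1).
            slot = rng.randint(0, index)  # inclusive on both ends
            if slot < k:
                sample[slot] = item
    return sample


# Hypothetical identifiers standing in for pages in a digital collection.
page_ids = (f"page-{n}" for n in range(10_000))
print(reservoir_sample(page_ids, 5, seed=2014))
```

A sample drawn this way is representative only of what has been digitised; it says nothing about the material that is absent from the collection.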
There are other issues in the project content too
Major issues
• OCR quality varies
• Different licensing statements from different countries
• Copyright date boundaries differ in each country
There are other issues in the interface too
Minor issues
• Some pages (2m by 2015) have article segmentation
• Some library content has named entity extraction, affecting search results
Source: http://homepages.inf.ed.ac.uk/balex/publications/slides-DATeCH.pdf
10M pages, 7 billion words – how
much you are actually ignoring
when using only the “good” OCR
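How much text is ignored depends on where the quality cut-off is set. As a rough sketch, assuming each page comes with a word count and an OCR confidence score between 0 and 1 (a hypothetical data shape, with made-up example values), the ignored share of words can be computed like this:

```python
from typing import Iterable, Tuple

# Each page is (word_count, ocr_confidence); the values are invented for illustration.
Page = Tuple[int, float]


def share_of_words_ignored(pages: Iterable[Page], min_confidence: float) -> float:
    """Return the fraction of all words on pages below the confidence cut-off."""
    total_words = 0
    kept_words = 0
    for word_count, confidence in pages:
        total_words += word_count
        if confidence >= min_confidence:
            kept_words += word_count
    if total_words == 0:
        return 0.0
    return 1.0 - kept_words / total_words


example_pages = [(1000, 0.95), (800, 0.60), (1200, 0.85)]
ignored = share_of_words_ignored(example_pages, min_confidence=0.80)
print(f"Ignored when keeping only 'good' OCR: {ignored:.1%}")
```

In this invented example one page in three falls below the cut-off, and with it 800 of the 3,000 words.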
How should we allow users better ways to
understand the digital library ?
What role can the API play in this?
Can opening up the data in the digital library allow it to be
explored in different ways ?
Traditional Model
Interface (Created by Library)
Data (Published by Library)

With an API
Interface (Created by Third Party)
Data (Published by Library)
API – Application Programming Interface
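In the API model above, a third party takes the records the library publishes and builds its own view of them. The sketch below shows the idea with an invented JSON response; the schema ("records", "title", "date") is an assumption for illustration and does not describe The European Library's or Trove's actual API. It counts issues per year and lists the years with nothing at all, which is one simple way of making absence visible.

```python
import json
from collections import Counter
from typing import Dict, List

# Hypothetical API response; real library APIs define their own schemas.
SAMPLE_RESPONSE = """
{"records": [
  {"title": "Example Gazette", "date": "1914-08-01"},
  {"title": "Example Gazette", "date": "1914-08-02"},
  {"title": "Example Courier", "date": "1916-01-15"}
]}
"""


def issues_per_year(payload: str) -> Dict[int, int]:
    """Count newspaper issues per year from a JSON payload of records."""
    records = json.loads(payload)["records"]
    counts = Counter(int(record["date"][:4]) for record in records)
    return dict(sorted(counts.items()))


def years_without_issues(yearly_counts: Dict[int, int]) -> List[int]:
    """List the years inside the covered range for which no issue exists."""
    if not yearly_counts:
        return []
    first, last = min(yearly_counts), max(yearly_counts)
    return [year for year in range(first, last + 1) if year not in yearly_counts]


yearly_counts = issues_per_year(SAMPLE_RESPONSE)
gaps = years_without_issues(yearly_counts)
print("Issues per year:", yearly_counts)
print("Years with no issues:", gaps)
```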
Pioneering work of Trove API
(or rather of Tim Sherratt)
Trove Newspapers statistics
developed by third party, based
on data provided by library
http://wraggelabs.com/shed/trove/graphs/
Interface
(Created by Third Party)
Data
(Published by Library)
Headline Roulette, developed by
third party, based on data
provided by library
http://wraggelabs.com/shed/headline-
roulette/
Interface
(Created by Third Party)
Data
(Published by Library)
Word Count of Articles, developed
by third party, based on data
provided by library
http://dhistory.org/frontpages/53/words/
Interface
(Created by Third Party)
Data
(Published by Library)
Sounds great !
But … ?
How many people in this audience would know
how to build an interface on top of an API?
How many users do you know who could
build on top of an API ?
Credits
Desert: https://www.flickr.com/photos/aigle_dore/5952236932/sizes/l
Borges Sign: https://www.flickr.com/photos/monceau/7705020640/
Map: http://gallica.bnf.fr/ark:/12148/btv1b530299707
Strike Fighter: http://en.wikipedia.org/wiki/Strike_fighter

More Related Content

What's hot

Challenges and solutions in creating a european historic newspapers browser
Challenges and solutions in creating a european historic newspapers browser Challenges and solutions in creating a european historic newspapers browser
Challenges and solutions in creating a european historic newspapers browser Europeana Newspapers
 
British Library Labs - Open University Presentation - 3 April 2014, 1100-1200
British Library Labs - Open University Presentation - 3 April 2014, 1100-1200British Library Labs - Open University Presentation - 3 April 2014, 1100-1200
British Library Labs - Open University Presentation - 3 April 2014, 1100-1200labsbl
 
Enriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaEnriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaAntoine Isaac
 
EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015Antoine Isaac
 
BHL-Europe_MINERVA_20111116_hrainer
BHL-Europe_MINERVA_20111116_hrainerBHL-Europe_MINERVA_20111116_hrainer
BHL-Europe_MINERVA_20111116_hrainerHeimo Rainer
 
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015Antoine Isaac
 
Multilingual challenges in Europeana
Multilingual challenges in EuropeanaMultilingual challenges in Europeana
Multilingual challenges in EuropeanaAntoine Isaac
 
Use Cases From Digital Humanities for Library Linked Data
Use Cases From Digital Humanities for Library Linked DataUse Cases From Digital Humanities for Library Linked Data
Use Cases From Digital Humanities for Library Linked DataNuno Freire
 
Stiller & Király, Multilinguality of Metadata
Stiller & Király, Multilinguality of MetadataStiller & Király, Multilinguality of Metadata
Stiller & Király, Multilinguality of MetadataPéter Király
 
How to read a million books?
How to read a million books?How to read a million books?
How to read a million books?cneudecker
 
Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013Antoine Isaac
 
British Library Labs - Presentation at the University of Nottingham - Digital...
British Library Labs - Presentation at the University of Nottingham - Digital...British Library Labs - Presentation at the University of Nottingham - Digital...
British Library Labs - Presentation at the University of Nottingham - Digital...labsbl
 
Charper.lawdi.20130531
Charper.lawdi.20130531Charper.lawdi.20130531
Charper.lawdi.20130531charper
 
Launch of Welsh Newspapers Online
Launch of Welsh Newspapers OnlineLaunch of Welsh Newspapers Online
Launch of Welsh Newspapers OnlineAlastair Dunning
 
Future Directions of the European Library
Future Directions of the European LibraryFuture Directions of the European Library
Future Directions of the European LibraryAlastair Dunning
 
A portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data caseA portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data caseAntoine Isaac
 
Europeana, more than data aggregation?
Europeana, more than data aggregation?Europeana, more than data aggregation?
Europeana, more than data aggregation?Antoine Isaac
 
Digital Libraries: Local and Global
Digital Libraries: Local and GlobalDigital Libraries: Local and Global
Digital Libraries: Local and GlobalAlastair Dunning
 
Open ONI and IIIF: NDNP data in an IIIF Viewer
Open ONI and IIIF: NDNP data in an IIIF ViewerOpen ONI and IIIF: NDNP data in an IIIF Viewer
Open ONI and IIIF: NDNP data in an IIIF ViewerKaren Estlund
 

What's hot (20)

Challenges and solutions in creating a european historic newspapers browser
Challenges and solutions in creating a european historic newspapers browser Challenges and solutions in creating a european historic newspapers browser
Challenges and solutions in creating a european historic newspapers browser
 
British Library Labs - Open University Presentation - 3 April 2014, 1100-1200
British Library Labs - Open University Presentation - 3 April 2014, 1100-1200British Library Labs - Open University Presentation - 3 April 2014, 1100-1200
British Library Labs - Open University Presentation - 3 April 2014, 1100-1200
 
Enriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaEnriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpedia
 
EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015
 
BHL-Europe_MINERVA_20111116_hrainer
BHL-Europe_MINERVA_20111116_hrainerBHL-Europe_MINERVA_20111116_hrainer
BHL-Europe_MINERVA_20111116_hrainer
 
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015
Wikidata, a target for Europeana's semantic strategy - GLAM-WIKI 2015
 
Europeana Libraries Review
Europeana Libraries ReviewEuropeana Libraries Review
Europeana Libraries Review
 
Multilingual challenges in Europeana
Multilingual challenges in EuropeanaMultilingual challenges in Europeana
Multilingual challenges in Europeana
 
Use Cases From Digital Humanities for Library Linked Data
Use Cases From Digital Humanities for Library Linked DataUse Cases From Digital Humanities for Library Linked Data
Use Cases From Digital Humanities for Library Linked Data
 
Stiller & Király, Multilinguality of Metadata
Stiller & Király, Multilinguality of MetadataStiller & Király, Multilinguality of Metadata
Stiller & Király, Multilinguality of Metadata
 
How to read a million books?
How to read a million books?How to read a million books?
How to read a million books?
 
Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013Europeana and Schema.org - DC2013
Europeana and Schema.org - DC2013
 
British Library Labs - Presentation at the University of Nottingham - Digital...
British Library Labs - Presentation at the University of Nottingham - Digital...British Library Labs - Presentation at the University of Nottingham - Digital...
British Library Labs - Presentation at the University of Nottingham - Digital...
 
Charper.lawdi.20130531
Charper.lawdi.20130531Charper.lawdi.20130531
Charper.lawdi.20130531
 
Launch of Welsh Newspapers Online
Launch of Welsh Newspapers OnlineLaunch of Welsh Newspapers Online
Launch of Welsh Newspapers Online
 
Future Directions of the European Library
Future Directions of the European LibraryFuture Directions of the European Library
Future Directions of the European Library
 
A portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data caseA portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data case
 
Europeana, more than data aggregation?
Europeana, more than data aggregation?Europeana, more than data aggregation?
Europeana, more than data aggregation?
 
Digital Libraries: Local and Global
Digital Libraries: Local and GlobalDigital Libraries: Local and Global
Digital Libraries: Local and Global
 
Open ONI and IIIF: NDNP data in an IIIF Viewer
Open ONI and IIIF: NDNP data in an IIIF ViewerOpen ONI and IIIF: NDNP data in an IIIF Viewer
Open ONI and IIIF: NDNP data in an IIIF Viewer
 

Similar to Representation and Absence in Digital Resources: The Case of Europeana Newspapers

LIBER, Europeana and the Europeana Newspapers Project
LIBER, Europeana and the Europeana Newspapers ProjectLIBER, Europeana and the Europeana Newspapers Project
LIBER, Europeana and the Europeana Newspapers ProjectEuropeana Newspapers
 
LIBER, Europeana and the Europeana Newspapers Project
LIBER, Europeana and the Europeana Newspapers ProjectLIBER, Europeana and the Europeana Newspapers Project
LIBER, Europeana and the Europeana Newspapers ProjectLIBER Europe
 
The European(a) Newspapers Project
The European(a) Newspapers ProjectThe European(a) Newspapers Project
The European(a) Newspapers ProjectEuropeana Newspapers
 
You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?The European Library
 
ENP Belgrade Workshop Project Overview
ENP Belgrade Workshop Project OverviewENP Belgrade Workshop Project Overview
ENP Belgrade Workshop Project OverviewEuropeana Newspapers
 
EuropeanaLocal: what’s it all about?
EuropeanaLocal: what’s it all about?EuropeanaLocal: what’s it all about?
EuropeanaLocal: what’s it all about?EuropeanaLocal Project
 
What's up, Europeana Newspapers?
What's up, Europeana Newspapers?What's up, Europeana Newspapers?
What's up, Europeana Newspapers?cneudecker
 
GI2012 pekarek-liber
GI2012 pekarek-liberGI2012 pekarek-liber
GI2012 pekarek-liberIGN Vorstand
 
Alastair Dunning, The successes of the Europeana Libraries project, The Europ...
Alastair Dunning, The successes of the Europeana Libraries project, The Europ...Alastair Dunning, The successes of the Europeana Libraries project, The Europ...
Alastair Dunning, The successes of the Europeana Libraries project, The Europ...The European Library
 
Europeana. A Digital Library for the Humanities?
Europeana. A Digital Library for the Humanities?Europeana. A Digital Library for the Humanities?
Europeana. A Digital Library for the Humanities?AubreyMcFato
 
in Europeana and the projects
in Europeana and the projectsin Europeana and the projects
in Europeana and the projectsEuropeanaConnect
 
Europeana Cloud: The Essential Facts
Europeana Cloud: The Essential FactsEuropeana Cloud: The Essential Facts
Europeana Cloud: The Essential FactsLIBER Europe
 
“Virtual Communities in Europe: the cultural mix and how the European Library...
“Virtual Communities in Europe: the cultural mix and how the European Library...“Virtual Communities in Europe: the cultural mix and how the European Library...
“Virtual Communities in Europe: the cultural mix and how the European Library...bridgingworlds2008
 
The Europeana Newspapers Presentation - Cyberspace 2012
The Europeana Newspapers Presentation - Cyberspace 2012The Europeana Newspapers Presentation - Cyberspace 2012
The Europeana Newspapers Presentation - Cyberspace 2012Europeana Newspapers
 
Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02The European Library
 
Europeana Cloud - Alastair Dunning - November 2013
Europeana Cloud - Alastair Dunning - November 2013Europeana Cloud - Alastair Dunning - November 2013
Europeana Cloud - Alastair Dunning - November 2013Europeana
 
Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Antoine Isaac
 
Alastair Dunning, Europeana Cloud: The Project and the Challenges of Assessin...
Alastair Dunning, Europeana Cloud: The Project and the Challenges of Assessin...Alastair Dunning, Europeana Cloud: The Project and the Challenges of Assessin...
Alastair Dunning, Europeana Cloud: The Project and the Challenges of Assessin...The European Library
 

Similar to Representation and Absence in Digital Resources: The Case of Europeana Newspapers (20)

LIBER, Europeana and the Europeana Newspapers Project
LIBER, Europeana and the Europeana Newspapers ProjectLIBER, Europeana and the Europeana Newspapers Project
LIBER, Europeana and the Europeana Newspapers Project
 
LIBER, Europeana and the Europeana Newspapers Project
LIBER, Europeana and the Europeana Newspapers ProjectLIBER, Europeana and the Europeana Newspapers Project
LIBER, Europeana and the Europeana Newspapers Project
 
The European(a) Newspapers Project
The European(a) Newspapers ProjectThe European(a) Newspapers Project
The European(a) Newspapers Project
 
You've Digitised. What Next ?
You've Digitised. What Next ?You've Digitised. What Next ?
You've Digitised. What Next ?
 
You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?
 
ENP Belgrade Workshop Project Overview
ENP Belgrade Workshop Project OverviewENP Belgrade Workshop Project Overview
ENP Belgrade Workshop Project Overview
 
EuropeanaLocal: what’s it all about?
EuropeanaLocal: what’s it all about?EuropeanaLocal: what’s it all about?
EuropeanaLocal: what’s it all about?
 
What's up, Europeana Newspapers?
What's up, Europeana Newspapers?What's up, Europeana Newspapers?
What's up, Europeana Newspapers?
 
GI2012 pekarek-liber
GI2012 pekarek-liberGI2012 pekarek-liber
GI2012 pekarek-liber
 
Alastair Dunning, The successes of the Europeana Libraries project, The Europ...
Alastair Dunning, The successes of the Europeana Libraries project, The Europ...Alastair Dunning, The successes of the Europeana Libraries project, The Europ...
Alastair Dunning, The successes of the Europeana Libraries project, The Europ...
 
Europeana. A Digital Library for the Humanities?
Europeana. A Digital Library for the Humanities?Europeana. A Digital Library for the Humanities?
Europeana. A Digital Library for the Humanities?
 
in Europeana and the projects
in Europeana and the projectsin Europeana and the projects
in Europeana and the projects
 
Europeana Cloud: The Essential Facts
Europeana Cloud: The Essential FactsEuropeana Cloud: The Essential Facts
Europeana Cloud: The Essential Facts
 
“Virtual Communities in Europe: the cultural mix and how the European Library...
“Virtual Communities in Europe: the cultural mix and how the European Library...“Virtual Communities in Europe: the cultural mix and how the European Library...
“Virtual Communities in Europe: the cultural mix and how the European Library...
 
The Europeana Newspapers Presentation - Cyberspace 2012
The Europeana Newspapers Presentation - Cyberspace 2012The Europeana Newspapers Presentation - Cyberspace 2012
The Europeana Newspapers Presentation - Cyberspace 2012
 
Museums and Europeana
Museums and EuropeanaMuseums and Europeana
Museums and Europeana
 
Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02
 
Europeana Cloud - Alastair Dunning - November 2013
Europeana Cloud - Alastair Dunning - November 2013Europeana Cloud - Alastair Dunning - November 2013
Europeana Cloud - Alastair Dunning - November 2013
 
Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018
 
Alastair Dunning, Europeana Cloud: The Project and the Challenges of Assessin...
Alastair Dunning, Europeana Cloud: The Project and the Challenges of Assessin...Alastair Dunning, Europeana Cloud: The Project and the Challenges of Assessin...
Alastair Dunning, Europeana Cloud: The Project and the Challenges of Assessin...
 

More from TU Delft, Netherlands

The Landscape of Research Data Management
The Landscape of Research Data Management The Landscape of Research Data Management
The Landscape of Research Data Management TU Delft, Netherlands
 
Winning the Tour de France, Research Data and Data Stewardship
Winning the Tour de France, Research Data and Data StewardshipWinning the Tour de France, Research Data and Data Stewardship
Winning the Tour de France, Research Data and Data StewardshipTU Delft, Netherlands
 
Short Presentation on Europeana Cloud at Europeana AGM 2013
Short Presentation on Europeana Cloud at Europeana AGM 2013Short Presentation on Europeana Cloud at Europeana AGM 2013
Short Presentation on Europeana Cloud at Europeana AGM 2013TU Delft, Netherlands
 
Presentation on Europeana Cloud at Internet Librarian Conference 2013
Presentation on Europeana Cloud at Internet Librarian Conference 2013Presentation on Europeana Cloud at Internet Librarian Conference 2013
Presentation on Europeana Cloud at Internet Librarian Conference 2013TU Delft, Netherlands
 
Challenges and Solutions in Creating a European Historic newspapers Browser
Challenges and Solutions in Creating a European Historic newspapers Browser Challenges and Solutions in Creating a European Historic newspapers Browser
Challenges and Solutions in Creating a European Historic newspapers Browser TU Delft, Netherlands
 
Why aggregate European Historic Newspapers
Why aggregate European Historic NewspapersWhy aggregate European Historic Newspapers
Why aggregate European Historic NewspapersTU Delft, Netherlands
 
Europeana Cloud Work Package 1: Assessing Researchers' Needs in the Cloud
Europeana Cloud Work Package 1: Assessing Researchers' Needs in the CloudEuropeana Cloud Work Package 1: Assessing Researchers' Needs in the Cloud
Europeana Cloud Work Package 1: Assessing Researchers' Needs in the CloudTU Delft, Netherlands
 
A general introduction to the Europeana Cloud project
A general introduction to the Europeana Cloud project A general introduction to the Europeana Cloud project
A general introduction to the Europeana Cloud project TU Delft, Netherlands
 
Introduction to Europeana Cloud project
Introduction to Europeana Cloud projectIntroduction to Europeana Cloud project
Introduction to Europeana Cloud projectTU Delft, Netherlands
 
Presentation for Launch of Welsh Newspapers Online
Presentation for Launch of Welsh Newspapers OnlinePresentation for Launch of Welsh Newspapers Online
Presentation for Launch of Welsh Newspapers OnlineTU Delft, Netherlands
 
Presentation on The European Library
Presentation on The European LibraryPresentation on The European Library
Presentation on The European LibraryTU Delft, Netherlands
 

More from TU Delft, Netherlands (16)

The Landscape of Research Data Management
The Landscape of Research Data Management The Landscape of Research Data Management
The Landscape of Research Data Management
 
Winning the Tour de France, Research Data and Data Stewardship
Winning the Tour de France, Research Data and Data StewardshipWinning the Tour de France, Research Data and Data Stewardship
Winning the Tour de France, Research Data and Data Stewardship
 
Europeana and Researchers
Europeana and ResearchersEuropeana and Researchers
Europeana and Researchers
 
Introduction to eCloud
Introduction to eCloudIntroduction to eCloud
Introduction to eCloud
 
Short Presentation on Europeana Cloud at Europeana AGM 2013
Short Presentation on Europeana Cloud at Europeana AGM 2013Short Presentation on Europeana Cloud at Europeana AGM 2013
Short Presentation on Europeana Cloud at Europeana AGM 2013
 
Presentation on Europeana Cloud at Internet Librarian Conference 2013
Presentation on Europeana Cloud at Internet Librarian Conference 2013Presentation on Europeana Cloud at Internet Librarian Conference 2013
Presentation on Europeana Cloud at Internet Librarian Conference 2013
 
Challenges and Solutions in Creating a European Historic newspapers Browser
Challenges and Solutions in Creating a European Historic newspapers Browser Challenges and Solutions in Creating a European Historic newspapers Browser
Challenges and Solutions in Creating a European Historic newspapers Browser
 
Open Data from the European Library
Open Data from the European LibraryOpen Data from the European Library
Open Data from the European Library
 
Why aggregate European Historic Newspapers
Why aggregate European Historic NewspapersWhy aggregate European Historic Newspapers
Why aggregate European Historic Newspapers
 
Europeana Cloud Work Package 1: Assessing Researchers' Needs in the Cloud
Europeana Cloud Work Package 1: Assessing Researchers' Needs in the CloudEuropeana Cloud Work Package 1: Assessing Researchers' Needs in the Cloud
Europeana Cloud Work Package 1: Assessing Researchers' Needs in the Cloud
 
A general introduction to the Europeana Cloud project
A general introduction to the Europeana Cloud project A general introduction to the Europeana Cloud project
A general introduction to the Europeana Cloud project
 
Introduction to Europeana Cloud project
Introduction to Europeana Cloud projectIntroduction to Europeana Cloud project
Introduction to Europeana Cloud project
 
Presentation for Launch of Welsh Newspapers Online
Presentation for Launch of Welsh Newspapers OnlinePresentation for Launch of Welsh Newspapers Online
Presentation for Launch of Welsh Newspapers Online
 
Breaking the Waves
Breaking the WavesBreaking the Waves
Breaking the Waves
 
Presentation on The European Library
Presentation on The European LibraryPresentation on The European Library
Presentation on The European Library
 
The European Library
The European LibraryThe European Library
The European Library
 



Representation and Absence in Digital Resources: The Case of Europeana Newspapers

  • 1. Representation and Absence in Digital Resources: The Case of Europeana Newspapers Alastair Dunning, The European Library, @alastairdunning Clemens Neudecker, National Library of Netherlands, @cneudecker DH2014, Lausanne
  • 2.
  • 3.
  • 4. Source: Europeana Strategic Plan, 2015-2020, currently unpublished. See also Enumerate Project, enumerate.eu
  • 5. The estimated total cost of digitising the collections of Europe’s museums, archives and libraries, including the audiovisual material they hold, is approximately €100bn, or €10bn per annum for the next 10 years, factoring in a cumulative efficiency gain of 0.5% per annum. The Research & Development Budget for the Joint Strike Fighter programme is estimated at €40.34bn. It would cost between 10% and 40% of the Joint Strike Fighter R&D budget to digitise every eligible title in Europe’s libraries. Source: Nick Poole, Collections Trust, http://nickpoole.org.uk/wp-content/uploads/2011/12/digiti_report.pdf
  • 6.
  • 7. Currently: 2 million pages of full text. By 2015: 10 million pages of full text. Search by keyword, and organise by language, date, source library or title. Link: http://www.theeuropeanlibrary.org/tel4/newspapers
  • 8. Currently: metadata records relating to 1.12m issues. By 2015: metadata records relating to up to 4m issues. Browse by date or map. Link: http://www.theeuropeanlibrary.org/tel4/newspapers
  • 9. Full text from the following libraries: • Bibliotheque nationale de France / National Library of France • Koninklijke Bibliotheek / National Library of the Netherlands • Landesbibliothek Dr. Friedrich Teßmann / Teßmann Library • Eesti Rahvusraamatukogu / Estonian National Library • Kansalliskirjasto / National Library of Finland • Latvijas Nacionala Biblioteka / National Library of Latvia • Biblioteka Narodowa / National Library of Poland • Milli Kutuphane Baskanligi / National Library of Turkey • Österreichische Nationalbibliothek / Austrian National Library • Staatsbibliothek zu Berlin / Berlin State Library • Staats- und Universitätsbibliothek Hamburg / State and University Library • Univerzitet u Beogradu / University Library of Belgrade. Searching by title
  • 10. Issue-level records from the following libraries: • National Library of Wales • St. Cyril and Methodius National Library / The National Library of Bulgaria • National Library of Czech Republic • National and University Library in Zagreb • Koninklijke Bibliotheek van België / Bibliothèque royale de Belgique • Narodna in univerzitetna knjižnica / National and University Library of Slovenia • National Library of Portugal • National Library of Romania • Landsbókasafn Íslands - Háskólabókasafn / National and University Library of Iceland • National Library of Spain • Bibliothèque nationale de Luxembourg / National Library of Luxembourg. Finding matching results in single or multiple issues
  • 12. So far, okay. Similar functionality to other national and regional digital libraries of newspapers. See other archives via: https://www.google.com/maps/ms?msid=217164746645697066594.0004c3d764fcb71ed2314&msa=0
  • 13. But what was the user response to an aggregation of European newspaper libraries? Results of Usability Testing: http://www.europeana-newspapers.eu/wp-content/uploads/2014/05/The-European-Library-Newspaper-Archive-Usability-testing-Report-April-2014.pdf
  • 15. “Many saying they would be keen to return to the site as the content expands.”
  • 16. “Ability to search over geographic map was highly valued”
  • 17. Plenty of quibbles about design: positions of advanced options; re-ordering the list of results; manipulating facets
  • 18. Much greater expectations of functionality once logged in. For example: saved searches, new content notification
  • 19. “Much of the value of the site to participants was provided by the images of the documents. Participants expected to be able to save a 'local' copy once they had located content of relevance. As no download facility is provided, this led to some frustration and undermined the overall potential value of the site for some participants.”
  • 20. Timetable for rest of project. Now: prototype version of interface shared with project. Throughout 2014: ongoing creation of OCR, and other related technical work (OLR, Named Entities). Throughout 2014: live version of website improved / usability testing / added content. Autumn 2014: final project conference. Late 2014: newspaper browser completed with content and tools from project. More information at http://www.europeana-newspapers.eu/ Interface at http://www.theeuropeanlibrary.org/tel4/newspapers/
  • 21. Things the users didn’t say (but we thought they would)
  • 22. Why can’t I edit the text? (Our sample was researchers; maybe it is other communities that are interested in crowdsourcing?) Note: If time permits, The European Library will develop a crowdsourcing feature
  • 23. Source: Europeana Strategic Plan, 2015-2020, currently unpublished. See also Enumerate Project, enumerate.eu
  • 24. Number of digitised pages in interface: c.2m. Number of digitised pages in European libraries: c.130m. Number of physical pages in European libraries: 1.5bn+. Source: European Newspaper Survey Report http://www.europeana-newspapers.eu/wp-content/uploads/2012/04/D4.1-Europeana-newspapers-survey-report.pdf
  • 25. Source: European Newspaper Survey Report http://www.europeana-newspapers.eu/wp-content/uploads/2012/04/D4.1-Europeana-newspapers-survey-report.pdf Quantities of newspapers: a) in project, b) digitised in total, c) in physical libraries
  • 26. The project digital library is only a fraction of the newspaper archive of the continent, indeed the world
  • 27. As libraries, how should we represent that absence to users?
  • 28. Should such absence be represented in the interface itself?
  • 30. ….. Difficult to represent ‘archival gaps’ when seen in the context of how little has been digitised - creates a needle in the haystack ….
  • 31. The estimated total cost of digitising the collections of Europe’s museums, archives and libraries, including the audiovisual material they hold, is approximately €100bn, or €10bn per annum for the next 10 years, factoring in a cumulative efficiency gain of 0.5% per annum. The Research & Development Budget for the Joint Strike Fighter programme is estimated at €40.34bn. It would cost between 10% and 40% of the Joint Strike Fighter R&D budget to digitise every eligible title in Europe’s libraries. Source: Nick Poole, Collections Trust, http://nickpoole.org.uk/wp-content/uploads/2011/12/digiti_report.pdf
  • 32. Standardised information for every digital resource for representing collections, extent of content, licensing and re-use conditions
  • 33. Standardised information? For every digital resource produced in the world? Are you kidding?
  • 34. Charts and graphs external to the interface?
  • 35. Graphs are the most obvious way of adding context but still very reliant on the library producing such charts
  • 36. How to derive a representative (random) sample from a digital collection? Source: http://dilbert.com/strips/comic/2001-10-25/
  • 37. Pieter Francois, winner of BL Labs competition 2013: “How representative are the historical texts humanities scholars study of the overall body of ‘surviving’ texts that are held in the various library collections?” labs.bl.uk/Sample+Generator
  • 38. There are other issues in the project content too. Major issues: OCR quality varies; different licensing statements from different countries; date of copyright boundaries different in each country
  • 39. There are other issues in the interface too. Minor issues: some pages (2m by 2015) have article segmentation; some library content has named entity extraction, affecting search results
  • 40.
  • 41. Source: http://homepages.inf.ed.ac.uk/balex/publications/slides-DATeCH.pdf 10M pages, 7 billion words – how much you are actually ignoring when using only the “good” OCR
  • 42. How should we allow users better ways to understand the digital library?
  • 43. What role can the API play in this? Can opening up the data in the digital library allow it to be explored in different ways?
  • 44. API: Application Programming Interface. Traditional model: Interface (created by library) on top of Data (published by library). With an API: Interface (created by third party) on top of Data (published by library)
  • 45. Pioneering work of Trove API (or rather of Tim Sherratt)
  • 46. Currently: 2 million pages of full text. By 2015: 10 million pages of full text. Search by keyword, and organise by language, date, source library or title. Link: http://www.theeuropeanlibrary.org/tel4/newspapers
  • 47. Trove Newspapers statistics, developed by third party, based on data provided by library http://wraggelabs.com/shed/trove/graphs/ Interface (Created by Third Party) Data (Published by Library)
  • 48. Headline Roulette, developed by third party, based on data provided by library http://wraggelabs.com/shed/headline-roulette/ Interface (Created by Third Party) Data (Published by Library)
  • 49. Word Count of Articles, developed by third party, based on data provided by library http://dhistory.org/frontpages/53/words/ Interface (Created by Third Party) Data (Published by Library)
  • 51. How many people in this audience would know how to build an interface on top of an API?
  • 52. How many users do you know who could build on top of an API?
  • 53.
  • 54. Currently: metadata records relating to 1.12m issues. By 2015: metadata records relating to up to 4m issues. Browse by date or map. Link: http://www.theeuropeanlibrary.org/tel4/newspapers
  • 55. Credits. Desert: https://www.flickr.com/photos/aigle_dore/5952236932/sizes/l Borges Sign: https://www.flickr.com/photos/monceau/7705020640/ Map: http://gallica.bnf.fr/ark:/12148/btv1b530299707 Strike Fighter: http://en.wikipedia.org/wiki/Strike_fighter
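
The proportions quoted on slide 24 and the sampling question raised on slide 36 can be made concrete with a little arithmetic. The following is only an illustrative sketch and not part of the original presentation: the names `share` and `sample_ids`, the seed value and the integer stand-in identifiers are assumptions, and the page counts are the approximate figures from slide 24 (with "1.5bn+" treated as a lower bound).

```python
import random

# Approximate figures quoted on slide 24.
PAGES_IN_INTERFACE = 2_000_000       # c.2m digitised pages in the interface
PAGES_DIGITISED = 130_000_000        # c.130m digitised pages in European libraries
PAGES_PHYSICAL = 1_500_000_000       # 1.5bn+ physical pages (lower bound)


def share(part: int, whole: int) -> float:
    """Return `part` as a percentage of `whole`."""
    return 100.0 * part / whole


def sample_ids(item_ids, k, seed=2014):
    """Draw a reproducible simple random sample of k identifiers.

    Hypothetical helper: a fixed seed lets others regenerate the same sample.
    """
    rng = random.Random(seed)
    return rng.sample(list(item_ids), k)


if __name__ == "__main__":
    print(f"Interface as share of digitised pages: "
          f"{share(PAGES_IN_INTERFACE, PAGES_DIGITISED):.2f}%")
    print(f"Digitised as share of physical pages:  "
          f"{share(PAGES_DIGITISED, PAGES_PHYSICAL):.2f}%")
    print(f"Interface as share of physical pages:  "
          f"{share(PAGES_IN_INTERFACE, PAGES_PHYSICAL):.2f}%")

    # Stand-in identifiers; a real collection would supply its own page or issue IDs.
    print(sample_ids(range(1, 1001), k=5))
```

On these figures the interface holds roughly 1.5% of the digitised pages and about 0.13% of the physical pages, which is the scale of absence the slides ask libraries to represent.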