Europeana Newspapers (Project Details and Aggregation Workflow)
1. Search and Browse Europe’s Historical Newspapers:
The Europeana Newspapers Content Browser
Connecting knowledge
Alena Fedasenka, Markus Muhr, Elizabeth Joss, Anastasia Gasia, Alastair Dunning
Project Details
Newspapers Content Browser
Search within full-text and image viewer
• Allowing the search and browsing of historic
newspapers and putting them within everyone’s
reach
• A three-year project from February 2012 to
January 2015
Search by newspaper
Search within
full-text and
refine your
search
Explore
other search
results
within a
particular
issue
Navigate to
provider’s
image
server
• Funded under the European Commission’s CIP
2007 – 2013 Programme
Search full-text
Search
newspaper
titles
Newspaper page links to provider
Navigate to
newspaper
issue by
date
• Aggregating 18 million historic newspaper pages
for Europeana and The European Library
• Converting 10 million newspaper pages to fulltext, helping users quickly search for specific
articles, people and destinations mentioned within
the newspaper
Newspaper Issue Page:
Newspaper Results
Page:
• Search within full-text panel
or image viewer
• Find a specific word/article
and view its corresponding
image
• Search within the newspaper
image and navigate to the
highlighted full-text section.
• Building a special content viewer to improve
online newspaper browsing
• Building tools for professionals, which will better
assess the quality of newspaper digitization in
relation to levels of detail, speed and costs.
• Search by newspaper
title or within
newspaper content
• Refine your search,
filter results, obtain
search suggestions
etc.
Newspaper Main Page:
Newspaper Gallery:
Newspaper Viewer:
• Explore historical
newspapers by various
filters, for example,
title, date, provider,
language, popularity,
country etc.
• Navigate to
newspaper record
page to search for
available issue dates
and other newspaper
publication details
• Specific browsing tools
available, for example,
zooming, navigating etc.
• Leaf through
newspapers by page
number and issue date
Newspapers Aggregation Workflow
Project Innovations
• Dynamic image retrieval from partner libraries
Harvest Metadata
XSLT Transformations
Storing Metadata
Copy/Transform
METS/ALTO Enrichments
• Named Entity extraction
Partners
servers
External image
server
DB
DB
DB
DB
Server
Enrichment & Format
Normalization
• Searching across named national boundaries
Metadata
Repository
Enrich
TEL IIP image
server
Index
Full-text
Repository
• What was a published on a single day – an international
perspective
Learn More
Our Newspaper content browser will launch in early 2014. For the
latest information check our websites and follow us on Twitter:
http://www.europeana-newspapers.eu
http://www.theeuropeanlibrary.org
@eurnews
Hard Discs from
UIBK/ CCS
This project runs from February 2012 to February 2015. It is led by the Staatsbibliothek zu Berlin and co-funded by
the European Commission under the Competitiveness and Innovation framework Programme. http://ec.europa.eu/ict_psp
Fulltext
Fulltext
Index
Index
Partners providing
digitized content