SlideShare une entreprise Scribd logo
1  sur  30
Europeana Newspapers Project
"Distant Reading: Historic Newspapers in the Digital Age“

National Library, Warsaw, Poland
January 16, 2014
Ulrike Kölsch, Project Coordinator - Berlin State Library
Europeana Newspapers
16 January 2014 – Warsaw– Morning Edition
Europeana Newspapers Project

On 15th April 1912, the passenger ship
Titanic, carrying over 2000 passengers and
crew, crashed into an iceberg on its maiden
voyage from Southampton to New York

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

3
Europeana Newspapers Project

Responses to the Titanic Disaster

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

4
Europeana Newspapers Project

Responses to the Titanic Disaster

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

5
Europeana Newspapers Project

Responses to the Titanic Disaster

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

6
Europeana Newspapers Project

Responses to the Titanic Disaster

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

7
Europeana Newspapers Project

Responses to the Titanic Disaster

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

8
Europeana Newspapers Project

Responses to the Titanic Disaster

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

9
Europeana Newspapers Project

News travels at
different speeds,
with importance that
diminishes at
different rates.
This is true now as
is was in 1912.
(though the web changes things
…)

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

10
Europeana Newspapers Project

The Europeana Newspapers Project is
making this kind of investigation easier, in
several ways

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

11
Europeana Newspapers Project

1. By creating full text for 8m pages
2. By undertaking article segmentation for 2m
pages
3. By undertaking named entity extraction for 2m
pages
4. By developing a cross-searchable newspapers
browser at The European Library
(with metadata forwarded to Europeana)

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

12
Europeana Newspapers Project
Best Practice Network that aims at aggregating 18 million digitised
historic newspaper pages from 12 European libraries, drastically
improving search and retrieve possibilities.
Volume
Cross European cultures

Sharing best practices

Improving accessibility

Improving availability
This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

13
The challenges……
Newspapers were not meant to be preserved…
 frail and crumbly paper
 missing edition
 incomplete supplements
 poorly bound
 fading ink
 different fonts
 legal uncertainties
with contemporary material

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp
Who

12 content providers

Blue– Providing
Content
Yellow –Providing
Technical Services
Green – Associate
Partners

2 networking partners

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp
Who
4 technology providers
12 content providers

1 aggregator

2 networking partners

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp
Challenges and Solutions in Creating a European Historic
Newspapers Browser I
Creating a newspapers interface that ...

Provides unique value to users
Reflects relationship to original
physical newspaper collections

Is sustainable
Offers contributors added value
Defines relationship to Europeana
Respects library wishes
This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

17
Challenges and Solutions in Creating a European Historic
Newspapers Browser II

What content will be included ?
Full Images, Full Text, Metadata
Latvia, Belgrade, Germany (Hamburg, Berlin), Estonia,
Finland, Netherlands , Austria
Snippets of Images, Full Text, Metadata
Italy, France , Poland

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

18
Challenges and Solutions in Creating a European Historic
Newspapers Browser III

First Iteration
- Basic text search
- Filtering of results by date,
country, newspaper,
language, library
- OCR shown
- Zoom able version of full image
- Clickable links between full text and image (sometimes)
- Link to newspaper source library

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

19
Challenges and Solutions in Creating a European Historic
Newspapers Browser IV

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

20
Challenges and Solutions in Creating a European Historic
Newspapers Browser V

Complete Newspaper image can be shown

Eesti Potimees ehk
Naddaleleht,
2 November 1866
(National Library of Estonia)

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

21
Challenges and Solutions in Creating a European Historic
Newspapers Browser VI

Fragment of Newspaper image can be shown

Dziennik Slaskui,
10 June 1915
(National Library of Poland)
This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

22
Challenges and Solutions in Creating a European Historic
Newspapers Browser VII

• Just title level metadata can be shown:

“Kleine Blatt, 15 November 1932”
(National Library of Austria)

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

23
Challenges and Solutions in Creating a European Historic
Newspapers Browser VIII

Zooming in

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

24
Challenges and Solutions in Creating a European Historic
Newspapers Browser IX

Second Iteration
- Fragments
- See information on particular title
- See what was published on a particular day
- Search over titles (not just text)
- Other browse-able visualisations of publication and
library source
- Search / browse via entities

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

25
Challenges and Solutions in Creating a European Historic
Newspapers Browser X

Who are the users ?
- Historians
- Researchers
- Students
- Genealogists
- Teachers and school pupils
- Interested public  Citizen researcher
…

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

26
Challenges for Users

“Texts are designed to “speak” to us, and so, they always end
up telling us something; but archives are not messages that
were meant to address us, and so they say absolutely
nothing until one asks the right question.”
(Franco Moretti "Distant Reading“, 2013)

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

27
Share best practices
… via workshops and national information days

Image: Australian National Maritime Museum

This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

28
Network Partner Project
Europeana Collections 1914-1918 – Remembering the First World War
Unlocking Sources – The First World War online & Europeana“,
30./31.01.2014
2014 will mark the centenary of the outbreak of the First World War, which
will be commemorated worldwide. In recent years a wide range of
European cultural institutions, including the Staatsbibliothek zu Berlin,
have digitized manuscript and print materials as well as film holdings.
Books, photos, films, posters, manuscripts, and song lyrics have recently
been made available online.
On 30 and 31 January 2014 the Staatsbibliothek zu Berlin will host the event
“Unlocking Sources – The First World War online & Europeana” to mark
the commemoration.

More information : www.unlocking-sources.eu
This project is partially funded under the ICT Policy Support Programme (ICT PSP)
as part of the Competitiveness and Innovation Framework Programme by the
European Community http://ec.europa.eu/ict_psp

29
Thank you for interest!
More information on our website
www.europeana-newspapers.eu

Contenu connexe

Tendances

Europeana Newspapers LIBER2013 Workshop intro
Europeana Newspapers LIBER2013 Workshop introEuropeana Newspapers LIBER2013 Workshop intro
Europeana Newspapers LIBER2013 Workshop intro
Europeana Newspapers
 
Europeana Newspapers Amsterdam workshop introduction
Europeana Newspapers Amsterdam workshop introductionEuropeana Newspapers Amsterdam workshop introduction
Europeana Newspapers Amsterdam workshop introduction
Europeana Newspapers
 
Europeana Newspapers wp2 liber2013
Europeana Newspapers wp2 liber2013Europeana Newspapers wp2 liber2013
Europeana Newspapers wp2 liber2013
Europeana Newspapers
 
04 europeana newspapers
04 europeana newspapers04 europeana newspapers
04 europeana newspapers
Europeana
 
Europeana Newspaper metadata LIBER2013
Europeana Newspaper metadata LIBER2013Europeana Newspaper metadata LIBER2013
Europeana Newspaper metadata LIBER2013
Europeana Newspapers
 

Tendances (20)

IFLA 2014 Europeana Newspapers Rossitza Atanassova
IFLA 2014 Europeana Newspapers Rossitza AtanassovaIFLA 2014 Europeana Newspapers Rossitza Atanassova
IFLA 2014 Europeana Newspapers Rossitza Atanassova
 
Presentation of Hans-Jörg Lieder, BnF Information Day
Presentation of Hans-Jörg Lieder, BnF Information DayPresentation of Hans-Jörg Lieder, BnF Information Day
Presentation of Hans-Jörg Lieder, BnF Information Day
 
Overview of the Europeana Newspapers Project
Overview of the Europeana Newspapers ProjectOverview of the Europeana Newspapers Project
Overview of the Europeana Newspapers Project
 
Presentation of Clemens Neudecker, BnF Information Day
Presentation of Clemens Neudecker, BnF Information DayPresentation of Clemens Neudecker, BnF Information Day
Presentation of Clemens Neudecker, BnF Information Day
 
Europeana Newspapers Estonian Infoday Krista Kiisa
Europeana Newspapers Estonian Infoday Krista KiisaEuropeana Newspapers Estonian Infoday Krista Kiisa
Europeana Newspapers Estonian Infoday Krista Kiisa
 
Europeana Newspapers LIBER2013 Workshop intro
Europeana Newspapers LIBER2013 Workshop introEuropeana Newspapers LIBER2013 Workshop intro
Europeana Newspapers LIBER2013 Workshop intro
 
Europeana Newspapers Amsterdam workshop introduction
Europeana Newspapers Amsterdam workshop introductionEuropeana Newspapers Amsterdam workshop introduction
Europeana Newspapers Amsterdam workshop introduction
 
Europeana Newspapers wp2 liber2013
Europeana Newspapers wp2 liber2013Europeana Newspapers wp2 liber2013
Europeana Newspapers wp2 liber2013
 
ENP Belgrade WS Metadata
ENP Belgrade WS MetadataENP Belgrade WS Metadata
ENP Belgrade WS Metadata
 
Europeana_Newspapers_ONB_infoday_HJLieder
Europeana_Newspapers_ONB_infoday_HJLiederEuropeana_Newspapers_ONB_infoday_HJLieder
Europeana_Newspapers_ONB_infoday_HJLieder
 
Refinement of Digitised Newspapers
Refinement of Digitised NewspapersRefinement of Digitised Newspapers
Refinement of Digitised Newspapers
 
The challenges of making Europe's newspapers available online
The challenges of making Europe's newspapers available onlineThe challenges of making Europe's newspapers available online
The challenges of making Europe's newspapers available online
 
Europeana Newspapers Aggregation Plan
Europeana Newspapers Aggregation PlanEuropeana Newspapers Aggregation Plan
Europeana Newspapers Aggregation Plan
 
04 europeana newspapers
04 europeana newspapers04 europeana newspapers
04 europeana newspapers
 
Europeana Newspapers - the Gateway to European Newspapers Online
Europeana Newspapers - the Gateway to European Newspapers OnlineEuropeana Newspapers - the Gateway to European Newspapers Online
Europeana Newspapers - the Gateway to European Newspapers Online
 
EurnewsLDN_Clemens_Neudecker
EurnewsLDN_Clemens_NeudeckerEurnewsLDN_Clemens_Neudecker
EurnewsLDN_Clemens_Neudecker
 
Large scale refinement of digital historical newspapers with named entities r...
Large scale refinement of digital historical newspapers with named entities r...Large scale refinement of digital historical newspapers with named entities r...
Large scale refinement of digital historical newspapers with named entities r...
 
Europeana Newspaper metadata LIBER2013
Europeana Newspaper metadata LIBER2013Europeana Newspaper metadata LIBER2013
Europeana Newspaper metadata LIBER2013
 
META-NET and META-SHARE: An Overview
META-NET and META-SHARE: An OverviewMETA-NET and META-SHARE: An Overview
META-NET and META-SHARE: An Overview
 
packed-preforma@lleida2015
packed-preforma@lleida2015packed-preforma@lleida2015
packed-preforma@lleida2015
 

Similaire à Europeana Newspapers Polish Information Day

Europeana: Connecting society through aggregation
Europeana: Connecting society through aggregationEuropeana: Connecting society through aggregation
Europeana: Connecting society through aggregation
Museums Computer Group
 
04 digitising contemporary art
04 digitising contemporary art04 digitising contemporary art
04 digitising contemporary art
Europeana
 

Similaire à Europeana Newspapers Polish Information Day (16)

Europeana Newspapers LFT Infoday Muehlberger
Europeana Newspapers LFT Infoday MuehlbergerEuropeana Newspapers LFT Infoday Muehlberger
Europeana Newspapers LFT Infoday Muehlberger
 
Europeana Newspapers in a nutshell
Europeana Newspapers in a nutshellEuropeana Newspapers in a nutshell
Europeana Newspapers in a nutshell
 
Centre of Competence in digitisation. Clemens Neudecker
Centre of Competence in digitisation. Clemens NeudeckerCentre of Competence in digitisation. Clemens Neudecker
Centre of Competence in digitisation. Clemens Neudecker
 
'Smart Cities'/'Open Data' event in Westminster on 13/11/14: EC/Olavi Luotone...
'Smart Cities'/'Open Data' event in Westminster on 13/11/14: EC/Olavi Luotone...'Smart Cities'/'Open Data' event in Westminster on 13/11/14: EC/Olavi Luotone...
'Smart Cities'/'Open Data' event in Westminster on 13/11/14: EC/Olavi Luotone...
 
Europeana: Connecting society through aggregation
Europeana: Connecting society through aggregationEuropeana: Connecting society through aggregation
Europeana: Connecting society through aggregation
 
Digitisation of Cultural Heritage: Funding Opportunities at EU level - Luca M...
Digitisation of Cultural Heritage: Funding Opportunities at EU level - Luca M...Digitisation of Cultural Heritage: Funding Opportunities at EU level - Luca M...
Digitisation of Cultural Heritage: Funding Opportunities at EU level - Luca M...
 
Experimental Workflow Development in Digitisation
Experimental Workflow Development in DigitisationExperimental Workflow Development in Digitisation
Experimental Workflow Development in Digitisation
 
Science and Culture in the EU‘s Digital Agenda
Science and Culture  in the EU‘s Digital AgendaScience and Culture  in the EU‘s Digital Agenda
Science and Culture in the EU‘s Digital Agenda
 
Citadel Apps4Dummies London Workshop - 13th Nov 2014 - Olavi Luotonen
Citadel Apps4Dummies London Workshop - 13th Nov 2014 - Olavi LuotonenCitadel Apps4Dummies London Workshop - 13th Nov 2014 - Olavi Luotonen
Citadel Apps4Dummies London Workshop - 13th Nov 2014 - Olavi Luotonen
 
OCR challenges in historic documents and the contribution of IMPACT
OCR challenges in historic documents and the contribution of IMPACTOCR challenges in historic documents and the contribution of IMPACT
OCR challenges in historic documents and the contribution of IMPACT
 
Positioning libraries in the digital preservation landscape
Positioning libraries in the digital preservation landscapePositioning libraries in the digital preservation landscape
Positioning libraries in the digital preservation landscape
 
04 digitising contemporary art
04 digitising contemporary art04 digitising contemporary art
04 digitising contemporary art
 
An Experimental Workflow Development Platform for Historical Document Digitis...
An Experimental Workflow Development Platform for Historical Document Digitis...An Experimental Workflow Development Platform for Historical Document Digitis...
An Experimental Workflow Development Platform for Historical Document Digitis...
 
Cultural Heritage & H2020
Cultural Heritage & H2020Cultural Heritage & H2020
Cultural Heritage & H2020
 
Workflow Development for OCR (and beyond)
Workflow Development for OCR (and beyond)Workflow Development for OCR (and beyond)
Workflow Development for OCR (and beyond)
 
E mobility as part of smart city concepts - Vienna case study
E mobility as part of smart city concepts - Vienna case studyE mobility as part of smart city concepts - Vienna case study
E mobility as part of smart city concepts - Vienna case study
 

Plus de Europeana Newspapers

Présentation Günter Mühlberger, BnF Information Day
Présentation Günter Mühlberger, BnF Information DayPrésentation Günter Mühlberger, BnF Information Day
Présentation Günter Mühlberger, BnF Information Day
Europeana Newspapers
 

Plus de Europeana Newspapers (20)

Presentation of Philippe Mezzasalma at the BnF Information Day in Paris
Presentation of Philippe Mezzasalma at the BnF Information Day in ParisPresentation of Philippe Mezzasalma at the BnF Information Day in Paris
Presentation of Philippe Mezzasalma at the BnF Information Day in Paris
 
Presentation of Ioannis Anagnostopoulos at BnF Information Day
Presentation of Ioannis Anagnostopoulos at BnF Information DayPresentation of Ioannis Anagnostopoulos at BnF Information Day
Presentation of Ioannis Anagnostopoulos at BnF Information Day
 
Présentation Günter Mühlberger, BnF Information Day
Présentation Günter Mühlberger, BnF Information DayPrésentation Günter Mühlberger, BnF Information Day
Présentation Günter Mühlberger, BnF Information Day
 
Presentation of Claus Gravenhorst, BnF Information Day
Presentation of Claus Gravenhorst, BnF Information DayPresentation of Claus Gravenhorst, BnF Information Day
Presentation of Claus Gravenhorst, BnF Information Day
 
Presentation of Alaa Abi Haidar at the BnF Information Day
Presentation of Alaa Abi Haidar at the BnF Information DayPresentation of Alaa Abi Haidar at the BnF Information Day
Presentation of Alaa Abi Haidar at the BnF Information Day
 
Europeana Newspapers Estonian Infoday Ragne Kouts
Europeana Newspapers Estonian Infoday Ragne KoutsEuropeana Newspapers Estonian Infoday Ragne Kouts
Europeana Newspapers Estonian Infoday Ragne Kouts
 
Europeana Newspapers Estonian Infoday Kristel Veimann
Europeana Newspapers Estonian Infoday Kristel VeimannEuropeana Newspapers Estonian Infoday Kristel Veimann
Europeana Newspapers Estonian Infoday Kristel Veimann
 
Europeana Newspapers Estonian Infoday Krista Aru
Europeana Newspapers Estonian Infoday Krista AruEuropeana Newspapers Estonian Infoday Krista Aru
Europeana Newspapers Estonian Infoday Krista Aru
 
Europeana Newspapers Estonian Infoday Fred Puss
Europeana Newspapers Estonian Infoday Fred PussEuropeana Newspapers Estonian Infoday Fred Puss
Europeana Newspapers Estonian Infoday Fred Puss
 
Europeana Newpapers LFT Infoday Neudecker
Europeana Newpapers LFT Infoday NeudeckerEuropeana Newpapers LFT Infoday Neudecker
Europeana Newpapers LFT Infoday Neudecker
 
Europeana Newspapers LFT Infoday Thompson
Europeana Newspapers LFT Infoday ThompsonEuropeana Newspapers LFT Infoday Thompson
Europeana Newspapers LFT Infoday Thompson
 
Europeana Newspapers LFT Infoday Rossi
Europeana Newspapers LFT Infoday RossiEuropeana Newspapers LFT Infoday Rossi
Europeana Newspapers LFT Infoday Rossi
 
Enp lft infoday_neudecker
Enp lft infoday_neudeckerEnp lft infoday_neudecker
Enp lft infoday_neudecker
 
Europeana Newspapers LFT Infoday Messina
Europeana Newspapers LFT Infoday MessinaEuropeana Newspapers LFT Infoday Messina
Europeana Newspapers LFT Infoday Messina
 
Europeana Newspapers Infoday Marchetti
Europeana Newspapers Infoday MarchettiEuropeana Newspapers Infoday Marchetti
Europeana Newspapers Infoday Marchetti
 
Europeana Newspapers LFT Infoday Kempf
Europeana Newspapers LFT Infoday KempfEuropeana Newspapers LFT Infoday Kempf
Europeana Newspapers LFT Infoday Kempf
 
Europeana Newspapers LFT Infoday Genereux
Europeana Newspapers LFT Infoday GenereuxEuropeana Newspapers LFT Infoday Genereux
Europeana Newspapers LFT Infoday Genereux
 
Europeana Newspapers LFT Infoday Bolioli
Europeana Newspapers LFT Infoday BolioliEuropeana Newspapers LFT Infoday Bolioli
Europeana Newspapers LFT Infoday Bolioli
 
ENP_Dutch_Infoday_MWillems
ENP_Dutch_Infoday_MWillemsENP_Dutch_Infoday_MWillems
ENP_Dutch_Infoday_MWillems
 
ENP_Dutch_Infoday_PHuijnen
ENP_Dutch_Infoday_PHuijnen ENP_Dutch_Infoday_PHuijnen
ENP_Dutch_Infoday_PHuijnen
 

Dernier

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Dernier (20)

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 

Europeana Newspapers Polish Information Day

  • 1. Europeana Newspapers Project "Distant Reading: Historic Newspapers in the Digital Age“ National Library, Warsaw, Poland January 16, 2014 Ulrike Kölsch, Project Coordinator - Berlin State Library
  • 2. Europeana Newspapers 16 January 2014 – Warsaw– Morning Edition
  • 3. Europeana Newspapers Project On 15th April 1912, the passenger ship Titanic, carrying over 2000 passengers and crew, crashed into an iceberg on its maiden voyage from Southampton to New York This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 3
  • 4. Europeana Newspapers Project Responses to the Titanic Disaster This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 4
  • 5. Europeana Newspapers Project Responses to the Titanic Disaster This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 5
  • 6. Europeana Newspapers Project Responses to the Titanic Disaster This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 6
  • 7. Europeana Newspapers Project Responses to the Titanic Disaster This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 7
  • 8. Europeana Newspapers Project Responses to the Titanic Disaster This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 8
  • 9. Europeana Newspapers Project Responses to the Titanic Disaster This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 9
  • 10. Europeana Newspapers Project News travels at different speeds, with importance that diminishes at different rates. This is true now as is was in 1912. (though the web changes things …) This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 10
  • 11. Europeana Newspapers Project The Europeana Newspapers Project is making this kind of investigation easier, in several ways This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 11
  • 12. Europeana Newspapers Project 1. By creating full text for 8m pages 2. By undertaking article segmentation for 2m pages 3. By undertaking named entity extraction for 2m pages 4. By developing a cross-searchable newspapers browser at The European Library (with metadata forwarded to Europeana) This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 12
  • 13. Europeana Newspapers Project Best Practice Network that aims at aggregating 18 million digitised historic newspaper pages from 12 European libraries, drastically improving search and retrieve possibilities. Volume Cross European cultures Sharing best practices Improving accessibility Improving availability This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 13
  • 14. The challenges…… Newspapers were not meant to be preserved…  frail and crumbly paper  missing edition  incomplete supplements  poorly bound  fading ink  different fonts  legal uncertainties with contemporary material This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp
  • 15. Who 12 content providers Blue– Providing Content Yellow –Providing Technical Services Green – Associate Partners 2 networking partners This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp
  • 16. Who 4 technology providers 12 content providers 1 aggregator 2 networking partners This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp
  • 17. Challenges and Solutions in Creating a European Historic Newspapers Browser I Creating a newspapers interface that ... Provides unique value to users Reflects relationship to original physical newspaper collections Is sustainable Offers contributors added value Defines relationship to Europeana Respects library wishes This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 17
  • 18. Challenges and Solutions in Creating a European Historic Newspapers Browser II What content will be included ? Full Images, Full Text, Metadata Latvia, Belgrade, Germany (Hamburg, Berlin), Estonia, Finland, Netherlands , Austria Snippets of Images, Full Text, Metadata Italy, France , Poland This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 18
  • 19. Challenges and Solutions in Creating a European Historic Newspapers Browser III First Iteration - Basic text search - Filtering of results by date, country, newspaper, language, library - OCR shown - Zoom able version of full image - Clickable links between full text and image (sometimes) - Link to newspaper source library This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 19
  • 20. Challenges and Solutions in Creating a European Historic Newspapers Browser IV This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 20
  • 21. Challenges and Solutions in Creating a European Historic Newspapers Browser V Complete Newspaper image can be shown Eesti Potimees ehk Naddaleleht, 2 November 1866 (National Library of Estonia) This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 21
  • 22. Challenges and Solutions in Creating a European Historic Newspapers Browser VI Fragment of Newspaper image can be shown Dziennik Slaskui, 10 June 1915 (National Library of Poland) This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 22
  • 23. Challenges and Solutions in Creating a European Historic Newspapers Browser VII • Just title level metadata can be shown: “Kleine Blatt, 15 November 1932” (National Library of Austria) This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 23
  • 24. Challenges and Solutions in Creating a European Historic Newspapers Browser VIII Zooming in This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 24
  • 25. Challenges and Solutions in Creating a European Historic Newspapers Browser IX Second Iteration - Fragments - See information on particular title - See what was published on a particular day - Search over titles (not just text) - Other browse-able visualisations of publication and library source - Search / browse via entities This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 25
  • 26. Challenges and Solutions in Creating a European Historic Newspapers Browser X Who are the users ? - Historians - Researchers - Students - Genealogists - Teachers and school pupils - Interested public  Citizen researcher … This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 26
  • 27. Challenges for Users “Texts are designed to “speak” to us, and so, they always end up telling us something; but archives are not messages that were meant to address us, and so they say absolutely nothing until one asks the right question.” (Franco Moretti "Distant Reading“, 2013) This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 27
  • 28. Share best practices … via workshops and national information days Image: Australian National Maritime Museum This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 28
  • 29. Network Partner Project Europeana Collections 1914-1918 – Remembering the First World War Unlocking Sources – The First World War online & Europeana“, 30./31.01.2014 2014 will mark the centenary of the outbreak of the First World War, which will be commemorated worldwide. In recent years a wide range of European cultural institutions, including the Staatsbibliothek zu Berlin, have digitized manuscript and print materials as well as film holdings. Books, photos, films, posters, manuscripts, and song lyrics have recently been made available online. On 30 and 31 January 2014 the Staatsbibliothek zu Berlin will host the event “Unlocking Sources – The First World War online & Europeana” to mark the commemoration. More information : www.unlocking-sources.eu This project is partially funded under the ICT Policy Support Programme (ICT PSP) as part of the Competitiveness and Innovation Framework Programme by the European Community http://ec.europa.eu/ict_psp 29
  • 30. Thank you for interest! More information on our website www.europeana-newspapers.eu