SlideShare une entreprise Scribd logo
1  sur  28
Télécharger pour lire hors ligne
Australian Newspapers
       Digitisation Program

          Overview of Progress
      March 2007 – November 2008
and the public search and delivery system
           Rose Holley – Manager ANDP
             5 November 2008, National Library of Australia
       Presentation to the National Library of Indonesia Delegation.




                                                                       1
Objectives
 Increase access to Australian newspapers 

 Build a national service that will provide free online 
 access from the first Australian newspaper published 
 in 1803 through to the end of 1954 

 Key Features of the service
   Online access
   Freely available
   Full text searchable
                                                     2
Website:   http://www.nla.gov.au/ndp   3
National Content
                                             Northern
                                             Territory
 Initial focus on                            Times

 major titles from 
 each state and 
 territory
                                                                 Courier Mail
 Anticipate that 
 ‘regional’ titles may 
 be contributed later     West Australian                    Sydney Morning Herald
                                                             Sydney Gazette
                                            Advertiser
                                                           Canberra Times
 Coverage: published 
 between 1803 – 1954                                     Argus


 (out of copyright)
                                                                     Mercury



                                                                                4
Coverage 1803 ‐ 1954




                   5
State Newspaper Titles




                   6
$1 Million Grant from the Vincent Fairfax Family 
Foundation to digitise The Sydney Morning Herald to 1954




                                                      7
Progress November 2008…
 1.5 million newspaper pages digitised from microfilm 

 Pilot phase completed – Optical Character Recognition 
 (OCR) and content analysis of 50,000 pages

  Prototype search and delivery service developed and 
 tested with Australian state libraries 

 Beta search and delivery service available with 360,000 
 pages (3.5 million articles)
                                                     8
The Process
Microfilm converted to digital images




                                        9
Check images on reels




                  10
Quality Assurance on each page




                                 11
Page 
sequence

Metadata 
creation

Missing
page 
targets

            12
Optical Character Recognition (OCR) of pages and article zoning




                                                            13
OCR In India




               14
Accessing the Newspapers
Beta service now available
Contains 360,000 pages
Open for public use and feedback
Screenshots follow



                                   15
Home page   16
Search results   17
Page view   18
Article View   19
Correct OCR   20
21
Add tags to articles   22
Tag cloud
            23
Add note/comment to article   24
Title information and browse   25
Title list   26
Feedback   27
Website:   http://www.nla.gov.au/ndp   28

Contenu connexe

Plus de Rose Holley

Collecting sharing and improving data: changing roles for librarians and user...
Collecting sharing and improving data: changing roles for librarians and user...Collecting sharing and improving data: changing roles for librarians and user...
Collecting sharing and improving data: changing roles for librarians and user...
Rose Holley
 
Trove: Collecting, Sharing and Improving Digital Data: Changing roles of libr...
Trove: Collecting, Sharing and Improving Digital Data: Changing roles of libr...Trove: Collecting, Sharing and Improving Digital Data: Changing roles of libr...
Trove: Collecting, Sharing and Improving Digital Data: Changing roles of libr...
Rose Holley
 

Plus de Rose Holley (20)

The strategic rebuilding and positioning of UNSW Canberra Special Collections...
The strategic rebuilding and positioning of UNSW Canberra Special Collections...The strategic rebuilding and positioning of UNSW Canberra Special Collections...
The strategic rebuilding and positioning of UNSW Canberra Special Collections...
 
Crowdsourcing based curation and user engagement in digital library design
Crowdsourcing based curation and user engagement in digital library designCrowdsourcing based curation and user engagement in digital library design
Crowdsourcing based curation and user engagement in digital library design
 
National Archives of Australia. AVAMS Project Achievements August 2014
National Archives of Australia. AVAMS Project Achievements August 2014National Archives of Australia. AVAMS Project Achievements August 2014
National Archives of Australia. AVAMS Project Achievements August 2014
 
Building and Managing Online Communities
Building and Managing Online CommunitiesBuilding and Managing Online Communities
Building and Managing Online Communities
 
Collecting sharing and improving data: changing roles for librarians and user...
Collecting sharing and improving data: changing roles for librarians and user...Collecting sharing and improving data: changing roles for librarians and user...
Collecting sharing and improving data: changing roles for librarians and user...
 
Resource Sharing in Australia: 'Find' and 'Get' in Trove - Making 'Getting' b...
Resource Sharing in Australia: 'Find' and 'Get' in Trove - Making 'Getting' b...Resource Sharing in Australia: 'Find' and 'Get' in Trove - Making 'Getting' b...
Resource Sharing in Australia: 'Find' and 'Get' in Trove - Making 'Getting' b...
 
The Australian Women's Weekly now available in Trove: An overview of the digi...
The Australian Women's Weekly now available in Trove: An overview of the digi...The Australian Women's Weekly now available in Trove: An overview of the digi...
The Australian Women's Weekly now available in Trove: An overview of the digi...
 
Ideas for how volunteers at cultural heritage institutions can help, using Tr...
Ideas for how volunteers at cultural heritage institutions can help, using Tr...Ideas for how volunteers at cultural heritage institutions can help, using Tr...
Ideas for how volunteers at cultural heritage institutions can help, using Tr...
 
Finding Information Just Got Easier for Historians. Lachlan Macquarie:200 yea...
Finding Information Just Got Easier for Historians. Lachlan Macquarie:200 yea...Finding Information Just Got Easier for Historians. Lachlan Macquarie:200 yea...
Finding Information Just Got Easier for Historians. Lachlan Macquarie:200 yea...
 
Crowdsourcing Strategies for Archives, Nov 2010
Crowdsourcing Strategies for Archives, Nov 2010Crowdsourcing Strategies for Archives, Nov 2010
Crowdsourcing Strategies for Archives, Nov 2010
 
Social metadata for libraries, archives and museums: Research findings from t...
Social metadata for libraries, archives and museums: Research findings from t...Social metadata for libraries, archives and museums: Research findings from t...
Social metadata for libraries, archives and museums: Research findings from t...
 
A model for incorporating e-resources into Trove, September 2010
A model for incorporating e-resources into Trove, September 2010A model for incorporating e-resources into Trove, September 2010
A model for incorporating e-resources into Trove, September 2010
 
Developments in Access to Art Information: Trove. Presentation at ARLIS confe...
Developments in Access to Art Information: Trove. Presentation at ARLIS confe...Developments in Access to Art Information: Trove. Presentation at ARLIS confe...
Developments in Access to Art Information: Trove. Presentation at ARLIS confe...
 
Consultation Forum: Music Australia and Trove Transition, September 2010, IAM...
Consultation Forum: Music Australia and Trove Transition, September 2010, IAM...Consultation Forum: Music Australia and Trove Transition, September 2010, IAM...
Consultation Forum: Music Australia and Trove Transition, September 2010, IAM...
 
Trove: More Than a Treasure? ALIA Conference Presentation 2010 Brisbane by Ro...
Trove: More Than a Treasure? ALIA Conference Presentation 2010 Brisbane by Ro...Trove: More Than a Treasure? ALIA Conference Presentation 2010 Brisbane by Ro...
Trove: More Than a Treasure? ALIA Conference Presentation 2010 Brisbane by Ro...
 
Trove: A Government 2.0 Showcase August 2010, Australian Parliament
Trove: A Government 2.0 Showcase August 2010, Australian ParliamentTrove: A Government 2.0 Showcase August 2010, Australian Parliament
Trove: A Government 2.0 Showcase August 2010, Australian Parliament
 
Legal Research using digitised historic Australian Newspapers August 2010, by...
Legal Research using digitised historic Australian Newspapers August 2010, by...Legal Research using digitised historic Australian Newspapers August 2010, by...
Legal Research using digitised historic Australian Newspapers August 2010, by...
 
Trove: Explore Like Never Before. Key Features of Trove May 2010
Trove: Explore Like Never Before. Key Features of Trove May 2010Trove: Explore Like Never Before. Key Features of Trove May 2010
Trove: Explore Like Never Before. Key Features of Trove May 2010
 
Trove: Innovation In Access To Information. June 2010
Trove: Innovation In Access To Information. June 2010Trove: Innovation In Access To Information. June 2010
Trove: Innovation In Access To Information. June 2010
 
Trove: Collecting, Sharing and Improving Digital Data: Changing roles of libr...
Trove: Collecting, Sharing and Improving Digital Data: Changing roles of libr...Trove: Collecting, Sharing and Improving Digital Data: Changing roles of libr...
Trove: Collecting, Sharing and Improving Digital Data: Changing roles of libr...
 

Dernier

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Dernier (20)

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 

Australian Newspapers service Progress And Search And Delivery System Nov 2008