SlideShare une entreprise Scribd logo
1  sur  26
● Scale & materiality
– Not individual, standard documents but vast collections of
      them; authenticity demands multiplicity of versions
● Cost
– Preservation not by individuals but large organizations

● Intellectual Property
– If content worth saving someone is making money
  from it
http://www.economist.com/node/15557443   http://youtube-global.blogspot
                                         .com/2011/05/thanks-youtube-community-for-two-big.ht
   “The challenge for libraries is to find ways to preserve
    platform dependent digital works and to prevent the loss of
    complex digital media…. Since we cannot possibly save
    everything, we need to carefully consider which digital
    materials are the most important to preserve and try to
    anticipate the needs of future scholars and researchers”
    Marlene Manoff, 2006


   If preservation priority is X and user need is Y,
    what are the values of X and Y?
If sustainability means that information is kept useful and available, then the LOCKSS
approach has real merit! It implies that SERVICES must be preserved as well.
Abundance of stored content:
                     attention is scarce & must be earned
Services
           Content
   platforms to focus in 2012+



   Maintain active presence on



   Continue to assess
Digital Research & Curator Team



            digital                                                    commercia
                                                                       l
            scholars




                                                       pa
                                                                             academic




                                                         rt n
     social                                                                    digital consortia

                            ers




                                                             ers
     networks                         Digital                                       funding
                         us




                                                                 hip
                                    Research &                                      bodies
machines




                                                                    s
                                   Curator Team                                          media




                          systems & services


                                                                           M
                                                ser ation



                                                                ns
                          ret h &
                                  al




                                                       es
                                           g
                     l




                                                                        /ST
                                                                   o
           se oria




                             riev
                   s




                                                    vic
                              rc



                                       etin




                                                             ibiti
                 ce




                                                    v
                          sea




                                                s er




                                                                       eIS
                                                            exh
              rvi
             rat




                                               pre
                                     rk
           cu




                                  ma
Training & development:
               seminars, conferences, events,
               ‘Digital Conversations’
     Extend    + ‘Tooling up’



 Collaborate   Digital Scholarship:
               horizon scanning, Tech Watch
               communities of practice, consortia

Consolidate
               Digital Curation as
               collaborative process:
               acquisitions, workflows, tools,
               project management, funding,
               exhibitions & marketing
Europeana – SB Berlin                    User generated content
 Centenary of the outbreak of the First      Roadshows in 10 countries to create
  World War                                    unique pan-European archive
 Will create a European corpus of            Preston event produced more than
  digitised materials concerning the First     2300 images from letters, diaries,
  World War in all its aspects                 medals, pictures, trench art, and more
 Will contribute to Europeana a
  substantial collection of more than
  400,000 outstanding sources
British Newspaper Archive                      Google Books
   British Library and brightsolid online        A 6 year project starting June 2011
    publishing                                    250,000 Books, 1700-1870
   Up to 40 million newspaper pages from         From the French Revolution to the
    the British Library's collection over 10
                                                   end of slavery.
    years
   Collection includes runs of most
                                                  Material in major European languages
    newspapers published in the UK since          Focus on books that are not yet
    1800                                           freely available in digital form online
   Over 4m pages added since launch              Access via Google Books and BL
                                                  Storage at Google and BL
                                                  Contract and terms available on the
                                                   web!
Broadcast News                                IMPACT Historic Text
  TV & radio news receivable in the UK,          Improve the digital accessibility of
   since May 2010, e.g. Al-Jazeera English,        printed text produced before 1900
   CNN, France 24, Russia Today
                                                   OCR does not produce
  Search subtitles (where available)
                                                     satisfactory results for old books,
  AHRC-funded project looking at                    magazines and newspapers
   speech-to-text technologies for
   opening up audio and video archives             Historic material have archaic
  Project will index 3,00 hours of TV               fonts, complex layouts, warped or
   news and 3,000 hours of radio content             degraded pages
                                                   Manual post-correction is slow and
                                                     expensive
   Early music on-line: digitised 300 volumes (21k
        images) of rare early printed music from the
        British Library’s collections
       Open educational licence encourages use and re-
        purposing of content and embedding in teaching
        and research
       Detailed inventories of the books’ contents
        created for the first time, with access points for
        composer and title
       Data included in British Library catalogue, COPAC
        and RISM music database, with links to digitised
        content
       Digital images provided to Aruspix, which is
        developing an OCR and transcription tool for
        early music

www.earlymusiconline.org
Personal digital archives                  Web archives
   Data analysis beyond documents            Create a research collection of UK
   Use computer forensics                     websites
   Capture, management, description,         Develop high-impact data analytical
    and preservation of personal digital       access services
    collections to facilitate access and      Demonstrate the potential of domain
    analysis                                   level web archives, or the “haystacks”
   Archives range from poets (W Cope)        UK web domain > 9m .uk domain
    and playwrights (H Pinter) to              names
    computer scientists (D Michie) and        Estimate 110TB/crawl
    biologists
    Goal                          • Approach
     Builds on previous crowd-        • Accessible and convenient application
      sourcing projects, e.g. UK       • Immediate results and feedback
      SoundMap
                                       • Competitive tools
     Addressed key challenges –
      awareness, engagement,           • Recognition and visible contribution
      productivity at scale
What is georeferencing?




Ordnance Surveyors Drawing 40 (detail). Pen and Ink on paper. 1801.
British Library, Maps OSD 40(3).
sults:
725 maps assigned spatial metadata over 5 days
Publicity minimal – social media key
~90 participants, top five completed half the work
Data quality good: <3% had errors
T-Pen Transcription UI http://www.youtube.com/watch?v=sOnJtWtCFZc
   Evolution by projects and
    commercial ties tends to
    reduce interoperability and
    inconveniences the
    researcher


   International collaborations,
    such as International Image
    Interoperability Framework,
    seek a shared canvas
ARROW project - a tool to assist ‘diligent
  search’ and provide faster answers to:
  Rights status? – Rightsholders? – Can I digitise?

2008              2009              2010              2011               2012                2013


                       ARROW

        29 Partners                                                  ARROW Plus
        Libraries, BIP, Reprographic Rights
         Organisation (UK)                              36 Partners
        12 countries                                   14 countries
           (Austria, Denmark, France, Finland,               (Austria, Belgium, Bulgaria, Germany,
           Germany, Italy, the Netherlands, Norway,          Greece, Hungary, Ireland, Italy, Latvia,
           Slovenia, Spain,                                  Lithuania, the Netherlands, Poland,
           Sweden, UK)                                       Portugal, Spain)
        Pilots: Germany, France; Spain; UK                 Books and images in books
        Books only



                                                                                                   21
ARROW benefits
   Automated (where it can be – still some manual processes)
   Therefore saves time and cost




                     ARROW search = 5 % of Manual search time


       National partners working together across different sectors
       Domain partners working together across countries
                                                                      22
Persistent enquiry: can I use this?
   Open Knowledge Foundation


   Creative Commons Licenses



   Persistent URLs
   Six decades into the computer revolution,
    four decades since the invention of the microprocessor,
    and
    two decades into the rise of the modern internet, all of
    the technology required to transform industries through
    software finally works and can be delivered at global
    scale.

      Marc Andreessen ‘Why software is eating the world’
      Wall Street Journal August 20 2011
   Our vision: In 2020, the British Library will be a leading
    hub in the global information network, advancing
    knowledge through our collections, expertise and
    partnerships, for the benefit of the economy and society
    and the enrichment of cultural life.

   If Andreessen is right, we may not be talking in 2020
    about digital libraries and digital curators but an agency
    for the curation and creation of software.
@chrisleeclark




                 www.bl.uk

Contenu connexe

Similaire à Digital Berkshire, April 2012: Chris Clark, British Library PT#2

Rolf Källman Culture Cloud Fränsta 18-20 feb 2014
Rolf Källman Culture Cloud Fränsta 18-20 feb 2014Rolf Källman Culture Cloud Fränsta 18-20 feb 2014
Rolf Källman Culture Cloud Fränsta 18-20 feb 2014Digisam
 
Mapping Online Publics: Understanding the Role of Twitter in Public Communica...
Mapping Online Publics: Understanding the Role of Twitter in Public Communica...Mapping Online Publics: Understanding the Role of Twitter in Public Communica...
Mapping Online Publics: Understanding the Role of Twitter in Public Communica...Axel Bruns
 
Towards a Comprehensive Picture of the Australian Twittersphere
Towards a Comprehensive Picture of the Australian TwittersphereTowards a Comprehensive Picture of the Australian Twittersphere
Towards a Comprehensive Picture of the Australian TwittersphereAxel Bruns
 
2012 oct 22 shaping access presentation_alt
2012 oct 22  shaping access presentation_alt2012 oct 22  shaping access presentation_alt
2012 oct 22 shaping access presentation_altEuropeana
 
Advances in Digital Scholarship
Advances in Digital ScholarshipAdvances in Digital Scholarship
Advances in Digital ScholarshipDavid De Roure
 
European smart cities and smart city projects in user empowered innovation ec...
European smart cities and smart city projects in user empowered innovation ec...European smart cities and smart city projects in user empowered innovation ec...
European smart cities and smart city projects in user empowered innovation ec...European Network of Living Labs (ENoLL)
 
Trends & Momentum in Events - 2013
Trends & Momentum in Events - 2013Trends & Momentum in Events - 2013
Trends & Momentum in Events - 2013Randle Stonier
 
Transnational Landscape De Groote Heide: Smart Forest
Transnational Landscape De Groote Heide: Smart ForestTransnational Landscape De Groote Heide: Smart Forest
Transnational Landscape De Groote Heide: Smart ForestVenturespring
 
Looking forward to 2023: What is the state of telecentres
Looking forward to 2023: What is the state of telecentresLooking forward to 2023: What is the state of telecentres
Looking forward to 2023: What is the state of telecentresIsmael Peña-López
 
London Ict E Bulletin Sep 2009
London Ict E Bulletin Sep 2009London Ict E Bulletin Sep 2009
London Ict E Bulletin Sep 2009Voscur Staff
 
Social Media as Part of the Wider Mediasphere
Social Media as Part of the Wider MediasphereSocial Media as Part of the Wider Mediasphere
Social Media as Part of the Wider MediasphereAxel Bruns
 
Simon Tanner Keynote for Kultura 2.0, Poland
Simon Tanner Keynote for Kultura 2.0, PolandSimon Tanner Keynote for Kultura 2.0, Poland
Simon Tanner Keynote for Kultura 2.0, PolandSimon Tanner
 
Building a Framework for Semantic Cultural Heritage Data
Building a Framework for Semantic Cultural Heritage DataBuilding a Framework for Semantic Cultural Heritage Data
Building a Framework for Semantic Cultural Heritage DataValentine Charles
 
Strategy 2015-2020 transcript - Europeana Network AGM - Harry Verwayen - 2 De...
Strategy 2015-2020 transcript - Europeana Network AGM - Harry Verwayen - 2 De...Strategy 2015-2020 transcript - Europeana Network AGM - Harry Verwayen - 2 De...
Strategy 2015-2020 transcript - Europeana Network AGM - Harry Verwayen - 2 De...Europeana
 
Audiovisual Content Exploitation at FIA 15042010 NISV
Audiovisual Content Exploitation at FIA 15042010 NISVAudiovisual Content Exploitation at FIA 15042010 NISV
Audiovisual Content Exploitation at FIA 15042010 NISVroelandordelman.nl
 
Long-Term Engagement.
Long-Term Engagement. Long-Term Engagement.
Long-Term Engagement. Alan Dix
 

Similaire à Digital Berkshire, April 2012: Chris Clark, British Library PT#2 (20)

Rolf Källman Culture Cloud Fränsta 18-20 feb 2014
Rolf Källman Culture Cloud Fränsta 18-20 feb 2014Rolf Källman Culture Cloud Fränsta 18-20 feb 2014
Rolf Källman Culture Cloud Fränsta 18-20 feb 2014
 
Smart City Conference Seville A. Garcia (ENoLL)
Smart City Conference Seville A. Garcia (ENoLL)Smart City Conference Seville A. Garcia (ENoLL)
Smart City Conference Seville A. Garcia (ENoLL)
 
Mapping Online Publics: Understanding the Role of Twitter in Public Communica...
Mapping Online Publics: Understanding the Role of Twitter in Public Communica...Mapping Online Publics: Understanding the Role of Twitter in Public Communica...
Mapping Online Publics: Understanding the Role of Twitter in Public Communica...
 
Being Social
Being SocialBeing Social
Being Social
 
Towards a Comprehensive Picture of the Australian Twittersphere
Towards a Comprehensive Picture of the Australian TwittersphereTowards a Comprehensive Picture of the Australian Twittersphere
Towards a Comprehensive Picture of the Australian Twittersphere
 
2012 oct 22 shaping access presentation_alt
2012 oct 22  shaping access presentation_alt2012 oct 22  shaping access presentation_alt
2012 oct 22 shaping access presentation_alt
 
Advances in Digital Scholarship
Advances in Digital ScholarshipAdvances in Digital Scholarship
Advances in Digital Scholarship
 
European smart cities and smart city projects in user empowered innovation ec...
European smart cities and smart city projects in user empowered innovation ec...European smart cities and smart city projects in user empowered innovation ec...
European smart cities and smart city projects in user empowered innovation ec...
 
Trends & Momentum in Events - 2013
Trends & Momentum in Events - 2013Trends & Momentum in Events - 2013
Trends & Momentum in Events - 2013
 
Transnational Landscape De Groote Heide: Smart Forest
Transnational Landscape De Groote Heide: Smart ForestTransnational Landscape De Groote Heide: Smart Forest
Transnational Landscape De Groote Heide: Smart Forest
 
Looking forward to 2023: What is the state of telecentres
Looking forward to 2023: What is the state of telecentresLooking forward to 2023: What is the state of telecentres
Looking forward to 2023: What is the state of telecentres
 
London Ict E Bulletin Sep 2009
London Ict E Bulletin Sep 2009London Ict E Bulletin Sep 2009
London Ict E Bulletin Sep 2009
 
Multimedia application in libraries gaurav boudh
Multimedia application in libraries gaurav boudhMultimedia application in libraries gaurav boudh
Multimedia application in libraries gaurav boudh
 
Social Media as Part of the Wider Mediasphere
Social Media as Part of the Wider MediasphereSocial Media as Part of the Wider Mediasphere
Social Media as Part of the Wider Mediasphere
 
Simon Tanner Keynote for Kultura 2.0, Poland
Simon Tanner Keynote for Kultura 2.0, PolandSimon Tanner Keynote for Kultura 2.0, Poland
Simon Tanner Keynote for Kultura 2.0, Poland
 
Building a Framework for Semantic Cultural Heritage Data
Building a Framework for Semantic Cultural Heritage DataBuilding a Framework for Semantic Cultural Heritage Data
Building a Framework for Semantic Cultural Heritage Data
 
Strategy 2015-2020 transcript - Europeana Network AGM - Harry Verwayen - 2 De...
Strategy 2015-2020 transcript - Europeana Network AGM - Harry Verwayen - 2 De...Strategy 2015-2020 transcript - Europeana Network AGM - Harry Verwayen - 2 De...
Strategy 2015-2020 transcript - Europeana Network AGM - Harry Verwayen - 2 De...
 
Audiovisual Content Exploitation at FIA 15042010 NISV
Audiovisual Content Exploitation at FIA 15042010 NISVAudiovisual Content Exploitation at FIA 15042010 NISV
Audiovisual Content Exploitation at FIA 15042010 NISV
 
Long-Term Engagement.
Long-Term Engagement. Long-Term Engagement.
Long-Term Engagement.
 
UCL's Media Adventure
UCL's Media AdventureUCL's Media Adventure
UCL's Media Adventure
 

Plus de Berkshire Digital

Berkshire Digital - Teen Tech: Inspiring the Next Genearation - 2013 event dates
Berkshire Digital - Teen Tech: Inspiring the Next Genearation - 2013 event datesBerkshire Digital - Teen Tech: Inspiring the Next Genearation - 2013 event dates
Berkshire Digital - Teen Tech: Inspiring the Next Genearation - 2013 event datesBerkshire Digital
 
Berkshire Digital - Teen tech: inspiring the next generation
Berkshire Digital - Teen tech: inspiring the next generationBerkshire Digital - Teen tech: inspiring the next generation
Berkshire Digital - Teen tech: inspiring the next generationBerkshire Digital
 
Berkshire Digital - The Future of Ecommerce: 2013 and beyond
Berkshire Digital - The Future of Ecommerce: 2013 and beyondBerkshire Digital - The Future of Ecommerce: 2013 and beyond
Berkshire Digital - The Future of Ecommerce: 2013 and beyondBerkshire Digital
 
Digital Berkshire, October 2012: Emerging Mobile Trends
Digital Berkshire, October 2012: Emerging Mobile TrendsDigital Berkshire, October 2012: Emerging Mobile Trends
Digital Berkshire, October 2012: Emerging Mobile TrendsBerkshire Digital
 
Digital Berkshire, July 2012: Dr James Morris
Digital Berkshire, July 2012: Dr James MorrisDigital Berkshire, July 2012: Dr James Morris
Digital Berkshire, July 2012: Dr James MorrisBerkshire Digital
 
Digital Berkshire, April 2012: Chris Clark, British Library PT#1
Digital Berkshire, April 2012: Chris Clark, British Library PT#1Digital Berkshire, April 2012: Chris Clark, British Library PT#1
Digital Berkshire, April 2012: Chris Clark, British Library PT#1Berkshire Digital
 
January 2012 Digital Berkshire - Walking the Walk - Digital Marketing at Blac...
January 2012 Digital Berkshire - Walking the Walk - Digital Marketing at Blac...January 2012 Digital Berkshire - Walking the Walk - Digital Marketing at Blac...
January 2012 Digital Berkshire - Walking the Walk - Digital Marketing at Blac...Berkshire Digital
 

Plus de Berkshire Digital (7)

Berkshire Digital - Teen Tech: Inspiring the Next Genearation - 2013 event dates
Berkshire Digital - Teen Tech: Inspiring the Next Genearation - 2013 event datesBerkshire Digital - Teen Tech: Inspiring the Next Genearation - 2013 event dates
Berkshire Digital - Teen Tech: Inspiring the Next Genearation - 2013 event dates
 
Berkshire Digital - Teen tech: inspiring the next generation
Berkshire Digital - Teen tech: inspiring the next generationBerkshire Digital - Teen tech: inspiring the next generation
Berkshire Digital - Teen tech: inspiring the next generation
 
Berkshire Digital - The Future of Ecommerce: 2013 and beyond
Berkshire Digital - The Future of Ecommerce: 2013 and beyondBerkshire Digital - The Future of Ecommerce: 2013 and beyond
Berkshire Digital - The Future of Ecommerce: 2013 and beyond
 
Digital Berkshire, October 2012: Emerging Mobile Trends
Digital Berkshire, October 2012: Emerging Mobile TrendsDigital Berkshire, October 2012: Emerging Mobile Trends
Digital Berkshire, October 2012: Emerging Mobile Trends
 
Digital Berkshire, July 2012: Dr James Morris
Digital Berkshire, July 2012: Dr James MorrisDigital Berkshire, July 2012: Dr James Morris
Digital Berkshire, July 2012: Dr James Morris
 
Digital Berkshire, April 2012: Chris Clark, British Library PT#1
Digital Berkshire, April 2012: Chris Clark, British Library PT#1Digital Berkshire, April 2012: Chris Clark, British Library PT#1
Digital Berkshire, April 2012: Chris Clark, British Library PT#1
 
January 2012 Digital Berkshire - Walking the Walk - Digital Marketing at Blac...
January 2012 Digital Berkshire - Walking the Walk - Digital Marketing at Blac...January 2012 Digital Berkshire - Walking the Walk - Digital Marketing at Blac...
January 2012 Digital Berkshire - Walking the Walk - Digital Marketing at Blac...
 

Dernier

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 

Dernier (20)

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 

Digital Berkshire, April 2012: Chris Clark, British Library PT#2

  • 1.
  • 2. ● Scale & materiality – Not individual, standard documents but vast collections of them; authenticity demands multiplicity of versions ● Cost – Preservation not by individuals but large organizations ● Intellectual Property – If content worth saving someone is making money from it
  • 3. http://www.economist.com/node/15557443 http://youtube-global.blogspot .com/2011/05/thanks-youtube-community-for-two-big.ht
  • 4. “The challenge for libraries is to find ways to preserve platform dependent digital works and to prevent the loss of complex digital media…. Since we cannot possibly save everything, we need to carefully consider which digital materials are the most important to preserve and try to anticipate the needs of future scholars and researchers” Marlene Manoff, 2006  If preservation priority is X and user need is Y, what are the values of X and Y?
  • 5. If sustainability means that information is kept useful and available, then the LOCKSS approach has real merit! It implies that SERVICES must be preserved as well.
  • 6. Abundance of stored content: attention is scarce & must be earned Services Content
  • 7. platforms to focus in 2012+  Maintain active presence on  Continue to assess
  • 8. Digital Research & Curator Team digital commercia l scholars pa academic rt n social digital consortia ers ers networks Digital funding us hip Research & bodies machines s Curator Team media systems & services M ser ation ns ret h & al es g l /ST o se oria riev s vic rc etin ibiti ce v sea s er eIS exh rvi rat pre rk cu ma
  • 9. Training & development: seminars, conferences, events, ‘Digital Conversations’ Extend + ‘Tooling up’ Collaborate Digital Scholarship: horizon scanning, Tech Watch communities of practice, consortia Consolidate Digital Curation as collaborative process: acquisitions, workflows, tools, project management, funding, exhibitions & marketing
  • 10. Europeana – SB Berlin User generated content  Centenary of the outbreak of the First  Roadshows in 10 countries to create World War unique pan-European archive  Will create a European corpus of  Preston event produced more than digitised materials concerning the First 2300 images from letters, diaries, World War in all its aspects medals, pictures, trench art, and more  Will contribute to Europeana a substantial collection of more than 400,000 outstanding sources
  • 11. British Newspaper Archive Google Books  British Library and brightsolid online  A 6 year project starting June 2011 publishing  250,000 Books, 1700-1870  Up to 40 million newspaper pages from  From the French Revolution to the the British Library's collection over 10 end of slavery. years  Collection includes runs of most  Material in major European languages newspapers published in the UK since  Focus on books that are not yet 1800 freely available in digital form online  Over 4m pages added since launch  Access via Google Books and BL  Storage at Google and BL  Contract and terms available on the web!
  • 12. Broadcast News IMPACT Historic Text  TV & radio news receivable in the UK,  Improve the digital accessibility of since May 2010, e.g. Al-Jazeera English, printed text produced before 1900 CNN, France 24, Russia Today  OCR does not produce  Search subtitles (where available) satisfactory results for old books,  AHRC-funded project looking at magazines and newspapers speech-to-text technologies for opening up audio and video archives  Historic material have archaic  Project will index 3,00 hours of TV fonts, complex layouts, warped or news and 3,000 hours of radio content degraded pages  Manual post-correction is slow and expensive
  • 13. Early music on-line: digitised 300 volumes (21k images) of rare early printed music from the British Library’s collections  Open educational licence encourages use and re- purposing of content and embedding in teaching and research  Detailed inventories of the books’ contents created for the first time, with access points for composer and title  Data included in British Library catalogue, COPAC and RISM music database, with links to digitised content  Digital images provided to Aruspix, which is developing an OCR and transcription tool for early music www.earlymusiconline.org
  • 14. Personal digital archives Web archives  Data analysis beyond documents  Create a research collection of UK  Use computer forensics websites  Capture, management, description,  Develop high-impact data analytical and preservation of personal digital access services collections to facilitate access and  Demonstrate the potential of domain analysis level web archives, or the “haystacks”  Archives range from poets (W Cope)  UK web domain > 9m .uk domain and playwrights (H Pinter) to names computer scientists (D Michie) and  Estimate 110TB/crawl biologists
  • 15. Goal • Approach  Builds on previous crowd- • Accessible and convenient application sourcing projects, e.g. UK • Immediate results and feedback SoundMap • Competitive tools  Addressed key challenges – awareness, engagement, • Recognition and visible contribution productivity at scale
  • 16. What is georeferencing? Ordnance Surveyors Drawing 40 (detail). Pen and Ink on paper. 1801. British Library, Maps OSD 40(3).
  • 17. sults: 725 maps assigned spatial metadata over 5 days Publicity minimal – social media key ~90 participants, top five completed half the work Data quality good: <3% had errors
  • 18.
  • 19. T-Pen Transcription UI http://www.youtube.com/watch?v=sOnJtWtCFZc
  • 20. Evolution by projects and commercial ties tends to reduce interoperability and inconveniences the researcher  International collaborations, such as International Image Interoperability Framework, seek a shared canvas
  • 21. ARROW project - a tool to assist ‘diligent search’ and provide faster answers to: Rights status? – Rightsholders? – Can I digitise? 2008 2009 2010 2011 2012 2013 ARROW  29 Partners ARROW Plus  Libraries, BIP, Reprographic Rights Organisation (UK)  36 Partners  12 countries  14 countries (Austria, Denmark, France, Finland, (Austria, Belgium, Bulgaria, Germany, Germany, Italy, the Netherlands, Norway, Greece, Hungary, Ireland, Italy, Latvia, Slovenia, Spain, Lithuania, the Netherlands, Poland, Sweden, UK) Portugal, Spain)  Pilots: Germany, France; Spain; UK  Books and images in books  Books only 21
  • 22. ARROW benefits  Automated (where it can be – still some manual processes)  Therefore saves time and cost ARROW search = 5 % of Manual search time  National partners working together across different sectors  Domain partners working together across countries 22
  • 23. Persistent enquiry: can I use this?  Open Knowledge Foundation  Creative Commons Licenses  Persistent URLs
  • 24. Six decades into the computer revolution, four decades since the invention of the microprocessor, and two decades into the rise of the modern internet, all of the technology required to transform industries through software finally works and can be delivered at global scale. Marc Andreessen ‘Why software is eating the world’ Wall Street Journal August 20 2011
  • 25. Our vision: In 2020, the British Library will be a leading hub in the global information network, advancing knowledge through our collections, expertise and partnerships, for the benefit of the economy and society and the enrichment of cultural life.  If Andreessen is right, we may not be talking in 2020 about digital libraries and digital curators but an agency for the curation and creation of software.
  • 26. @chrisleeclark www.bl.uk

Notes de l'éditeur

  1. More than 200 people poured into the Museum of Lancashire in Preston at the weekend to have their loved-one’s precious items digitised for the virtual archive www.europeana1914-1918.eu/ Queuing began an hour before doors opened on Saturday (10.03.12) – and the crowds continued to stream in until they closed nine hours later. Many people had travelled from as far afield as Leeds, Manchester, Birkenhead, Liverpool and Warrington just to be there. More than 2,300 images were taken of a wide variety of items, including: letters, diaries, medals, birth and death certificates, nurses’ autograph books, cartoons, pictures and trench art – everyday objects made from anything the soldiers found, such as shell casings and spent ammunition. The Preston roadshow is the latest in a series that is being rolled-out across 10 countries in Europe this year to create a unique pan-European account of WW1 that is available to everyone. Europeana 1914-1918 brings together a partnership of libraries, museums, academic and cultural institutions, which in the UK includes the British Library, Oxford University, JISC and Lancashire County Council. Page
  2. Commercial partners: + it gets the job done - It can lock up information in silos while investment is recouped. The first works to be digitised by Google will range from feminist pamphlets about Queen Marie-Antoinette (1791), to the invention of the first combustion engine-driven submarine (1858), and an account of a stuffed Hippopotamus owned by the Prince of Orange (1775). Page
  3. Luke McKernan (Lead Curator, Moving Image) and AHRC Page
  4. Page
  5. Showing relationships between  Target, Subject, Event, Special Collection. The Targets are colour coded to high level subject area and the size of the node represents the number of target instances. The UK web domain 9 million .uk domain names registered in December 2010 ~ 1 million using other domain names Growing at 11% - 14% per year 40% estimated to be in scope for Legal Deposit Estimated ~110TB each UK domain crawl Traditional “document-centric” approach does not scale up - canonical mission of heritage institutions being challenged Many technical challenges – the constant need to respond to the evolving web Harvests are at best snapshots or samples cannot get everything: resource and legal constraints; Crawler works well with HTML but struggles to capture advanced web content, e.g. rich media, dynamic and interactive content Rendering software does not always “replay” the archived content Cannot reply streaming media Risks of “republishing” – libel, copyright Legal Deposit offers some protection but access restricted to premises of LD institutions Page
  6. This work was only continued in England when the threat of attack by France loomed, with the start of the Napoleonic Wars in 1795. Started on a scale of six-inches-to-the-mile for the south coast, which was then reduced at a later point. Thenceforth all mappong to be done by Board of Ordnance. Done by 1815. Upon this was based the original 1-inch maps of England. I’ve taken a detail of Exeter
  7. These geo-tools allow historic maps to be overlaid and combined with modern mapping, enhancing the ability to compare and analyse the representation, and enabling searching by placename.