SlideShare une entreprise Scribd logo
1  sur  23
Goobi in the Wellcome Library

      Digitisation Roadshow, Linz, Feb 2013

                 Dave Thompson
        Digital Curator, Wellcome Library
Goobi in the Wellcome Library
  •  In production March 2012.
  •  6 Servers running Goobi – test & production.
  •  11 staff users, some part time.
  •  1.2 million images processed & available via Library website.
  •  Can upload maximum of <1000 objects into SDB per 24 hrs.
  •  Total space allocated to Goobi is 40tb.
Digitising books can be boring…
…there isn’t much to see...
…but we have done more than just text.
A strategic approach

     •  Library transformation strategy, physical to digital.
     •  From ‘project’ to ‘production’.
     •  Digitisation as a sustainable end-to-end process.
     •  18 month pilot/implementation project.
     •  Just taken into production.
Diverse sources of content

     •  In-house digitisation.
     •  External contractors.
     •  Contractors working in-house.
     •  External organisations digitising their content for
        us.
Where did Goobi come from?

     •  Late 2010 early 2011 as plans for developing SDB
        grew realised that we needed a means of mass
        import of digital content.
     •  Began to think about high volume production &
        the management of that.
     •  Early modelling of our systems suggested that we
        needed a tool to manage production of content.
     •  Began looking at workflow tracking systems.
Needed to use existing Library tools
Perceived benefits of Goobi

     •  Web based distributed access to concurrent
        users.
     •  Flexible workflow based processing, managed
        through ‘Projects’.
     •  Workflow process enforced, ensures accuracy &
        efficiency.
     •  Adaptable to different types of content.
     •  Initiates & manages esternal processes via
        Intranda task manager (ITM).
     •  METS as basis of access & access control.
Rapid evolution of Goobi

     •  Goobi we have now quite different to what we
        bought.
     •  Initial configuration to import MARC XML DMD &
        to automate ingest into SDB.
     •  Initially Goobi didn’t scale to met our ambition.
     •  Initial install monolithic, now running Goobi as
        distributed services.
     •  Developed new features with Intranda, e.g.
        Jpylyzation.
Working with DMD

    •  Upload MARC XML DMD exported from Sierra
       using standard Goobi features.
    •  MARC fields edited to provide a consistent Goobi
       process title, e.g. using shelf mark.
    •  MARC Leader 6 field identifies content type, e.g
       ‘Archive’ or ‘Monograph’.
    •  Content ‘type’ used by Goobi to set default METS
       access conditions.
    •  DMD not delivered to end user, that comes from
       live catalogue.
Uploading content

     •  Content upload using the Sync2Goobi Tool for
        bulk import.
     •  Drag ‘n drop interface.
     •  Can be either TIFF or JP2.
     •  Project based workflow templates manage either
        format.
     •  Use Goobi Mount Tool (GMT) to access/manage
        content already uploaded.
Using METS Editor

     •  Main point of human interaction with Goobi. Goobi
        automates METS creation.
     •  METS basis for access control & usage conditions
        for material.
     •  Basis for retrieval of content from SDB by using
        SDB PUIDs.
     •  Goobi automates ingest of content into SDB &
        receives AMD in return.
How we use METS

    •  Setting material type & default values for access
       based on DMD.
    •  Access restrictions can be at the item level.
    •  DMD in METS not delivered to end user, serves
       only to help a human identify content when
       snagging.
Shared development

     •  Wellcome Trust is not a development house. Rely
        on Intranda to provide development support.
     •  Developed specifc requirememnts for extensions
        to Goobi, e.g. Jpylyser for JPEG2000 validation.
     •  Development proposals from both sides. We have
        idea, Intranda helps us make that idea a reality.
     •  Benefit from community developments
        commissioned by others.
Additional Tools

     •  Lurawave for converting TIFF to JPEG2000.
     •  Jpylyzer for validating JPEG2000 files.
     •  Sync2Goobi Tool for bulk upload of content.
     •  Goobi Mount Tool/MS Windows File Explorer for
        access to ‘Home’ folders.
Goobi – the future

     •  Built in OCR & creation of ALTO files.
     •  Further refinement of Sync2Goobi Tool.
     •  Further development/integration of validation
        tools.
     •  Integration of ftp with Goobi for 3rd party direct
        upload of content.
     •  Establishment of separate database server for
        Goobi.
Lessons learned - systems

     •  We were ambitious but underestimated what
        capacity we would require.
     •  Underestimated storage requirements.
     •  Underestimated the desirability of high levels of
        automation.
     •  Focus human interaction at as few points as
        possible.
Lessons learned - Intranda

     •  Have relied heavily on input & support from
        Intranda.
     •  Share information with Intranda & trust them to
        provide answers.
     •  Be prepared to share development. But be
        prepared to accept some pain.
Lessons learned - Goobi

     •  In less than a year Goobi has become key to
        delivering the Library’s content.
     •  Centralised user activities in one system – Goobi
        – less to learn, more efficient.
     •  Streamline & automate. High volume efficient
        production essential.
     •  Streamline other digitisation & access processes
        to match Goobi.
     •  METS an efficient single place for access related
        metadata.
Thank you

Questions now, questions later…?

   Dave Thompson, Digital Curator
         Wellcome Library

       d.thompson@wellcome.ac.uk

         http://wellcomelibrary.org/

Contenu connexe

Similaire à Goobi in the Wellcome Library

Systems and Processes: making order out of chaos
Systems and Processes: making order out of chaosSystems and Processes: making order out of chaos
Systems and Processes: making order out of chaosWellcome Library
 
Wt dnt digitisation_open_day_v9
Wt dnt digitisation_open_day_v9Wt dnt digitisation_open_day_v9
Wt dnt digitisation_open_day_v9Wellcome Library
 
Building a Documentation Portal
Building a Documentation PortalBuilding a Documentation Portal
Building a Documentation Portalstc-siliconvalley
 
Connecting Intelligent Content with Micropublishing and Beyond
Connecting Intelligent Content with Micropublishing and BeyondConnecting Intelligent Content with Micropublishing and Beyond
Connecting Intelligent Content with Micropublishing and BeyondDon Day
 
Automate Hadoop Cluster Deployment in a Banking Ecosystem
Automate Hadoop Cluster Deployment in a Banking EcosystemAutomate Hadoop Cluster Deployment in a Banking Ecosystem
Automate Hadoop Cluster Deployment in a Banking EcosystemHellmar Becker
 
You Don't Need IT To Do That - The World of Outsourcing and SaaS
You Don't Need IT To Do That - The World of Outsourcing and SaaSYou Don't Need IT To Do That - The World of Outsourcing and SaaS
You Don't Need IT To Do That - The World of Outsourcing and SaaSKyle James
 
Untangling spring week2
Untangling spring week2Untangling spring week2
Untangling spring week2Derek Jacoby
 
Tips for a successful SharePoint Migration strategy
Tips for a successful SharePoint Migration strategyTips for a successful SharePoint Migration strategy
Tips for a successful SharePoint Migration strategyDon Daubert
 
2015 WritersUA Sourcing Graphics
2015 WritersUA Sourcing Graphics2015 WritersUA Sourcing Graphics
2015 WritersUA Sourcing GraphicsMary Connor
 
Mongo DB for Java, Python and PHP Developers
Mongo DB for Java, Python and PHP DevelopersMongo DB for Java, Python and PHP Developers
Mongo DB for Java, Python and PHP DevelopersRick Hightower
 
Targeted documentation STC Houston, Mar 20, 2012
Targeted documentation   STC Houston, Mar 20, 2012Targeted documentation   STC Houston, Mar 20, 2012
Targeted documentation STC Houston, Mar 20, 2012STC_Houston
 
A Tale from the Upstream Path
A Tale from the Upstream PathA Tale from the Upstream Path
A Tale from the Upstream PathTesora
 
How Not to Be Conned by Your Drupal Vendor!
How Not to Be Conned by Your Drupal Vendor!How Not to Be Conned by Your Drupal Vendor!
How Not to Be Conned by Your Drupal Vendor!pixelonion
 
Everyone wants (someone else) to do it: writing documentation for open source...
Everyone wants (someone else) to do it: writing documentation for open source...Everyone wants (someone else) to do it: writing documentation for open source...
Everyone wants (someone else) to do it: writing documentation for open source...Jody Garnett
 
Content Management Systems and Refactoring - Drupal, WordPress and eZ Publish
Content Management Systems and Refactoring - Drupal, WordPress and eZ PublishContent Management Systems and Refactoring - Drupal, WordPress and eZ Publish
Content Management Systems and Refactoring - Drupal, WordPress and eZ PublishJani Tarvainen
 

Similaire à Goobi in the Wellcome Library (20)

Systems and Processes: making order out of chaos
Systems and Processes: making order out of chaosSystems and Processes: making order out of chaos
Systems and Processes: making order out of chaos
 
Dave's Wellcome Library digitisation presentation
Dave's Wellcome Library digitisation presentationDave's Wellcome Library digitisation presentation
Dave's Wellcome Library digitisation presentation
 
Wt dnt digitisation_open_day_v9
Wt dnt digitisation_open_day_v9Wt dnt digitisation_open_day_v9
Wt dnt digitisation_open_day_v9
 
Building a Documentation Portal
Building a Documentation PortalBuilding a Documentation Portal
Building a Documentation Portal
 
Connecting Intelligent Content with Micropublishing and Beyond
Connecting Intelligent Content with Micropublishing and BeyondConnecting Intelligent Content with Micropublishing and Beyond
Connecting Intelligent Content with Micropublishing and Beyond
 
Google Summer of Code 2011: UOC & Apertium
Google Summer of Code 2011: UOC & ApertiumGoogle Summer of Code 2011: UOC & Apertium
Google Summer of Code 2011: UOC & Apertium
 
Automate Hadoop Cluster Deployment in a Banking Ecosystem
Automate Hadoop Cluster Deployment in a Banking EcosystemAutomate Hadoop Cluster Deployment in a Banking Ecosystem
Automate Hadoop Cluster Deployment in a Banking Ecosystem
 
You Don't Need IT To Do That - The World of Outsourcing and SaaS
You Don't Need IT To Do That - The World of Outsourcing and SaaSYou Don't Need IT To Do That - The World of Outsourcing and SaaS
You Don't Need IT To Do That - The World of Outsourcing and SaaS
 
08 jorsek llc
08 jorsek llc08 jorsek llc
08 jorsek llc
 
Untangling spring week2
Untangling spring week2Untangling spring week2
Untangling spring week2
 
Bill McCoy氏:電子出版の将来展望
Bill McCoy氏:電子出版の将来展望Bill McCoy氏:電子出版の将来展望
Bill McCoy氏:電子出版の将来展望
 
Tips for a successful SharePoint Migration strategy
Tips for a successful SharePoint Migration strategyTips for a successful SharePoint Migration strategy
Tips for a successful SharePoint Migration strategy
 
2015 WritersUA Sourcing Graphics
2015 WritersUA Sourcing Graphics2015 WritersUA Sourcing Graphics
2015 WritersUA Sourcing Graphics
 
Mongo DB for Java, Python and PHP Developers
Mongo DB for Java, Python and PHP DevelopersMongo DB for Java, Python and PHP Developers
Mongo DB for Java, Python and PHP Developers
 
OS Accelerate London - 09/16/15
OS Accelerate London - 09/16/15OS Accelerate London - 09/16/15
OS Accelerate London - 09/16/15
 
Targeted documentation STC Houston, Mar 20, 2012
Targeted documentation   STC Houston, Mar 20, 2012Targeted documentation   STC Houston, Mar 20, 2012
Targeted documentation STC Houston, Mar 20, 2012
 
A Tale from the Upstream Path
A Tale from the Upstream PathA Tale from the Upstream Path
A Tale from the Upstream Path
 
How Not to Be Conned by Your Drupal Vendor!
How Not to Be Conned by Your Drupal Vendor!How Not to Be Conned by Your Drupal Vendor!
How Not to Be Conned by Your Drupal Vendor!
 
Everyone wants (someone else) to do it: writing documentation for open source...
Everyone wants (someone else) to do it: writing documentation for open source...Everyone wants (someone else) to do it: writing documentation for open source...
Everyone wants (someone else) to do it: writing documentation for open source...
 
Content Management Systems and Refactoring - Drupal, WordPress and eZ Publish
Content Management Systems and Refactoring - Drupal, WordPress and eZ PublishContent Management Systems and Refactoring - Drupal, WordPress and eZ Publish
Content Management Systems and Refactoring - Drupal, WordPress and eZ Publish
 

Plus de goobi_org

Leistungsvergleich Präsentationsoberflächen für digitale Sammlungen 2013
Leistungsvergleich Präsentationsoberflächen für digitale Sammlungen 2013Leistungsvergleich Präsentationsoberflächen für digitale Sammlungen 2013
Leistungsvergleich Präsentationsoberflächen für digitale Sammlungen 2013goobi_org
 
Dokumenten-Management der Herzogin Anna Amalia Bibliothek Weimar: Ziele und G...
Dokumenten-Management der Herzogin Anna Amalia Bibliothek Weimar: Ziele und G...Dokumenten-Management der Herzogin Anna Amalia Bibliothek Weimar: Ziele und G...
Dokumenten-Management der Herzogin Anna Amalia Bibliothek Weimar: Ziele und G...goobi_org
 
Gottfried Wilhelm Leibniz Bibliothek – Besonderheiten der Digitalen Sammlungen
Gottfried Wilhelm Leibniz Bibliothek – Besonderheiten der Digitalen SammlungenGottfried Wilhelm Leibniz Bibliothek – Besonderheiten der Digitalen Sammlungen
Gottfried Wilhelm Leibniz Bibliothek – Besonderheiten der Digitalen Sammlungengoobi_org
 
Staatsbibliothek zu Berlin: Neue Entwicklungen und Projekte
Staatsbibliothek zu Berlin: Neue Entwicklungen und ProjekteStaatsbibliothek zu Berlin: Neue Entwicklungen und Projekte
Staatsbibliothek zu Berlin: Neue Entwicklungen und Projektegoobi_org
 
Goobi-Anwendung an der UB Bielefeld
Goobi-Anwendung an der UB BielefeldGoobi-Anwendung an der UB Bielefeld
Goobi-Anwendung an der UB Bielefeldgoobi_org
 
FulDig - Fuldaer Digitalisierungsserver der Hochschul- und Landesbibliothek
FulDig - Fuldaer Digitalisierungsserver der Hochschul- und LandesbibliothekFulDig - Fuldaer Digitalisierungsserver der Hochschul- und Landesbibliothek
FulDig - Fuldaer Digitalisierungsserver der Hochschul- und Landesbibliothekgoobi_org
 
Aufbau des Digitalisierungsreferats der UB TU Berlin
Aufbau des Digitalisierungsreferats der UB TU BerlinAufbau des Digitalisierungsreferats der UB TU Berlin
Aufbau des Digitalisierungsreferats der UB TU Berlingoobi_org
 
Aktuelle "Baustellen" und Fragen - Goobi an der Stabi Hamburg
Aktuelle "Baustellen" und Fragen - Goobi an der Stabi HamburgAktuelle "Baustellen" und Fragen - Goobi an der Stabi Hamburg
Aktuelle "Baustellen" und Fragen - Goobi an der Stabi Hamburggoobi_org
 
Goobi an der UB Kassel - ORKA, Fortschritte und zukünftige Aufgaben
Goobi an der UB Kassel - ORKA, Fortschritte und zukünftige AufgabenGoobi an der UB Kassel - ORKA, Fortschritte und zukünftige Aufgaben
Goobi an der UB Kassel - ORKA, Fortschritte und zukünftige Aufgabengoobi_org
 
GEI digital - Aufbau einer fachlichen Digitalisierungsplattform für externe D...
GEI digital - Aufbau einer fachlichen Digitalisierungsplattform für externe D...GEI digital - Aufbau einer fachlichen Digitalisierungsplattform für externe D...
GEI digital - Aufbau einer fachlichen Digitalisierungsplattform für externe D...goobi_org
 
Goobi an der Univesitätsbibliothek Greifswald
Goobi an der Univesitätsbibliothek GreifswaldGoobi an der Univesitätsbibliothek Greifswald
Goobi an der Univesitätsbibliothek Greifswaldgoobi_org
 
Goobi in der Verbundzentrale des GBV
Goobi in der Verbundzentrale des GBVGoobi in der Verbundzentrale des GBV
Goobi in der Verbundzentrale des GBVgoobi_org
 
Goobi-Einsatz in der Zentral- und Landesbibliothek Berlin
Goobi-Einsatz in der Zentral- und Landesbibliothek BerlinGoobi-Einsatz in der Zentral- und Landesbibliothek Berlin
Goobi-Einsatz in der Zentral- und Landesbibliothek Berlingoobi_org
 
Hamburgensien digital – Goobi an der Stabi Hamburg
Hamburgensien digital – Goobi an der Stabi HamburgHamburgensien digital – Goobi an der Stabi Hamburg
Hamburgensien digital – Goobi an der Stabi Hamburggoobi_org
 
Goobi für alle(s).
Goobi für alle(s).Goobi für alle(s).
Goobi für alle(s).goobi_org
 
Goobi. Digitalisieren im Verein - Leipzig, 13.03.2013
Goobi. Digitalisieren im Verein - Leipzig, 13.03.2013Goobi. Digitalisieren im Verein - Leipzig, 13.03.2013
Goobi. Digitalisieren im Verein - Leipzig, 13.03.2013goobi_org
 
Digitalisierung in der Stabi Hamburg - zwischen Projekten und Routine
Digitalisierung in der Stabi Hamburg - zwischen Projekten und RoutineDigitalisierung in der Stabi Hamburg - zwischen Projekten und Routine
Digitalisierung in der Stabi Hamburg - zwischen Projekten und Routinegoobi_org
 
Goobi e.V.: Strukturen und Ergebnisse der Anwendergemeinschaft
Goobi e.V.: Strukturen und Ergebnisse der AnwendergemeinschaftGoobi e.V.: Strukturen und Ergebnisse der Anwendergemeinschaft
Goobi e.V.: Strukturen und Ergebnisse der Anwendergemeinschaftgoobi_org
 
Mit Goobi in die Deutsche Digitale Bibliothek
Mit Goobi in die Deutsche Digitale BibliothekMit Goobi in die Deutsche Digitale Bibliothek
Mit Goobi in die Deutsche Digitale Bibliothekgoobi_org
 

Plus de goobi_org (19)

Leistungsvergleich Präsentationsoberflächen für digitale Sammlungen 2013
Leistungsvergleich Präsentationsoberflächen für digitale Sammlungen 2013Leistungsvergleich Präsentationsoberflächen für digitale Sammlungen 2013
Leistungsvergleich Präsentationsoberflächen für digitale Sammlungen 2013
 
Dokumenten-Management der Herzogin Anna Amalia Bibliothek Weimar: Ziele und G...
Dokumenten-Management der Herzogin Anna Amalia Bibliothek Weimar: Ziele und G...Dokumenten-Management der Herzogin Anna Amalia Bibliothek Weimar: Ziele und G...
Dokumenten-Management der Herzogin Anna Amalia Bibliothek Weimar: Ziele und G...
 
Gottfried Wilhelm Leibniz Bibliothek – Besonderheiten der Digitalen Sammlungen
Gottfried Wilhelm Leibniz Bibliothek – Besonderheiten der Digitalen SammlungenGottfried Wilhelm Leibniz Bibliothek – Besonderheiten der Digitalen Sammlungen
Gottfried Wilhelm Leibniz Bibliothek – Besonderheiten der Digitalen Sammlungen
 
Staatsbibliothek zu Berlin: Neue Entwicklungen und Projekte
Staatsbibliothek zu Berlin: Neue Entwicklungen und ProjekteStaatsbibliothek zu Berlin: Neue Entwicklungen und Projekte
Staatsbibliothek zu Berlin: Neue Entwicklungen und Projekte
 
Goobi-Anwendung an der UB Bielefeld
Goobi-Anwendung an der UB BielefeldGoobi-Anwendung an der UB Bielefeld
Goobi-Anwendung an der UB Bielefeld
 
FulDig - Fuldaer Digitalisierungsserver der Hochschul- und Landesbibliothek
FulDig - Fuldaer Digitalisierungsserver der Hochschul- und LandesbibliothekFulDig - Fuldaer Digitalisierungsserver der Hochschul- und Landesbibliothek
FulDig - Fuldaer Digitalisierungsserver der Hochschul- und Landesbibliothek
 
Aufbau des Digitalisierungsreferats der UB TU Berlin
Aufbau des Digitalisierungsreferats der UB TU BerlinAufbau des Digitalisierungsreferats der UB TU Berlin
Aufbau des Digitalisierungsreferats der UB TU Berlin
 
Aktuelle "Baustellen" und Fragen - Goobi an der Stabi Hamburg
Aktuelle "Baustellen" und Fragen - Goobi an der Stabi HamburgAktuelle "Baustellen" und Fragen - Goobi an der Stabi Hamburg
Aktuelle "Baustellen" und Fragen - Goobi an der Stabi Hamburg
 
Goobi an der UB Kassel - ORKA, Fortschritte und zukünftige Aufgaben
Goobi an der UB Kassel - ORKA, Fortschritte und zukünftige AufgabenGoobi an der UB Kassel - ORKA, Fortschritte und zukünftige Aufgaben
Goobi an der UB Kassel - ORKA, Fortschritte und zukünftige Aufgaben
 
GEI digital - Aufbau einer fachlichen Digitalisierungsplattform für externe D...
GEI digital - Aufbau einer fachlichen Digitalisierungsplattform für externe D...GEI digital - Aufbau einer fachlichen Digitalisierungsplattform für externe D...
GEI digital - Aufbau einer fachlichen Digitalisierungsplattform für externe D...
 
Goobi an der Univesitätsbibliothek Greifswald
Goobi an der Univesitätsbibliothek GreifswaldGoobi an der Univesitätsbibliothek Greifswald
Goobi an der Univesitätsbibliothek Greifswald
 
Goobi in der Verbundzentrale des GBV
Goobi in der Verbundzentrale des GBVGoobi in der Verbundzentrale des GBV
Goobi in der Verbundzentrale des GBV
 
Goobi-Einsatz in der Zentral- und Landesbibliothek Berlin
Goobi-Einsatz in der Zentral- und Landesbibliothek BerlinGoobi-Einsatz in der Zentral- und Landesbibliothek Berlin
Goobi-Einsatz in der Zentral- und Landesbibliothek Berlin
 
Hamburgensien digital – Goobi an der Stabi Hamburg
Hamburgensien digital – Goobi an der Stabi HamburgHamburgensien digital – Goobi an der Stabi Hamburg
Hamburgensien digital – Goobi an der Stabi Hamburg
 
Goobi für alle(s).
Goobi für alle(s).Goobi für alle(s).
Goobi für alle(s).
 
Goobi. Digitalisieren im Verein - Leipzig, 13.03.2013
Goobi. Digitalisieren im Verein - Leipzig, 13.03.2013Goobi. Digitalisieren im Verein - Leipzig, 13.03.2013
Goobi. Digitalisieren im Verein - Leipzig, 13.03.2013
 
Digitalisierung in der Stabi Hamburg - zwischen Projekten und Routine
Digitalisierung in der Stabi Hamburg - zwischen Projekten und RoutineDigitalisierung in der Stabi Hamburg - zwischen Projekten und Routine
Digitalisierung in der Stabi Hamburg - zwischen Projekten und Routine
 
Goobi e.V.: Strukturen und Ergebnisse der Anwendergemeinschaft
Goobi e.V.: Strukturen und Ergebnisse der AnwendergemeinschaftGoobi e.V.: Strukturen und Ergebnisse der Anwendergemeinschaft
Goobi e.V.: Strukturen und Ergebnisse der Anwendergemeinschaft
 
Mit Goobi in die Deutsche Digitale Bibliothek
Mit Goobi in die Deutsche Digitale BibliothekMit Goobi in die Deutsche Digitale Bibliothek
Mit Goobi in die Deutsche Digitale Bibliothek
 

Dernier

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 

Dernier (20)

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 

Goobi in the Wellcome Library

  • 1. Goobi in the Wellcome Library Digitisation Roadshow, Linz, Feb 2013 Dave Thompson Digital Curator, Wellcome Library
  • 2. Goobi in the Wellcome Library •  In production March 2012. •  6 Servers running Goobi – test & production. •  11 staff users, some part time. •  1.2 million images processed & available via Library website. •  Can upload maximum of <1000 objects into SDB per 24 hrs. •  Total space allocated to Goobi is 40tb.
  • 3. Digitising books can be boring…
  • 5. …but we have done more than just text.
  • 6. A strategic approach •  Library transformation strategy, physical to digital. •  From ‘project’ to ‘production’. •  Digitisation as a sustainable end-to-end process. •  18 month pilot/implementation project. •  Just taken into production.
  • 7. Diverse sources of content •  In-house digitisation. •  External contractors. •  Contractors working in-house. •  External organisations digitising their content for us.
  • 8. Where did Goobi come from? •  Late 2010 early 2011 as plans for developing SDB grew realised that we needed a means of mass import of digital content. •  Began to think about high volume production & the management of that. •  Early modelling of our systems suggested that we needed a tool to manage production of content. •  Began looking at workflow tracking systems.
  • 9. Needed to use existing Library tools
  • 10. Perceived benefits of Goobi •  Web based distributed access to concurrent users. •  Flexible workflow based processing, managed through ‘Projects’. •  Workflow process enforced, ensures accuracy & efficiency. •  Adaptable to different types of content. •  Initiates & manages esternal processes via Intranda task manager (ITM). •  METS as basis of access & access control.
  • 11. Rapid evolution of Goobi •  Goobi we have now quite different to what we bought. •  Initial configuration to import MARC XML DMD & to automate ingest into SDB. •  Initially Goobi didn’t scale to met our ambition. •  Initial install monolithic, now running Goobi as distributed services. •  Developed new features with Intranda, e.g. Jpylyzation.
  • 12. Working with DMD •  Upload MARC XML DMD exported from Sierra using standard Goobi features. •  MARC fields edited to provide a consistent Goobi process title, e.g. using shelf mark. •  MARC Leader 6 field identifies content type, e.g ‘Archive’ or ‘Monograph’. •  Content ‘type’ used by Goobi to set default METS access conditions. •  DMD not delivered to end user, that comes from live catalogue.
  • 13. Uploading content •  Content upload using the Sync2Goobi Tool for bulk import. •  Drag ‘n drop interface. •  Can be either TIFF or JP2. •  Project based workflow templates manage either format. •  Use Goobi Mount Tool (GMT) to access/manage content already uploaded.
  • 14. Using METS Editor •  Main point of human interaction with Goobi. Goobi automates METS creation. •  METS basis for access control & usage conditions for material. •  Basis for retrieval of content from SDB by using SDB PUIDs. •  Goobi automates ingest of content into SDB & receives AMD in return.
  • 15. How we use METS •  Setting material type & default values for access based on DMD. •  Access restrictions can be at the item level. •  DMD in METS not delivered to end user, serves only to help a human identify content when snagging.
  • 16.
  • 17. Shared development •  Wellcome Trust is not a development house. Rely on Intranda to provide development support. •  Developed specifc requirememnts for extensions to Goobi, e.g. Jpylyser for JPEG2000 validation. •  Development proposals from both sides. We have idea, Intranda helps us make that idea a reality. •  Benefit from community developments commissioned by others.
  • 18. Additional Tools •  Lurawave for converting TIFF to JPEG2000. •  Jpylyzer for validating JPEG2000 files. •  Sync2Goobi Tool for bulk upload of content. •  Goobi Mount Tool/MS Windows File Explorer for access to ‘Home’ folders.
  • 19. Goobi – the future •  Built in OCR & creation of ALTO files. •  Further refinement of Sync2Goobi Tool. •  Further development/integration of validation tools. •  Integration of ftp with Goobi for 3rd party direct upload of content. •  Establishment of separate database server for Goobi.
  • 20. Lessons learned - systems •  We were ambitious but underestimated what capacity we would require. •  Underestimated storage requirements. •  Underestimated the desirability of high levels of automation. •  Focus human interaction at as few points as possible.
  • 21. Lessons learned - Intranda •  Have relied heavily on input & support from Intranda. •  Share information with Intranda & trust them to provide answers. •  Be prepared to share development. But be prepared to accept some pain.
  • 22. Lessons learned - Goobi •  In less than a year Goobi has become key to delivering the Library’s content. •  Centralised user activities in one system – Goobi – less to learn, more efficient. •  Streamline & automate. High volume efficient production essential. •  Streamline other digitisation & access processes to match Goobi. •  METS an efficient single place for access related metadata.
  • 23. Thank you Questions now, questions later…? Dave Thompson, Digital Curator Wellcome Library d.thompson@wellcome.ac.uk http://wellcomelibrary.org/