SlideShare une entreprise Scribd logo
1  sur  27
Digitisation Open Day
Doing Projects
30 January 2014

Matthew Brack
Digitisation Project Manager
Wellcome Digital Library Programme

10 ‘laws’ of digitisation

These are personal views based on experience of doing digitisation at Wellcome Library…
#1: Know your purpose
(“Thou shalt observe real users
and keep them holy”)

Obvious but important: who are you doing digitisation for? Knowing the answer
to that question will affect every subsequent decision you make in your project.
Digitisation Open Day

SELECTION

DIGITISATION

DELIVERY

You need to ensure that this common purpose is shared by all project stakeholders and
is a thread running through the whole process from the very beginning to the end.
#2: Know Project Management

If you’ve never run a digitisation project before and only do one thing to prepare:
study project management. Your project is more likely to succeed with an
understanding of project management than a technical understanding of digitisation.
Digitisation Open Day

The nature of digitisation
Digitisation Open Day

METADATA

SYSTEMS

RETRIEVAL

CAPTURE

CONSERVATION

FINAL PREP
#3: There is no ‘best practice’

We can tell you how we created our digital library, but it’s unlikely that anyone is
going to be able to leave today and, even with a blank check, put that into practice
to solve their particular problems – there are too many variables.
Digitisation Open Day

Digitisation Doctor in a slide
Got general questions about digitisation?
The answer will always be: “It depends”
#4: There are no simple projects
(especially at the beginning)
Digitisation Open Day

Project problems
post-mortem:
Machinery issues
Retrieval across 30
collections, 4 floors,
2 buildings, 2 states
of access
Copyright clearance
in parallel

12% of selection not
found
Display issues
#5: Imaging is the quickest step

The imaging step is dwarfed by preceding preparation and subsequent digital asset
management processes, yet it’s the most visible aspect of any digitisation project.
215B STACKS

1.22 STORAGE

CONSERVATION

1.21 DIADEIS

BOOKS IN
STACKS

START

CONDITION?

NO

NOTE

CATALOGUING

IN
SCOPE

1a
YES

FAIR

STAY ON
SHELF

REPAIR

7

1c
NO
PRINT
CAT?

BOX
ONLINE
CAT?

1b
POOR
8

YES
GENERATE
SHELF
LIST

NOTE

NO

1d

1.22
STORE
TO
CATALOGUE?

SINGLE
SHELF
LISTS

DUPLICATE
CHECK

YES

9

2
CATALOGUE

3
SORT
BY
SIZE

4

10
1.22
STORE

CHECK
OUT

CHECK
OUT

5

DIGITISE

NOT OK
6

LARGER

UPDATE
SHELF
LIST

11

RETURN TO
SHELF

OK

CON
ASSESS

1.22
STORE
NO WAY

Imaging step
within a
preparation
workflow
#6: Metadata is really important

(see Dave Thompson’s presentation)

Lacking good metadata is an existential threat to your project – without it your
digital content will simply disappear and never be seen by users.
Digitisation Open Day

Metadata
• Digital objects „don‟t exist‟ without metadata – no search, no
discovery
• Metadata first, then digitisation – otherwise you don‟t know what
you have, where it is, or any way of controlling it…
• On average 50% of project time is spent on metadata and
cataloguing
• Must be shaped by user need and what an organisation is
capable of delivering
• Tension between low-volume digitisation with more metadata
for a richer user experience or larger-scale digitisation with
lighter metadata attached
• Standards-based framework helpful for consistency, accuracy
and efficiency in metadata input (e.g. Dublin Core, MARC21)
Digitisation Open Day

Metadata examples
#7: It’s lots of small tasks
(repeated over and over…)
Digitisation Open Day

Tracking and retrieval
1.
2.
3.
4.
5.
6.
7.
8.
9.

Generate unique ID
Create ‘scan list’
Create „review file‟
Make unavailable to users
Create barcodes
Retrieve items
Insert barcodes
Deliver items for imaging
Update tracking list

[Re-work]

a.
b.
c.
d.
e.
f.

Return
Remove barcodes
Update tracking list
Make available to users
Pray for no more re-work
Repeat for next batch
#8: Digi can damage your stuff
(but not as much as you’d think)
Digitisation Open Day

Conservation
• Most damage to collections comes from handling
• Digitisation handles collections intensively in
new ways
• Survey to develop image capture approach and identify
out of scope material
• Survey detail depends on collection
• Training for photographers and digital preparators
• Actual preparation of materials (staples, openings)
• Digitisation is not preservation
#9: Digitisation is not preservation

This should not be a guiding principle of your project:
Generally your original physical material is going to last much longer than your
digital manifestation – no competition.
You’ve just created a second collection of material that you need to ‘preserve’
and manage.
Preservation doesn’t mean much in a digital context – it’s actually a
contradiction from traditional usage, which succeeds by restricting access –
what we are interested in is sustainable access.
#10: Copyright + sensitivity =
workflow
Digitisation Open Day

Copyright and sensitivity
• UK copyright law is lagging behind the needs of
today‟s economy
• UK copyright is held by the creator and not the owner of
a work, making a rights risk assessment essential for
most projects
• Rights clearance of works on an item-by-item
basis is unworkable in the context of mass
digitisation
• Small organisations without legal support are
unlikely to take the risk of digitising orphan works, or
anything else that carries potential copyright risk
Digitisation Open Day

ProQuest EEB Project Overview
Project Scope:
14,000 books
5.5 million images
Incunabula to 1700
Printed outside UK
Access in UK and
HINARI – 15 years

3600 books now online: http://eeb.chadwyck.com
Digitisation Open Day

Phase 2 projects
Reading Room / Project X

Forensics and
Sex temporary
exhibitions

Western Manuscripts 1000-1650
Digitisation Open Day

Useful resources
THORNTON, E. (2013) Digitisation Doctor Workshop. 15th April 2013.
Available from: http://blog.wellcomelibrary.org/2013/05/resources-fromdigitisation-doctor-workshop-now-available
HENSHAW, C. and KILEY, R. (2013) The Wellcome Library, Digital.
Ariadne. July 2013. Available from:
http://www.ariadne.ac.uk/issue71/henshaw-kiley
JISC, Project Management for Digitisation, JISC Digital Media. Available
from: http://www.jiscdigitalmedia.ac.uk/guide/project-management-fora-digitisation-project
BRACK, M. (2012) Bridging the Gap: Library digital collections, innovation
and the user. Thesis submitted in partial fulfilment of the requirements
of King‟s College London for the Degree of Masters in Digital Asset
Management. Available from: http://nsla.org.au/publication/bridginggap-library-digital-collections-innovation-and-user
Digitisation Open Day

Thanks

m.brack@wellcome.ac.uk
@WellcomeDigital
@MatthewBrack

Contenu connexe

Similaire à Doing Projects: 10 laws of digitisation

Conservation for Digitisation
Conservation for DigitisationConservation for Digitisation
Conservation for DigitisationWellcome Library
 
AIIM Ottawa Presentation Digital Preservation A Wicked Problem
 AIIM Ottawa Presentation Digital Preservation A Wicked Problem  AIIM Ottawa Presentation Digital Preservation A Wicked Problem
AIIM Ottawa Presentation Digital Preservation A Wicked Problem Debra Power
 
2015 Bio-IT Trends From the Trenches
2015 Bio-IT Trends From the Trenches2015 Bio-IT Trends From the Trenches
2015 Bio-IT Trends From the TrenchesChris Dagdigian
 
UX camp trustworks
UX camp trustworksUX camp trustworks
UX camp trustworkslmdelvi
 
Make it Last: Principals for Digital Preservation and Conservation
Make it Last: Principals for Digital Preservation and ConservationMake it Last: Principals for Digital Preservation and Conservation
Make it Last: Principals for Digital Preservation and ConservationTrevor Owens
 
Gearing up! A Designer-Focused Evaluation of Ideation Tools for Connected Pro...
Gearing up! A Designer-Focused Evaluation of Ideation Tools for Connected Pro...Gearing up! A Designer-Focused Evaluation of Ideation Tools for Connected Pro...
Gearing up! A Designer-Focused Evaluation of Ideation Tools for Connected Pro...Dries De Roeck
 
Start Today: Digital Stewardship Communities & Collaborations
Start Today: Digital Stewardship  Communities & CollaborationsStart Today: Digital Stewardship  Communities & Collaborations
Start Today: Digital Stewardship Communities & CollaborationsTrevor Owens
 
Agile Development - Are you building the right thing ? (Follow the value)
Agile Development - Are you building the right thing ? (Follow the value)Agile Development - Are you building the right thing ? (Follow the value)
Agile Development - Are you building the right thing ? (Follow the value)Martin Nymann Vinther
 
6 Digital Myths Debunked: What it really takes to create a dynamic web presence
6 Digital Myths Debunked: What it really takes to create a dynamic web presence6 Digital Myths Debunked: What it really takes to create a dynamic web presence
6 Digital Myths Debunked: What it really takes to create a dynamic web presencePark Howell
 
CDE Catapult by Maurizio Pilu - Cambridge Wireless Event - 28 Nov 2013
CDE Catapult by Maurizio Pilu - Cambridge Wireless Event - 28 Nov 2013CDE Catapult by Maurizio Pilu - Cambridge Wireless Event - 28 Nov 2013
CDE Catapult by Maurizio Pilu - Cambridge Wireless Event - 28 Nov 2013Maurizio Pilu
 
A4 i2018 blockchain_slideshare
A4 i2018 blockchain_slideshareA4 i2018 blockchain_slideshare
A4 i2018 blockchain_slideshareNadia Fabrizio
 
Enduring Digital Access: Establishing, Supporting, and Sustaining Digital Cur...
Enduring Digital Access: Establishing, Supporting, and Sustaining Digital Cur...Enduring Digital Access: Establishing, Supporting, and Sustaining Digital Cur...
Enduring Digital Access: Establishing, Supporting, and Sustaining Digital Cur...Trevor Owens
 
Tieto ped2018 allhumansarenaturalborndesign hinkers
Tieto ped2018 allhumansarenaturalborndesign hinkersTieto ped2018 allhumansarenaturalborndesign hinkers
Tieto ped2018 allhumansarenaturalborndesign hinkersSean McGuire
 
Rapid Product Design in the Wild
Rapid Product Design in the WildRapid Product Design in the Wild
Rapid Product Design in the WildMichele Ide-Smith
 
Open Data at Edinburgh City Council
Open Data at Edinburgh City CouncilOpen Data at Edinburgh City Council
Open Data at Edinburgh City CouncilJadu
 
It's Better To Have a Permanent Income Than to Be Fascinating: Killer Feature...
It's Better To Have a Permanent Income Than to Be Fascinating: Killer Feature...It's Better To Have a Permanent Income Than to Be Fascinating: Killer Feature...
It's Better To Have a Permanent Income Than to Be Fascinating: Killer Feature...Ultan O'Broin
 
Acquia Opensource Conference 2014 for UK Public Sector
Acquia Opensource Conference 2014 for UK Public SectorAcquia Opensource Conference 2014 for UK Public Sector
Acquia Opensource Conference 2014 for UK Public SectorAcquia
 
#noprojects (digest version)
#noprojects (digest version)#noprojects (digest version)
#noprojects (digest version)Fabian Kiss
 
Getting Things Done for Technical Communicators at TCUK14
Getting Things Done for Technical Communicators at TCUK14Getting Things Done for Technical Communicators at TCUK14
Getting Things Done for Technical Communicators at TCUK14Karen Mardahl
 

Similaire à Doing Projects: 10 laws of digitisation (20)

Matthew Brack Wellcome Library Presentation
Matthew Brack Wellcome Library PresentationMatthew Brack Wellcome Library Presentation
Matthew Brack Wellcome Library Presentation
 
Conservation for Digitisation
Conservation for DigitisationConservation for Digitisation
Conservation for Digitisation
 
AIIM Ottawa Presentation Digital Preservation A Wicked Problem
 AIIM Ottawa Presentation Digital Preservation A Wicked Problem  AIIM Ottawa Presentation Digital Preservation A Wicked Problem
AIIM Ottawa Presentation Digital Preservation A Wicked Problem
 
2015 Bio-IT Trends From the Trenches
2015 Bio-IT Trends From the Trenches2015 Bio-IT Trends From the Trenches
2015 Bio-IT Trends From the Trenches
 
UX camp trustworks
UX camp trustworksUX camp trustworks
UX camp trustworks
 
Make it Last: Principals for Digital Preservation and Conservation
Make it Last: Principals for Digital Preservation and ConservationMake it Last: Principals for Digital Preservation and Conservation
Make it Last: Principals for Digital Preservation and Conservation
 
Gearing up! A Designer-Focused Evaluation of Ideation Tools for Connected Pro...
Gearing up! A Designer-Focused Evaluation of Ideation Tools for Connected Pro...Gearing up! A Designer-Focused Evaluation of Ideation Tools for Connected Pro...
Gearing up! A Designer-Focused Evaluation of Ideation Tools for Connected Pro...
 
Start Today: Digital Stewardship Communities & Collaborations
Start Today: Digital Stewardship  Communities & CollaborationsStart Today: Digital Stewardship  Communities & Collaborations
Start Today: Digital Stewardship Communities & Collaborations
 
Agile Development - Are you building the right thing ? (Follow the value)
Agile Development - Are you building the right thing ? (Follow the value)Agile Development - Are you building the right thing ? (Follow the value)
Agile Development - Are you building the right thing ? (Follow the value)
 
6 Digital Myths Debunked: What it really takes to create a dynamic web presence
6 Digital Myths Debunked: What it really takes to create a dynamic web presence6 Digital Myths Debunked: What it really takes to create a dynamic web presence
6 Digital Myths Debunked: What it really takes to create a dynamic web presence
 
CDE Catapult by Maurizio Pilu - Cambridge Wireless Event - 28 Nov 2013
CDE Catapult by Maurizio Pilu - Cambridge Wireless Event - 28 Nov 2013CDE Catapult by Maurizio Pilu - Cambridge Wireless Event - 28 Nov 2013
CDE Catapult by Maurizio Pilu - Cambridge Wireless Event - 28 Nov 2013
 
A4 i2018 blockchain_slideshare
A4 i2018 blockchain_slideshareA4 i2018 blockchain_slideshare
A4 i2018 blockchain_slideshare
 
Enduring Digital Access: Establishing, Supporting, and Sustaining Digital Cur...
Enduring Digital Access: Establishing, Supporting, and Sustaining Digital Cur...Enduring Digital Access: Establishing, Supporting, and Sustaining Digital Cur...
Enduring Digital Access: Establishing, Supporting, and Sustaining Digital Cur...
 
Tieto ped2018 allhumansarenaturalborndesign hinkers
Tieto ped2018 allhumansarenaturalborndesign hinkersTieto ped2018 allhumansarenaturalborndesign hinkers
Tieto ped2018 allhumansarenaturalborndesign hinkers
 
Rapid Product Design in the Wild
Rapid Product Design in the WildRapid Product Design in the Wild
Rapid Product Design in the Wild
 
Open Data at Edinburgh City Council
Open Data at Edinburgh City CouncilOpen Data at Edinburgh City Council
Open Data at Edinburgh City Council
 
It's Better To Have a Permanent Income Than to Be Fascinating: Killer Feature...
It's Better To Have a Permanent Income Than to Be Fascinating: Killer Feature...It's Better To Have a Permanent Income Than to Be Fascinating: Killer Feature...
It's Better To Have a Permanent Income Than to Be Fascinating: Killer Feature...
 
Acquia Opensource Conference 2014 for UK Public Sector
Acquia Opensource Conference 2014 for UK Public SectorAcquia Opensource Conference 2014 for UK Public Sector
Acquia Opensource Conference 2014 for UK Public Sector
 
#noprojects (digest version)
#noprojects (digest version)#noprojects (digest version)
#noprojects (digest version)
 
Getting Things Done for Technical Communicators at TCUK14
Getting Things Done for Technical Communicators at TCUK14Getting Things Done for Technical Communicators at TCUK14
Getting Things Done for Technical Communicators at TCUK14
 

Plus de Wellcome Library

Wellcome Library Transcribing Recipes report
Wellcome Library Transcribing Recipes reportWellcome Library Transcribing Recipes report
Wellcome Library Transcribing Recipes reportWellcome Library
 
ProQuest Early European Books: Partner Perspective
ProQuest Early European Books: Partner PerspectiveProQuest Early European Books: Partner Perspective
ProQuest Early European Books: Partner PerspectiveWellcome Library
 
Wt dnt digitisation_open_day_v9
Wt dnt digitisation_open_day_v9Wt dnt digitisation_open_day_v9
Wt dnt digitisation_open_day_v9Wellcome Library
 
Creating an online resource for medical archives at the Wellcome Library
Creating an online resource for medical archives at the Wellcome LibraryCreating an online resource for medical archives at the Wellcome Library
Creating an online resource for medical archives at the Wellcome LibraryWellcome Library
 
Jpeg2000 at Wellcome Library
Jpeg2000 at Wellcome LibraryJpeg2000 at Wellcome Library
Jpeg2000 at Wellcome LibraryWellcome Library
 
Digitisation Projects at Wellcome Library
Digitisation Projects at Wellcome LibraryDigitisation Projects at Wellcome Library
Digitisation Projects at Wellcome LibraryWellcome Library
 
Copyright Clearance for Genetics Books, A pilot project at the Wellcome Library
Copyright Clearance for Genetics Books, A pilot project at the Wellcome LibraryCopyright Clearance for Genetics Books, A pilot project at the Wellcome Library
Copyright Clearance for Genetics Books, A pilot project at the Wellcome LibraryWellcome Library
 
Managing Large Scale Digitisation at the Wellcome Library
Managing Large Scale Digitisation at the Wellcome LibraryManaging Large Scale Digitisation at the Wellcome Library
Managing Large Scale Digitisation at the Wellcome LibraryWellcome Library
 
Upscaling digitisation at the Wellcome Library
Upscaling digitisation at the Wellcome LibraryUpscaling digitisation at the Wellcome Library
Upscaling digitisation at the Wellcome LibraryWellcome Library
 
Mandating Open Access - Wellcome Trust
Mandating Open Access - Wellcome TrustMandating Open Access - Wellcome Trust
Mandating Open Access - Wellcome TrustWellcome Library
 

Plus de Wellcome Library (11)

Wellcome Library Transcribing Recipes report
Wellcome Library Transcribing Recipes reportWellcome Library Transcribing Recipes report
Wellcome Library Transcribing Recipes report
 
ProQuest Early European Books: Partner Perspective
ProQuest Early European Books: Partner PerspectiveProQuest Early European Books: Partner Perspective
ProQuest Early European Books: Partner Perspective
 
Wt dnt digitisation_open_day_v9
Wt dnt digitisation_open_day_v9Wt dnt digitisation_open_day_v9
Wt dnt digitisation_open_day_v9
 
Creating an online resource for medical archives at the Wellcome Library
Creating an online resource for medical archives at the Wellcome LibraryCreating an online resource for medical archives at the Wellcome Library
Creating an online resource for medical archives at the Wellcome Library
 
Jpeg2000 at Wellcome Library
Jpeg2000 at Wellcome LibraryJpeg2000 at Wellcome Library
Jpeg2000 at Wellcome Library
 
Digitisation Projects at Wellcome Library
Digitisation Projects at Wellcome LibraryDigitisation Projects at Wellcome Library
Digitisation Projects at Wellcome Library
 
Image Capture
Image CaptureImage Capture
Image Capture
 
Copyright Clearance for Genetics Books, A pilot project at the Wellcome Library
Copyright Clearance for Genetics Books, A pilot project at the Wellcome LibraryCopyright Clearance for Genetics Books, A pilot project at the Wellcome Library
Copyright Clearance for Genetics Books, A pilot project at the Wellcome Library
 
Managing Large Scale Digitisation at the Wellcome Library
Managing Large Scale Digitisation at the Wellcome LibraryManaging Large Scale Digitisation at the Wellcome Library
Managing Large Scale Digitisation at the Wellcome Library
 
Upscaling digitisation at the Wellcome Library
Upscaling digitisation at the Wellcome LibraryUpscaling digitisation at the Wellcome Library
Upscaling digitisation at the Wellcome Library
 
Mandating Open Access - Wellcome Trust
Mandating Open Access - Wellcome TrustMandating Open Access - Wellcome Trust
Mandating Open Access - Wellcome Trust
 

Dernier

A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 

Dernier (20)

A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 

Doing Projects: 10 laws of digitisation

  • 1. Digitisation Open Day Doing Projects 30 January 2014 Matthew Brack Digitisation Project Manager
  • 2. Wellcome Digital Library Programme 10 ‘laws’ of digitisation These are personal views based on experience of doing digitisation at Wellcome Library…
  • 3. #1: Know your purpose (“Thou shalt observe real users and keep them holy”) Obvious but important: who are you doing digitisation for? Knowing the answer to that question will affect every subsequent decision you make in your project.
  • 4. Digitisation Open Day SELECTION DIGITISATION DELIVERY You need to ensure that this common purpose is shared by all project stakeholders and is a thread running through the whole process from the very beginning to the end.
  • 5. #2: Know Project Management If you’ve never run a digitisation project before and only do one thing to prepare: study project management. Your project is more likely to succeed with an understanding of project management than a technical understanding of digitisation.
  • 6. Digitisation Open Day The nature of digitisation
  • 8. #3: There is no ‘best practice’ We can tell you how we created our digital library, but it’s unlikely that anyone is going to be able to leave today and, even with a blank check, put that into practice to solve their particular problems – there are too many variables.
  • 9. Digitisation Open Day Digitisation Doctor in a slide Got general questions about digitisation? The answer will always be: “It depends”
  • 10. #4: There are no simple projects (especially at the beginning)
  • 11. Digitisation Open Day Project problems post-mortem: Machinery issues Retrieval across 30 collections, 4 floors, 2 buildings, 2 states of access Copyright clearance in parallel 12% of selection not found Display issues
  • 12. #5: Imaging is the quickest step The imaging step is dwarfed by preceding preparation and subsequent digital asset management processes, yet it’s the most visible aspect of any digitisation project.
  • 13. 215B STACKS 1.22 STORAGE CONSERVATION 1.21 DIADEIS BOOKS IN STACKS START CONDITION? NO NOTE CATALOGUING IN SCOPE 1a YES FAIR STAY ON SHELF REPAIR 7 1c NO PRINT CAT? BOX ONLINE CAT? 1b POOR 8 YES GENERATE SHELF LIST NOTE NO 1d 1.22 STORE TO CATALOGUE? SINGLE SHELF LISTS DUPLICATE CHECK YES 9 2 CATALOGUE 3 SORT BY SIZE 4 10 1.22 STORE CHECK OUT CHECK OUT 5 DIGITISE NOT OK 6 LARGER UPDATE SHELF LIST 11 RETURN TO SHELF OK CON ASSESS 1.22 STORE NO WAY Imaging step within a preparation workflow
  • 14. #6: Metadata is really important (see Dave Thompson’s presentation) Lacking good metadata is an existential threat to your project – without it your digital content will simply disappear and never be seen by users.
  • 15. Digitisation Open Day Metadata • Digital objects „don‟t exist‟ without metadata – no search, no discovery • Metadata first, then digitisation – otherwise you don‟t know what you have, where it is, or any way of controlling it… • On average 50% of project time is spent on metadata and cataloguing • Must be shaped by user need and what an organisation is capable of delivering • Tension between low-volume digitisation with more metadata for a richer user experience or larger-scale digitisation with lighter metadata attached • Standards-based framework helpful for consistency, accuracy and efficiency in metadata input (e.g. Dublin Core, MARC21)
  • 17. #7: It’s lots of small tasks (repeated over and over…)
  • 18. Digitisation Open Day Tracking and retrieval 1. 2. 3. 4. 5. 6. 7. 8. 9. Generate unique ID Create ‘scan list’ Create „review file‟ Make unavailable to users Create barcodes Retrieve items Insert barcodes Deliver items for imaging Update tracking list [Re-work] a. b. c. d. e. f. Return Remove barcodes Update tracking list Make available to users Pray for no more re-work Repeat for next batch
  • 19. #8: Digi can damage your stuff (but not as much as you’d think)
  • 20. Digitisation Open Day Conservation • Most damage to collections comes from handling • Digitisation handles collections intensively in new ways • Survey to develop image capture approach and identify out of scope material • Survey detail depends on collection • Training for photographers and digital preparators • Actual preparation of materials (staples, openings) • Digitisation is not preservation
  • 21. #9: Digitisation is not preservation This should not be a guiding principle of your project: Generally your original physical material is going to last much longer than your digital manifestation – no competition. You’ve just created a second collection of material that you need to ‘preserve’ and manage. Preservation doesn’t mean much in a digital context – it’s actually a contradiction from traditional usage, which succeeds by restricting access – what we are interested in is sustainable access.
  • 22. #10: Copyright + sensitivity = workflow
  • 23. Digitisation Open Day Copyright and sensitivity • UK copyright law is lagging behind the needs of today‟s economy • UK copyright is held by the creator and not the owner of a work, making a rights risk assessment essential for most projects • Rights clearance of works on an item-by-item basis is unworkable in the context of mass digitisation • Small organisations without legal support are unlikely to take the risk of digitising orphan works, or anything else that carries potential copyright risk
  • 24. Digitisation Open Day ProQuest EEB Project Overview Project Scope: 14,000 books 5.5 million images Incunabula to 1700 Printed outside UK Access in UK and HINARI – 15 years 3600 books now online: http://eeb.chadwyck.com
  • 25. Digitisation Open Day Phase 2 projects Reading Room / Project X Forensics and Sex temporary exhibitions Western Manuscripts 1000-1650
  • 26. Digitisation Open Day Useful resources THORNTON, E. (2013) Digitisation Doctor Workshop. 15th April 2013. Available from: http://blog.wellcomelibrary.org/2013/05/resources-fromdigitisation-doctor-workshop-now-available HENSHAW, C. and KILEY, R. (2013) The Wellcome Library, Digital. Ariadne. July 2013. Available from: http://www.ariadne.ac.uk/issue71/henshaw-kiley JISC, Project Management for Digitisation, JISC Digital Media. Available from: http://www.jiscdigitalmedia.ac.uk/guide/project-management-fora-digitisation-project BRACK, M. (2012) Bridging the Gap: Library digital collections, innovation and the user. Thesis submitted in partial fulfilment of the requirements of King‟s College London for the Degree of Masters in Digital Asset Management. Available from: http://nsla.org.au/publication/bridginggap-library-digital-collections-innovation-and-user

Notes de l'éditeur

  1. After three of these sessions, time to add some flavour…Some of us here have been ‘hacking’ our jobs recently, trying to summarise our experiences in pith statements… 10’s a nice number Some of them may come across as imperatives, but really these are just personal observations…
  2. Following on from the 10 UX commandments…Who are you doing this project for? Knowing the answer to that question will affect every other decision you make in a project…
  3. Example: strictly speaking selection and delivery not my responsibility as digi project manager, but these are key stakeholders for every digi project.Who is using this stuff – how is it delivered – does the person selecting the stuff understand how it will be used?
  4. If you’ve never run a digi project before and only do one thing to prepare, this should be it… Your digi project is more likely to succeed with an understanding of project management than, say, a technical understanding of digitisation…Please don’t read any books on digitisation, read about pmgmt instead…
  5. It’s very important that you strive for an appreciation of both of the digital and the physical and how they interact in order to execute a good digi project.
  6. Importance of project management…
  7. So I said please don’t read books on digitisation… that also because it’s going to be hard to find information directly applicable to your projects there…There are lowest common denominators, which you’ll find from your own experience and talking to someone who’s done this stuff…We can tell you how we created our digital library, but it’s unlikely that anyone is going to be able to leave today and, even with a blank check, put that into practice to solve their particular problems – there are too many variables… Institutional cultureResourcesContent typeAudience
  8. Every digitisation project is differentNot really a ‘best practice’, certainly not for everyone who is doing digitisation (large and small institutions etc.) – it’s contextWhat we have is a shared goal (sustainable access), it’s getting there….
  9. An innocent-looking project (!) – ‘only’ 2,000 books, and a robot to do them on – what could go wrong?Machinery issues – smokin’Retrieval – ‘theme’ issue – 30 collections, 4 floors, 2 buildings, 2 states of access (closed and open shelves)Return (general classification!) – couldn’t find 12% of themFirst use of Goobi WTSCopyright clearance – not feasible for mass digi (us and BL)Display – confusion over covers presented together at image 1 and 2
  10. Please don’t tell anyone in the photography studio when you go up, but it is…First of all, it’s dwarfed by the preceding and succeeding steps in the workflow – Second, it’s actually something that you can apply basic parameters to (image specs, file format, camera set-up) – once established you’re all set…Just don’t use scanners and you’ll be fine…
  11. Digitisation is mostly preparation…Here is the imaging step in the workflow…You’ll later see this entire workflow itself dwarfed by Dave’s system architecture…
  12. No good metadata for your project is an existential threat to your project…
  13. [All the different places where you use metadata] …You need good cataloguing to do digi – you shouldn’t start without it…Otherwise you don’t know what you have, where it is, or any way of controlling it…In particular you need administrative metadata that connects back to the physical object you’re digitising…Which goes back to bridging the gap between physical and digital…With metadata you have to string that thread through from beginning to end…Don’t forget that physical link to the digital…
  14. It’s very time-consuming… could use lots of examples from workflow: photography, metadata addition…Often, especially if you’re in a library, you’re using legacy systems that really prefer to think digital delivery doesn’t exist… So you have to retrofit this clanger one piece at a time…
  15. Excel spread sheets and barcodes are your best friends…
  16. This should not be a guiding principle of your digi project.In some cases, like film stock or highly deteriorated items, this could be true…But generally your original physical material is going to last much longer than your digital manifestation – no competition.You’ve just created a second collection of material that you need to ‘preserve’ and manage…Preservation doesn’t mean much in a digital context – it’s actually a contradiction from traditional usage (restricting access) – we are interested in access…You might say, fine, we’ll restrict access to preserve our originals. Two potential implications: don’t create a self-imposed obsolescence for your physical building (there might be someone upstairs who wonders why they’re keeping London real estate for stuff that’s only available online - some people still think that ‘going digital’ equates to reducing costs)What would your users think?Time and again it seems that the physical originals are consulted more frequently after digitisation.
  17. The workflow: Digitisation involves a lot of stakeholders…Also slices through the traditional library organisation…