SlideShare une entreprise Scribd logo
1  sur  40
Crowdsourcing Transcription
with Open Source Software
Ben Brumfield
MAC Fall Symposium 2013
Why Transcribe?

Crowdsourcing can be
− Tagging
− Georectification
− Identification

But if you've got scanned documents, you've got
a problem
Serendipity: One Volunteer's Story
Nat Wooding
– Semi-retired data analyst
– 200 pages of Julia Brumfield's 1923 diary in nine
months
– No relation to diarist
Serendipity: One Volunteer's Story
Nat Wooding
– Semi-retired data analyst
– 200 pages of Julia Brumfield's 1923 diary in nine
months
– No relation to diarist
– Great uncle was diarist's letter carrier, also
named Nat Wooding
Why Crowdsource?
Free Labor!
Why Crowdsource?
Free Labor!
“Free as in beer”
“Free as in speech”
“Free as in....
Free as in puppy!
http://www.flickr.com/photos/magnusbrath/7614518858/
Why Crowdsource?
“At its best, crowdsourcing is not about
getting someone to do work for you, it is
about offering your users the
opportunity to participate in public
memory.”
– Trevor Owens, “Crowdsourcing Cultural Heritage:
The Objectives are Upside-down”
Why Crowdsource?
“By engaging the public in digitising our
collections, we are
− Increasing the scientific literacy of the public
− Providing increased access to our collections
− Building an advocacy network for our collections
and our institutions.”
– Paul Flemons, Australian Museum
Why Crowdsource?

Convert website visitors into volunteers

Convert volunteers into advocates

What's next?
Questions?
Choosing a Transcription Platform

The good news:
– More than 30 tools to choose from!
Choosing a Transcription Platform

The good news:
– More than 30 tools to choose from!

The bad news:
– More than 30 tools to choose from!
Selection Factors
● Source Material
● Transcript Purpose
● Organizational/Project Management Fit
● Financial and Technical Resources
Source Material
● Is it of interest to anyone else?
● Is it under copyright?
● Does it need restricted access?
● Is it composed of “text” or “records”?
● How complex is the layout? How
important is that layout?
Purpose
•How will you be using the transcribed data?
– Traditional print editions
– Searchable online editions
•Do you want to use the system to analyze
the text?
•Do you need to import the transcripts into
other systems?
•Is public engagement the only goal?
Organizational Fit
•How important is traditional editorial
workflow?
•Will you rely on volunteers? How will you
find and motivate them?
•What is the duration of the project?
•Is there a "final version"?
•Is TEI a mandate?
Financial and Technical Resources
•System administrators to install non-hosted
software?
•Money to pay hosting costs?
•Programming skills to customize a tool?
•Money to pay programmers for
customization?
•Support for on-going costs to keep the site
running, however small?
The Tools
● Recent (oldest started in 2005)
● Influenced by origin
● Still pretty raw
● Most require tech expertise for set-up and
customization
● All require making trade-offs
http://tinyurl.com/TranscriptionToolGDoc
Open-source, On-site Tools
Scripto
Bentham Transcription Desk
NARA Transcribr Drupal Module
Zooniverse Scribe
Quick Definitions
MediaWiki: Popular software framework for
runnning wiki projects
Wikipedia, Wikisource, Wiktionary, Wikitravel:
Projects running on MediaWiki
WikiMedia: Organization running many—but not
all—MediaWiki-based wiki projects.
Hosted Tools
Virtual Transcription Laboratory
Wikisource.org
FromThePage.com
Virtual Transcription Laboratory
Virtual Transcription Laboratory
Wikisource
Live demo of State Library of Queensland on
Wikisource showing project page, edit screen,
and editorial workflow.
Recommendation of Lori and the GLAMWiki
group to help organizations navigate the
community.
FromThePage
Live demo of FromThePage showing edit
screen, wiki-linking a single term, read pages
for a subject, full-text search on name variants,
and auto-link.
Thanks!
Ben Brumfield
benwbrum@gmail.com
@benwbrum
http://manuscripttranscription.blogspot.com
My transcription tools:
– FromThePage.com
– OpenSourceIndexing.org
http://tinyurl.com/TranscriptionToolGDoc
Crowdsourcing Transcription with Open Source Software

Contenu connexe

Similaire à Crowdsourcing Transcription with Open Source Software

Geek Empowerment - The Real Heart of Open Source
Geek Empowerment - The Real Heart of Open SourceGeek Empowerment - The Real Heart of Open Source
Geek Empowerment - The Real Heart of Open SourceRussell Pavlicek
 
F+ presentation public en
F+ presentation public enF+ presentation public en
F+ presentation public enSergiy Gladkyy
 
Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...
Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...
Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...Anselm Hook
 
PyData Texas 2015 Keynote
PyData Texas 2015 KeynotePyData Texas 2015 Keynote
PyData Texas 2015 KeynotePeter Wang
 
Analytic Journalism: Digital Evolution in the Datasphere
Analytic Journalism: Digital Evolution in the DatasphereAnalytic Journalism: Digital Evolution in the Datasphere
Analytic Journalism: Digital Evolution in the DatasphereJ T "Tom" Johnson
 
HEL_Data_Journalism_Jessica_Mariani
HEL_Data_Journalism_Jessica_MarianiHEL_Data_Journalism_Jessica_Mariani
HEL_Data_Journalism_Jessica_Marianijessicamariani
 
The Elusive Nature of Software Documentation
The Elusive Nature of Software DocumentationThe Elusive Nature of Software Documentation
The Elusive Nature of Software DocumentationMargaret-Anne Storey
 
Why Computer Science is a Great Choice
Why Computer Science is a Great ChoiceWhy Computer Science is a Great Choice
Why Computer Science is a Great Choiceturingfan
 
Accessibility & Universal Design
Accessibility & Universal DesignAccessibility & Universal Design
Accessibility & Universal DesignSrutiVijaykumar
 
Ficod 2011 (keynote file)
Ficod 2011 (keynote file)Ficod 2011 (keynote file)
Ficod 2011 (keynote file)Tim O'Reilly
 
What is open source?
What is open source?What is open source?
What is open source?Ahmet Bulut
 
Community, Unifying the Geeks to Create Value - Demi Ben-Ari
Community, Unifying the Geeks to Create Value - Demi Ben-AriCommunity, Unifying the Geeks to Create Value - Demi Ben-Ari
Community, Unifying the Geeks to Create Value - Demi Ben-AriDemi Ben-Ari
 
The Well Connected Facility
The Well Connected FacilityThe Well Connected Facility
The Well Connected FacilityRyan Duggan
 
Open source for Libraries
Open source for LibrariesOpen source for Libraries
Open source for LibrariesNicole Baratta
 
Open Sesame (and other open movements)
Open Sesame (and other open movements)Open Sesame (and other open movements)
Open Sesame (and other open movements)Dorothea Salo
 
Open Your Mind: Open Source in Libraries
Open Your Mind: Open Source in LibrariesOpen Your Mind: Open Source in Libraries
Open Your Mind: Open Source in LibrariesNicole Baratta
 
Of Dodos, 'Karma' & Free Software in the Library
Of Dodos, 'Karma' & Free Software in the LibraryOf Dodos, 'Karma' & Free Software in the Library
Of Dodos, 'Karma' & Free Software in the LibraryIndranil Das Gupta
 
Digital Libraries and the quest for information curation
Digital Libraries and the quest for information curationDigital Libraries and the quest for information curation
Digital Libraries and the quest for information curationLuis Borges Gouveia
 

Similaire à Crowdsourcing Transcription with Open Source Software (20)

Geek Empowerment - The Real Heart of Open Source
Geek Empowerment - The Real Heart of Open SourceGeek Empowerment - The Real Heart of Open Source
Geek Empowerment - The Real Heart of Open Source
 
F+ presentation public en
F+ presentation public enF+ presentation public en
F+ presentation public en
 
Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...
Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...
Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...
 
PyData Texas 2015 Keynote
PyData Texas 2015 KeynotePyData Texas 2015 Keynote
PyData Texas 2015 Keynote
 
Analytic Journalism: Digital Evolution in the Datasphere
Analytic Journalism: Digital Evolution in the DatasphereAnalytic Journalism: Digital Evolution in the Datasphere
Analytic Journalism: Digital Evolution in the Datasphere
 
HEL_Data_Journalism_Jessica_Mariani
HEL_Data_Journalism_Jessica_MarianiHEL_Data_Journalism_Jessica_Mariani
HEL_Data_Journalism_Jessica_Mariani
 
Evc2014
Evc2014Evc2014
Evc2014
 
The Elusive Nature of Software Documentation
The Elusive Nature of Software DocumentationThe Elusive Nature of Software Documentation
The Elusive Nature of Software Documentation
 
Why Computer Science is a Great Choice
Why Computer Science is a Great ChoiceWhy Computer Science is a Great Choice
Why Computer Science is a Great Choice
 
Accessibility & Universal Design
Accessibility & Universal DesignAccessibility & Universal Design
Accessibility & Universal Design
 
Ficod 2011 (keynote file)
Ficod 2011 (keynote file)Ficod 2011 (keynote file)
Ficod 2011 (keynote file)
 
What is open source?
What is open source?What is open source?
What is open source?
 
Community, Unifying the Geeks to Create Value - Demi Ben-Ari
Community, Unifying the Geeks to Create Value - Demi Ben-AriCommunity, Unifying the Geeks to Create Value - Demi Ben-Ari
Community, Unifying the Geeks to Create Value - Demi Ben-Ari
 
The Well Connected Facility
The Well Connected FacilityThe Well Connected Facility
The Well Connected Facility
 
Open source for Libraries
Open source for LibrariesOpen source for Libraries
Open source for Libraries
 
Open Source for Libraries
Open Source for LibrariesOpen Source for Libraries
Open Source for Libraries
 
Open Sesame (and other open movements)
Open Sesame (and other open movements)Open Sesame (and other open movements)
Open Sesame (and other open movements)
 
Open Your Mind: Open Source in Libraries
Open Your Mind: Open Source in LibrariesOpen Your Mind: Open Source in Libraries
Open Your Mind: Open Source in Libraries
 
Of Dodos, 'Karma' & Free Software in the Library
Of Dodos, 'Karma' & Free Software in the LibraryOf Dodos, 'Karma' & Free Software in the Library
Of Dodos, 'Karma' & Free Software in the Library
 
Digital Libraries and the quest for information curation
Digital Libraries and the quest for information curationDigital Libraries and the quest for information curation
Digital Libraries and the quest for information curation
 

Dernier

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetEnjoy Anytime
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 

Dernier (20)

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 

Crowdsourcing Transcription with Open Source Software

  • 1. Crowdsourcing Transcription with Open Source Software Ben Brumfield MAC Fall Symposium 2013
  • 2. Why Transcribe?  Crowdsourcing can be − Tagging − Georectification − Identification  But if you've got scanned documents, you've got a problem
  • 3.
  • 4. Serendipity: One Volunteer's Story Nat Wooding – Semi-retired data analyst – 200 pages of Julia Brumfield's 1923 diary in nine months – No relation to diarist
  • 5. Serendipity: One Volunteer's Story Nat Wooding – Semi-retired data analyst – 200 pages of Julia Brumfield's 1923 diary in nine months – No relation to diarist – Great uncle was diarist's letter carrier, also named Nat Wooding
  • 6.
  • 7.
  • 9. Why Crowdsource? Free Labor! “Free as in beer” “Free as in speech” “Free as in....
  • 10. Free as in puppy! http://www.flickr.com/photos/magnusbrath/7614518858/
  • 11. Why Crowdsource? “At its best, crowdsourcing is not about getting someone to do work for you, it is about offering your users the opportunity to participate in public memory.” – Trevor Owens, “Crowdsourcing Cultural Heritage: The Objectives are Upside-down”
  • 12.
  • 13.
  • 14. Why Crowdsource? “By engaging the public in digitising our collections, we are − Increasing the scientific literacy of the public − Providing increased access to our collections − Building an advocacy network for our collections and our institutions.” – Paul Flemons, Australian Museum
  • 15. Why Crowdsource?  Convert website visitors into volunteers  Convert volunteers into advocates  What's next?
  • 17. Choosing a Transcription Platform  The good news: – More than 30 tools to choose from!
  • 18. Choosing a Transcription Platform  The good news: – More than 30 tools to choose from!  The bad news: – More than 30 tools to choose from!
  • 19. Selection Factors ● Source Material ● Transcript Purpose ● Organizational/Project Management Fit ● Financial and Technical Resources
  • 20. Source Material ● Is it of interest to anyone else? ● Is it under copyright? ● Does it need restricted access? ● Is it composed of “text” or “records”? ● How complex is the layout? How important is that layout?
  • 21. Purpose •How will you be using the transcribed data? – Traditional print editions – Searchable online editions •Do you want to use the system to analyze the text? •Do you need to import the transcripts into other systems? •Is public engagement the only goal?
  • 22. Organizational Fit •How important is traditional editorial workflow? •Will you rely on volunteers? How will you find and motivate them? •What is the duration of the project? •Is there a "final version"? •Is TEI a mandate?
  • 23. Financial and Technical Resources •System administrators to install non-hosted software? •Money to pay hosting costs? •Programming skills to customize a tool? •Money to pay programmers for customization? •Support for on-going costs to keep the site running, however small?
  • 24. The Tools ● Recent (oldest started in 2005) ● Influenced by origin ● Still pretty raw ● Most require tech expertise for set-up and customization ● All require making trade-offs http://tinyurl.com/TranscriptionToolGDoc
  • 25. Open-source, On-site Tools Scripto Bentham Transcription Desk NARA Transcribr Drupal Module Zooniverse Scribe
  • 26. Quick Definitions MediaWiki: Popular software framework for runnning wiki projects Wikipedia, Wikisource, Wiktionary, Wikitravel: Projects running on MediaWiki WikiMedia: Organization running many—but not all—MediaWiki-based wiki projects.
  • 27.
  • 28.
  • 29.
  • 30.
  • 31.
  • 32.
  • 33.
  • 34. Hosted Tools Virtual Transcription Laboratory Wikisource.org FromThePage.com
  • 37. Wikisource Live demo of State Library of Queensland on Wikisource showing project page, edit screen, and editorial workflow. Recommendation of Lori and the GLAMWiki group to help organizations navigate the community.
  • 38. FromThePage Live demo of FromThePage showing edit screen, wiki-linking a single term, read pages for a subject, full-text search on name variants, and auto-link.
  • 39. Thanks! Ben Brumfield benwbrum@gmail.com @benwbrum http://manuscripttranscription.blogspot.com My transcription tools: – FromThePage.com – OpenSourceIndexing.org http://tinyurl.com/TranscriptionToolGDoc