Crowdsourced
Manuscript Transcription
         Ben Brumfield
     Roots and Routes 2012
Not just crowdsourcing...
● Collaborative work
● Off-site solo work
● Private work
Not just manuscripts...
●   Maps
●   Textiles
●   Music
●   Flawed OCR
Not just transcription...
● Indexing
● Editing
● Identification

Counting seals on Arctic ice caps.
What it isn't
We'll concentrate on web-based tools for
extracting text from images, not addressing:
● Oral History
● Video
● Audio Transcription
● Image Manipulation
● Transcription/Facsimile Display

Tools do exist for these tasks, however.
Break
What materials are you working with outside of
modern, printed books and websites?
Origins (Approaches)
Two Approaches and one Dead End
● Indexing
● Editing
● Tagging
Indexing
●   Structured Data
●   Extracts from Text vs. Representing Text
●   Databases for Search and Analysis
●   Granular Quality Control
●   Gamification
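
One way to picture "granular quality control" in an indexing project is field-by-field agreement between volunteers who key the same record. The sketch below is only an illustration under that assumption, not the method of any tool named in this deck; the field names, the sample entries, and the `min_agreement` threshold are all hypothetical.

```python
from collections import Counter


def field_consensus(entries, min_agreement=2):
    """Return (accepted, disputed) for one record keyed by several volunteers.

    A field is accepted when at least `min_agreement` volunteers entered the
    same normalized value; otherwise every value entered is kept for review.
    """
    accepted, disputed = {}, {}
    fields = sorted({name for entry in entries for name in entry})
    for name in fields:
        # Normalize lightly so case and stray spaces do not block agreement.
        values = [entry[name].strip().lower() for entry in entries if name in entry]
        value, count = Counter(values).most_common(1)[0]
        if count >= min_agreement:
            accepted[name] = value
        else:
            disputed[name] = values
    return accepted, disputed


# Hypothetical data: three volunteers key the same line of a menu.
entries = [
    {"dish": "Oyster Stew", "price": "0.25"},
    {"dish": "oyster stew", "price": "0.35"},
    {"dish": "Oyster Stew ", "price": "0.85"},
]
accepted, disputed = field_consensus(entries)
print(accepted)  # fields the volunteers agree on
print(disputed)  # fields that need a reviewer
```

Because each field is checked separately, a reviewer only has to look at the disputed price, not the whole record.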
Editing
●   Books, Diaries, Letters, Articles
●   Representing Text
●   Traditional Editorial Workflow
●   Digital or Print Editions
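
The difference between the two approaches can be sketched as two data shapes: indexing stores extracts from the text as structured fields, while editing stores a representation of the text itself. This is a minimal illustration, and the class names, field names, and sample values are hypothetical rather than taken from any tool in this deck.

```python
from dataclasses import dataclass


@dataclass
class IndexRecord:
    """Indexing: a structured extract taken from a page image."""
    page: str
    name: str
    year: int


@dataclass
class PageTranscript:
    """Editing: the running text of a page, represented as written."""
    page: str
    text: str


# Hypothetical sample data for two pages.
records = [
    IndexRecord(page="p1", name="Mary Smith", year=1850),
    IndexRecord(page="p2", name="John Smith", year=1861),
]
transcripts = [
    PageTranscript(page="p1", text="Mary Smith came to visit today."),
    PageTranscript(page="p2", text="Rain all day. Wrote to John."),
]

# Structured data supports precise queries on individual fields...
before_1860 = [r.name for r in records if r.year < 1860]

# ...while represented text supports reading and full-text search.
mentions_rain = [t.page for t in transcripts if "rain" in t.text.lower()]

print(before_1860)
print(mentions_rain)
```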
Tagging
● Too small
● Too imprecise
Origins (Traditions)
●   OCR Correction
●   Documentary Editing
●   Genealogy
●   Natural Science
●   Astronomy

Split this into 5 slides
Online Tools
● Recent (none older than 2005)
● Influenced by origin
● Still pretty raw
● Most require tech expertise for set-up and
  customization
● All require making trade-offs
Lab Session 1: Breadth
NYPL What's on the Menu
  Indexing

Wikisource
  Editing
Selection Factors
●   Source Material
●   Transcript Purpose
●   Organizational/Project Management Fit
●   Financial and Technical Resources
Source Material
Evaluating your source material:
● Is it of interest to anyone else?
● Is it under copyright?
● Does it need restricted access?
● Is it composed of documents or records?
● Is it non-textual?
● How complex is the layout? How important
  is that layout?
Purpose
How will you be using the transcribed data?
● Traditional print editions
● Searchable online editions
● Do you want to use the system to analyze
  the text?
● How do you want to analyze the text?
● Is public engagement a goal?
● Should the transcripts be open?
Organizational/Project Management Fit

● How important is traditional editorial
  workflow?
● Will you rely on volunteers? How will you
  motivate them?
● What is the duration of the project?
● Is there a "final version"?
● Is TEI a mandate?
Financial and Technical Resources
Do you have or need:
● System administrators to install non-hosted
  software?
● Money to pay hosting costs?
● Programming skills to customize a tool?
● Money to pay programmers for
  customization?
● Support for on-going costs to keep the site
  running, however small?
Lab Session 2: Markup Options
FromThePage

TranscribeBentham
Technical Questions to Answer
● Where are the images now?
● How do images get into the system?
● How do transcripts get out of the system?
● How mature is the underlying technology?
● How configurable is the technology?
● How does the system work with the public
  face of your project?
● Where does the metadata live?
● Who will maintain this? How long?
● How many sites are using this system?
Wikisource
Pro:
● MediaWiki plus its add-on modules (e.g.
  print-on-demand, export).
● Wikimedia community.
● Incredibly mature.
Con:
● Wikimedia policy.
● Public editing.
● Limited mark-up.
Bentham Transcription Desk
Pro:
● MediaWiki is very mature.
● TEI Toolbar (can also be used on other
  systems)
● Deployed outside original project.

Con:
● Development efforts halted.
Scripto
Pro:
● Team at CHNM has a great track record.
● Your CMS is your public face.
● MediaWiki is very mature.
● Deployed and under active development.

Con:
● Your CMS handles all metadata.
● Mark-up is extremely limited.
FromThePage
Pro:
● Designed for intensive editing and indexing.
● Semantic mark-up and analysis.
● Hosting available.

Con:
● Single developer (me).
● No TEI mark-up.
Islandora TEI Editor
Caveat: I don't know much about this tool or
this team.

● Based on Drupal and Fedora
● Supports TEI via friendly interface
● Many Drupal-based projects considering it.
T-PEN
Caveat: I don't know much about this tool.

●   Designed for medieval manuscripts.
●   Supports TEI natively.
●   Line-by-line interface.
●   Hosted version available.
Scribe
Pro:
● Excellent for complex layout or
  non-documentary transcription.
● Zooniverse team is large, well-funded,
  experienced.
● Configurable.
Con:
● No automated tool for loading images or
  viewing transcript database (yet!)
● No concept of image-as-a-text.
Pybossa
Caveat: I don't know much about this tool or
this team.

● Open Knowledge Foundation's
  crowdsourcing task management tool.
● Designed for tabular data.
● Google Spreadsheet data entry.
● Extremely young.
TextLab
Caveat: I don't know much about this tool or
this team.

● Melville Electronic Library.
● Direct addition of TEI tags to image.
Lab Session 3: Configuration
Scribe
  Old Weather,
  What's the Score,
  Development deployments
Find me
                Ben Brumfield
           benwbrum@gmail.com
 http://manuscripttranscription.blogspot.com/
                @benwbrum
