Text Collections and CONTENTdm
Using Monograph Compound
Objects to display the Orleans
Parish School Board Minute
Books
• The Collection: the history, the data, the images, the deadline
• The Strategy: monograph compound objects with a tab-delimited text file
• The Results: what went well, what didn’t, next steps
• Some Alternatives: considering other digital text collection display methods
Bound volumes of Orleans Parish School Board meeting minutes.
Dates Covered: 1841-1996
Includes the Civil War and Desegregation. Scholars from as far away as Japan consult this collection on site.
Thanks to UNO history professor Al Kennedy, who rescued many of the documents from being discarded.
A Board of Regents grant allowed UNO Midlo Center for New Orleans Studies historians to summarize and index approximately 900 pages of meeting minutes from just before, during, and right after the Civil War.

Indexed by:
• VOLUME
• MEETING
  • Meeting Title
  • Meeting Date
  • Board Members present, absent
  • Keywords
  • Meeting Summary
• PAGE
  • Page Summaries
  • Page Dates
Based on what was indexed as part of the grant, our data structure would have to support the following:
• Data
• Volume-Level Metadata (Title: Municipality, District, and Dates Covered)
• "Chapter"-Level Metadata (Meeting Information, Keywords, Dates)
• Page-Level Metadata (Page Summaries, Dates)

Given CONTENTdm as our repository tool, how would we make this happen?
"Monograph" is the compound object structure that would allow
us to keep the volume-meeting-page structure and retain the
index data created for all those levels (incl. page level).
Ultimately there will be data on 300-600 pages per volume for dozens of volumes of minute books. The UNO History Dept. provided data in Excel for the first three indexed volumes.
Verdict: Convert the Excel file into tab-delimited text for import into CONTENTdm.
What is a tab-delimited text file?
• a plain text file without formatting where data fields (Excel
cells) are separated by a "tab" character
• file is saved with extension ".txt"
• similar to a CSV file where a comma separates the values
instead of a tab
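The difference between the two formats can be sketched with Python's standard csv module. This is only an illustration: the record values are borrowed from the sample rows in this deck, and the helper name to_delimited is made up for the example.

```python
import csv
import io

# One sample record: a title that contains commas, a creator, a page range.
record = ["Meeting, June 1, 1847", "Orleans Parish School Board", "1-4"]

def to_delimited(rows, delimiter):
    """Serialize rows to text using the given field delimiter."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter=delimiter, lineterminator="\n")
    writer.writerows(rows)
    return buffer.getvalue()

tab_text = to_delimited([record], "\t")   # fields separated by tab characters
csv_text = to_delimited([record], ",")    # fields separated by commas

print(repr(tab_text))
print(repr(csv_text))
```

In the comma-separated version the title has to be wrapped in quotation marks because it contains commas itself; in the tab-delimited version it does not, which is one practical reason tabs suit metadata with dates and summaries.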
How do you make a tab-delimited text file from an Excel file?

When saving your Excel spreadsheet, choose "Text (Tab delimited)" from the "Save as type" drop-down box under the file name.

Remember where you save it. You will need to tell CONTENTdm where to find it later.
More information:
• Microsoft instructions: http://office.microsoft.com/en-us/excelhelp/import-or-export-text-txt-or-csv-filesHP010099725.aspx#BMexport
• CONTENTdm Help: "Using Tab-Delimited Text Files":
http://www.contentdm.org/help6/projectclient/entering5.asp
What kind of columns are necessary to tell CONTENTdm how to structure your "monograph"?
• Which rows are chapters?
• Which rows are pages?

Some terminology:
• Object: Book-level; the entire bound volume of minutes;
contains chapters, etc.
• Item: Page-level; an individual page within a book/object.
• CONTENTdm Field = Excel Column
• CONTENTdm Record = Excel Row
Our "Object":
Minute Book Volume 1, City of Lafayette, June 1, 1847 - July
5, 1854
Unique Identifier: op000001
Our "Items":
347 pages
(op000001_0001.jpg, op000001_0002.jpg, etc. etc. etc.)
Our "Chapters":
Meeting, June 1, 1847 (Pages 1-4)
Meeting, June 11, 1847 (Pages 5-10)
Meeting, June 24, 1847 (Pages 11-14)
etc. etc. etc.
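The page file names above follow a regular pattern (identifier, underscore, four-digit page number). A minimal sketch of generating them, assuming that pattern holds for every page; the function name page_file_names is invented for this example:

```python
def page_file_names(identifier, page_count, extension="jpg"):
    """Build names like op000001_0001.jpg for pages 1..page_count."""
    return [
        f"{identifier}_{number:04d}.{extension}"
        for number in range(1, page_count + 1)
    ]

# Volume 1 has the identifier op000001 and 347 page images.
names = page_file_names("op000001", 347)
print(names[0], names[-1], len(names))
```

A list like this can be pasted into the file name column of the spreadsheet instead of typing each name by hand.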
After creating a column for each field you want to populate in CONTENTdm (e.g. Title, Creator, etc.), you need two columns at the start of the Excel spreadsheet:
1. CDM_LVL - tells CONTENTdm where you want this row to fall in the book-chapter-page hierarchy.
2. CDM_LVL_NAME - this is what will display as the title of this row in the table of contents (e.g. "Chapter 9" or "Page 135")
Some libraries will not add a separate row for the "Chapter," but since we have metadata at that level, here is how we assigned levels for the OPSB project:

| CDM_LVL | Assigned Level |
| 0 | Book / Object |
| 1 | Meeting / Chapter |
| 2 | Page / Item |

NOTE: CONTENTdm will allow up to nine levels in a monograph compound object.
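One way to picture the level numbers is as indentation depth in a table of contents. The sketch below uses the three OPSB level assignments from the table; the outline function and the sample rows are illustrative assumptions, not part of CONTENTdm.

```python
# Level assignments used for the OPSB project (from the table above).
LEVEL_LABELS = {0: "Book / Object", 1: "Meeting / Chapter", 2: "Page / Item"}

def outline(rows):
    """Turn (level, title) pairs into indented table-of-contents lines."""
    lines = []
    for level, title in rows:
        indent = "  " * level  # two spaces per level of nesting
        lines.append(f"{indent}{title} [{LEVEL_LABELS[level]}]")
    return lines

sample = [
    (0, "Minute Book Volume 1"),
    (1, "Meeting, June 1, 1847"),
    (2, "Page 1"),
    (2, "Page 2"),
    (1, "Meeting, June 11, 1847"),
    (2, "Page 5"),
]

for line in outline(sample):
    print(line)
```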
| CDM_LVL | CDM_LVL_NAME | TITLE | CREATOR | PAGE | DESCRIPTION | KEYWORDS | FILE NAME |
| | City of Lafayette Meeting Minutes, 1847-1854 | City of Lafayette Meeting Minutes, 1847-1854 | Orleans Parish School Board | | | | |
| 0 | City of Lafayette Meeting Minutes, 1847-1854 | Front Cover | Orleans Parish School Board | | Public Board of Administrators meeting minutes, 1847-1854 | | op1_0001.jpg |
| 1 | City of Lafayette Meeting Minutes, 1847-1854 | Meeting, June 1, 1847 | Orleans Parish School Board | 1-4 | Discussion of whipping, Superintendent's monthly report, discussion of library, and discussion of attendance rules. | discipline; attendance; expenses | op1_0002.jpg |
| 2 | Meeting, June 1, 1847 | Page 1 | Orleans Parish School Board | 1 | Charges were leveled against Mrs. Smith for severely whipping a student. | | op1_0002.jpg |
| 2 | Meeting, June 1, 1847 | Page 2 | Orleans Parish School Board | 2 | Monthly Superintendent report discussion | | op1_0003.jpg |
| 2 | Meeting, June 1, 1847 | Page 3 | Orleans Parish School Board | 3 | Monthly Superintendent report discussion cont. | | op1_0004.jpg |
| 2 | Meeting, June 1, 1847 | Page 4 | Orleans Parish School Board | 4 | Discussion of attendance rules. | | op1_0005.jpg |
| 1 | City of Lafayette Meeting Minutes, 1847-1854 | Meeting, June 11, 1847 | Orleans Parish School Board | 5-10 | Results of whipping investigation was sole topic of discussion. | Discipline | op1_0006.jpg |
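Before importing a spreadsheet like this, it can help to check it for slips. The sketch below assumes the pattern visible in the sample rows: each page row (CDM_LVL 2) carries the title of its meeting in CDM_LVL_NAME and has a file name. The check_rows function is hypothetical and only looks at four of the columns.

```python
def check_rows(rows):
    """Check (CDM_LVL, CDM_LVL_NAME, TITLE, FILE NAME) tuples for common slips.

    Returns a list of human-readable problem descriptions (empty if none).
    """
    problems = []
    current_meeting = None
    for number, (level, level_name, title, file_name) in enumerate(rows, start=1):
        if level == "1":
            # A meeting row: remember its title for the pages that follow.
            current_meeting = title
        elif level == "2":
            if level_name != current_meeting:
                problems.append(
                    f"row {number}: page filed under {level_name!r}, "
                    f"expected {current_meeting!r}"
                )
            if not file_name:
                problems.append(f"row {number}: page has no file name")
    return problems

good_rows = [
    ("1", "City of Lafayette Meeting Minutes, 1847-1854", "Meeting, June 1, 1847", "op1_0002.jpg"),
    ("2", "Meeting, June 1, 1847", "Page 1", "op1_0002.jpg"),
    ("2", "Meeting, June 1, 1847", "Page 2", "op1_0003.jpg"),
]
bad_rows = good_rows + [("2", "Meeting, June 11, 1847", "Page 3", "")]

print(check_rows(good_rows))
print(check_rows(bad_rows))
```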
Once you have created a project in the Project Client, add a compound object:
Choose “Compound Object Wizard” in the “Add using” drop-down box and click “Add.”
Choose “Monograph” from the list of compound object types.
When asked, answer yes: we will be using a tab-delimited text file.
Browse to find your tab-delimited text file.
Browse to find the directory where your page (item) files are saved.

NOTE: All image (page) files for an object (book) must be saved in the same directory.
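Since every page image has to sit in one directory, a quick check before running the wizard can catch missing scans. This is a sketch using only Python's standard library; the function name missing_files is an assumption for illustration, and the file names would come from the FILE NAME column of the tab-delimited file.

```python
from pathlib import Path

def missing_files(file_names, directory):
    """Return the listed file names that are not present in the directory.

    Blank entries (rows without a file name) are skipped.
    """
    folder = Path(directory)
    return [
        name for name in file_names
        if name and not (folder / name).is_file()
    ]

# Example call (the directory path here is hypothetical):
# print(missing_files(["op000001_0001.jpg", "op000001_0002.jpg"], "scans/op000001"))
```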
“Label pages using tab-delimited text file” will label each page with its actual title as opposed to something like “op000005_0039”…
Click through the summaries and click “Finish” to upload the files to CONTENTdm.
Notice how it is adding more items than you have pages?

“But I only had 347 pages!!!”
This is because of all the added structure rows (chapters, etc.), which CONTENTdm counts as items:

571 rows in Excel = 347 page rows plus all the chapter/meeting-level rows.
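Taking the slide's figures at face value, the arithmetic works out as follows. The second half is a small sketch of counting rows per level from the CDM_LVL column, using made-up sample values.

```python
from collections import Counter

total_rows = 571   # rows in the Excel file, per the slide
page_rows = 347    # rows that describe a page image
structure_rows = total_rows - page_rows
print(structure_rows)  # rows that are not pages (meetings and other structure)

# Counting rows per level from the CDM_LVL column of a small sample:
sample_levels = ["0", "1", "2", "2", "2", "2", "1", "2", "2"]
counts = Counter(sample_levels)
print(counts["2"], "page rows,", counts["1"], "meeting rows")
```

Running the same count on the real spreadsheet before import shows how many items to expect in the upload summary.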
Table of Contents Navigation is Confusing
• Multiple expansions are necessary to get to page links
• "Plus" (+) expansion icon is very tiny. It is hard to recognize as something to click and hard to hit with the mouse pointer.

Book metadata ("Object Description") difficult to tell from page metadata ("Description")
• not clear to user what these terms mean
• helped by suppressing certain repeated fields

Users give up before they find “Search by Date”
• "Narrow your search by Date" only gives a few options, which seem random.
• After “Advanced Search,” user must find and click another tiny link to “Search by Date.”
• “Search by Date” returns every individual page in a date range - quite a few results, given that each volume is 300 to 600 pages long. Either need a better way to filter or need to take date off page records.
• Have since added many more unindexed books to the original three indexed as part of the grant. We hope there will be support to index these as well.
• Would like to ask historians or library staff to further index these by Municipality / District. This information is in the title but is not split out as data. Complicated because it changed over time…
• Would like to add CQRs (custom queries and results) and other search mechanisms to supplement CDM search and take advantage of rich data.
• PAGE TURNER!!!!!
• Logical way for users to also download complete PDF of minute books…
TEI Encoding

What it is
Not page images - take the text of a work, encode it in XML using the TEI standard, and write a Web app to output the XML file(s).

In Action:
Folger Digital Texts: http://www.folgerdigitaltexts.org/

METS

What it is
An XML "wrapper" that builds a structure around other metadata records (e.g. Dublin Core page records, etc.). This structure could include such levels as chapter, page, paragraph, sentence, headline, caption, and much more.

In Action:
The (CUA) Tower Online: http://tower.lib.cua.edu/

NOTE: You can encode Dublin Core records, TEI transcriptions, and more within a METS wrapper. CONTENTdm can handle METS through the Flex Loader (usually via a vendor).
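To give a feel for the METS "wrapper" idea, here is a rough sketch that builds a nested structural map (volume, meeting, page) with Python's standard xml.etree.ElementTree. It is not a complete or validated METS document: the TYPE values, labels, and the build_struct_map helper are assumptions for illustration, and a real record would also carry file and descriptive metadata sections.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
ET.register_namespace("mets", METS_NS)

def q(tag):
    """Qualify a tag name with the METS namespace."""
    return f"{{{METS_NS}}}{tag}"

def build_struct_map(volume_label, meetings):
    """Build a mets element holding one structMap of nested div elements."""
    mets = ET.Element(q("mets"))
    struct_map = ET.SubElement(mets, q("structMap"), TYPE="logical")
    volume = ET.SubElement(struct_map, q("div"), TYPE="volume", LABEL=volume_label)
    for meeting_label, page_labels in meetings:
        meeting = ET.SubElement(volume, q("div"), TYPE="meeting", LABEL=meeting_label)
        for page_label in page_labels:
            ET.SubElement(meeting, q("div"), TYPE="page", LABEL=page_label)
    return mets

root = build_struct_map(
    "City of Lafayette Meeting Minutes, 1847-1854",
    [("Meeting, June 1, 1847", ["Page 1", "Page 2", "Page 3", "Page 4"])],
)
print(ET.tostring(root, encoding="unicode"))
```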
• Creating Compound Objects (Documents, Monographs,
Postcards, and Picture Cubes):
http://www.contentdm.com/USC/tutorials/compoundwizard.pdf
• Adding Compound Objects with Tab-Delimited Text:
http://www.contentdm.com/help6/objects/adding3a.asp
• Clemson University documentation (more detailed
instruction and uses more levels): http://libraryweb.clemson.edu/wiki/images/9/92/Using_a_tabdelimited_for_mongraphs.pdf
| CDM_LVL | CDM_LVL_NAME | TITLE | CREATOR | PAGE | DESCRIPTION | KEYWORDS | FILE NAME |
| | A Very Exciting Tale | A Very Exciting Tale | Smith, Joe | | | | |
| 0 | A Very Exciting Tale | Front Cover | Smith, Joe | 1 | Cover of the book | fiction; excitement | js000001_0001.jpg |
| 1 | A Very Exciting Tale | Chapter 1 | Smith, Joe | 2-4 | Our hero wakes up | | js000001_0002.jpg |
| 2 | Chapter 1 | Page 2 | Smith, Joe | 2 | Joe gets out of bed. | | js000001_0002.jpg |
| 2 | Chapter 1 | Page 3 | Smith, Joe | 3 | Joe has breakfast. | | js000001_0003.jpg |
| 2 | Chapter 1 | Page 4 | Smith, Joe | 4 | Joe goes to work. | | js000001_0004.jpg |
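The example rows can be read back into a simple table of contents by grouping page rows under the chapter named in CDM_LVL_NAME. This is a sketch under that assumption; pages_by_chapter is an invented helper, only three columns are used, and rows at other levels (such as the front cover) are ignored.

```python
def pages_by_chapter(rows):
    """Group page titles under their chapter.

    rows are (CDM_LVL, CDM_LVL_NAME, TITLE) tuples in spreadsheet order.
    """
    contents = {}
    for level, level_name, title in rows:
        if level == "1":
            # A chapter row: make sure the chapter exists even if it has no pages.
            contents.setdefault(title, [])
        elif level == "2":
            # A page row: file it under the chapter named in CDM_LVL_NAME.
            contents.setdefault(level_name, []).append(title)
    return contents

example_rows = [
    ("0", "A Very Exciting Tale", "Front Cover"),
    ("1", "A Very Exciting Tale", "Chapter 1"),
    ("2", "Chapter 1", "Page 2"),
    ("2", "Chapter 1", "Page 3"),
    ("2", "Chapter 1", "Page 4"),
]

toc = pages_by_chapter(example_rows)
print(toc)
```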

Text Collections and CONTENTdm

  • 1. Using Monograph Compound Objects to display the Orleans Parish School Board Minute Books
  • 2. • The Collection the history, the data, the images, the deadline • The Strategy monograph compound objects w/a tab-delimited text file • The Results what went well, what didn’t, next steps • Some Alternatives considering other digital text collection display methods
  • 3. Bound volumes of Orleans Parish School Board meeting minutes. Dates Covered: 1841-1996 Includes the Civil War and Desegregation. Scholars as far away as Japan consult this collection on site Thanks to UNO history professor Al Kennedy who rescued many of the documents from being discarded
  • 4. Indexed by: • VOLUME A Board of Regents grant allowed UNO Midlo Center for New Orleans Studies historians to summarize and index +/900 pages of meeting minutes from just before, during, and right after the Civil War. • MEETING • Meeting Title • Meeting Date • Board Members present, absent • Keywords • Meeting Summary • PAGE • Page Summaries • Page Dates
  • 5. Based on what was indexed as part of the grant, our data structure would have to support the following: • Data • Volume-Level Metadata (Title: Municipality, District, and Dates Covered) • "Chapter"-Level Metadata (Meeting Information, Keywords, Dates) • Page-Level Metadata (Page Summaries, Dates) Given CONTENTdm as our repository tool, how would we make this happen?
  • 6. "Monograph" is the compound object structure that would allow us to keep the volume-meeting-page structure and retain the index data created for all those levels (incl. page level).
  • 7. Ultimately will be data on 300-600 pages for dozens of volumes of minute books. The UNO History Dept. provided data in Excel for first three indexed volumes. Verdict: Convert Excel file into tab-delimited text for import into CONTENTdm. What is a tab-delimited text file? • a plain text file without formatting where data fields (Excel cells) are separated by a "tab" character • file is saved with extension ".txt" • similar to a CSV file where a comma separates the values instead of a tab
  • 8.
  • 9. How do you make a tab-delimited text file from an Excel file? When saving your Excel spreadsheet, choose "Text (Tab delimited)" from the "Save as type" drop down box under the file name Remember where you save it. You will need to tell CONTENTdm where to find it later. More information: • Microsoft instructions: http://office.microsoft.com/en-us/excelhelp/import-or-export-text-txt-or-csv-filesHP010099725.aspx#BMexport • CONTENTdm Help: "Using Tab-Delimited Text Files": http://www.contentdm.org/help6/projectclient/entering5.asp
  • 10. What kind of columns are necessary to tell CONTENTdm how to structure your "monograph?“ • Which rows are chapters? • Which rows are pages? Some terminology: • Object: Book-level; the entire bound volume of minutes; contains chapters, etc. • Item: Page-level; an individual page within a book/object. • CONTENTdm Field = Excel Column • CONTENTdm Record = Excel Row
  • 11. Our "Object": Minute Book Volume 1, City of Lafayette, June 1, 1847 - July 5, 1854 Unique Identifier: op000001 Our "Items": 347 pages (op000001_0001.jpg, op000001_0002.jpg, etc. etc. etc.) Our "Chapters": Meeting, June 1, 1847 (Pages 1-4) Meeting, June 11, 1847 (Pages 5-10) Meeting, June 24, 1847 (Pages 11-14) etc. etc. etc.
  • 12. After creating a column for each field you want to populate in CONTENTdm (e.g. Title, Creator, etc.), you need two columns at the start of the Excel spreadsheet: 1. CDM_LVL - tells CONTENTdm where you want this row to fall in the book-chapter-page hierarchy. 2. CDM_LVL_NAME - this is what will display as the title of this row in the table of contents (e.g. "Chapter 9" or "Page 135")
  • 13. Some libraries will not add a separate row for the "Chapter," but since we have metadata at that level, here is how we assigned levels for the OPSB project: CDM_LVL Assigned Level 0 Book / Object 1 Meeting / Chapter 2 Page / Item NOTE: CONTENTdm will allow up to nine levels in a monograph compound object.
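To make the level assignment concrete, the following Python sketch renders (level, title) pairs as an indented outline, using the three levels listed on this slide. The `outline` helper and the sample titles are illustrative assumptions; CONTENTdm builds its own table of contents from the CDM_LVL column and does not use this code.

```python
# Level numbers and labels as assigned for the OPSB project on this slide.
LEVEL_LABELS = {0: "Book / Object", 1: "Meeting / Chapter", 2: "Page / Item"}


def outline(rows):
    """Render (CDM_LVL, title) pairs as indented outline lines."""
    lines = []
    for level, title in rows:
        if level not in LEVEL_LABELS:
            raise ValueError(f"unexpected CDM_LVL value: {level!r}")
        # Two spaces of indentation per level of the hierarchy.
        lines.append("  " * level + f"{title} [{LEVEL_LABELS[level]}]")
    return lines


# Hypothetical sample loosely modeled on the first meeting in Volume 1.
sample = [
    (0, "Minute Book Volume 1"),
    (1, "Meeting, June 1, 1847"),
    (2, "Page 1"),
    (2, "Page 2"),
]

for line in outline(sample):
    print(line)
```

A preview like this can help spot a row that was given the wrong level before the file is imported.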
  • 14. CDM_LVL | CDM_LVL_NAME | TITLE | CREATOR | PAGE | DESCRIPTION | KEYWORDS | FILE NAME
       | City of Lafayette Meeting Minutes, 1847-1854 | City of Lafayette Meeting Minutes, 1847-1854 | Orleans Parish School Board | | | |
      0 | City of Lafayette Meeting Minutes, 1847-1854 | Front Cover | Orleans Parish School Board | | Public Board of Administrators meeting minutes, 1847-1854 | | op1_0001.jpg
      1 | City of Lafayette Meeting Minutes, 1847-1854 | Meeting, June 1, 1847 | Orleans Parish School Board | 1-4 | Discussion of whipping, Superintendent's monthly report, discussion of library, and discussion of attendance rules. | discipline; attendance; expenses | op1_0002.jpg
      2 | Meeting, June 1, 1847 | Page 1 | Orleans Parish School Board | 1 | Charges were leveled against Mrs. Smith for severely whipping a student. | | op1_0002.jpg
      2 | Meeting, June 1, 1847 | Page 2 | Orleans Parish School Board | 2 | Monthly Superintendent report discussion | | op1_0003.jpg
      2 | Meeting, June 1, 1847 | Page 3 | Orleans Parish School Board | 3 | Monthly Superintendent report discussion cont. | | op1_0004.jpg
      2 | Meeting, June 1, 1847 | Page 4 | Orleans Parish School Board | 4 | Discussion of attendance rules. | | op1_0005.jpg
      1 | City of Lafayette Meeting Minutes, 1847-1854 | Meeting, June 11, 1847 | Orleans Parish School Board | 5-10 | Results of whipping investigation was sole topic of discussion. | Discipline | op1_0006.jpg
  • 15. Once you have created a project in the Project Client, add a compound object:
  • 16. Choose “Compound Object Wizard” in the “Add using” drop-down box and click “Add.”
  • 17. Choose “Monograph” from the list of compound object types. Yes, we will be using a tab-delimited text file.
  • 18. Browse to find your tab-delimited text file. Browse to find the directory where your page (item) files are saved. NOTE: All image (page) files for an object (book) must be saved in the same directory.
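Since every page image must sit in one directory, it can be worth checking the file names in the text file against that directory before running the wizard. The sketch below is one possible pre-flight check in Python; the `missing_files` helper is hypothetical, and the column name "FILE NAME" is assumed from the sample spreadsheet header, so adjust it to match your own file.

```python
import csv
from pathlib import Path


def missing_files(tsv_path, image_dir, column="FILE NAME"):
    """List file names from the tab-delimited file that are absent from image_dir."""
    image_dir = Path(image_dir)
    missing = []
    with open(tsv_path, newline="", encoding="utf-8") as handle:
        for row in csv.DictReader(handle, delimiter="\t"):
            name = (row.get(column) or "").strip()
            # Rows without a file name (e.g. the object-level row) are skipped.
            if name and name not in missing and not (image_dir / name).is_file():
                missing.append(name)
    return missing
```

An empty list means every named file was found in the directory; any names returned point to images that still need to be moved or renamed.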
  • 19.
  • 20. “Label pages using tab-delimited text file” will label each page with its actual title as opposed to something like “op000005_0039”…
  • 21. Click through the summaries and click “Finish” to upload the files to CONTENTdm. Notice how it is adding more items than you have pages? “But I only had 347 pages!!!”
  • 22. This is because of all the added structure rows (chapters, etc.), which CONTENTdm counts as items: 571 rows in Excel = 347 page rows plus all the chapter/meeting-level rows.
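The gap between the page count and the item count can be checked ahead of time by tallying rows per level in the text file. This Python sketch assumes a column named CDM_LVL, as in the spreadsheet described earlier; the `count_by_level` helper and the sample rows are made up for illustration.

```python
import csv
import io
from collections import Counter


def count_by_level(tsv_text, column="CDM_LVL"):
    """Count data rows per CDM_LVL value in tab-delimited text."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    return Counter((row.get(column) or "").strip() for row in reader)


# Hypothetical sample: one book row, two meeting rows, three page rows.
sample = "\n".join(
    "\t".join(cells)
    for cells in [
        ["CDM_LVL", "TITLE"],
        ["0", "Front Cover"],
        ["1", "Meeting A"],
        ["2", "Page 1"],
        ["2", "Page 2"],
        ["1", "Meeting B"],
        ["2", "Page 3"],
    ]
)

counts = count_by_level(sample)
total_rows = sum(counts.values())
print(f"{counts['2']} page rows out of {total_rows} rows in total")
```

In this sample the total is twice the page count, for the same reason the upload summary reports more items than pages.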
  • 23. Table of Contents Navigation is Confusing • Multiple expansions are necessary to get to page links • The "plus" (+) expansion icon is very tiny: hard to notice as something that should be clicked, and hard to hit with the mouse pointer.
  • 24. Book metadata ("Object Description") difficult to tell from page metadata ("Description") • not clear to user what these terms mean • helped by suppressing certain repeated fields
  • 25. Users give up before they find “Search by Date” • “Narrow your search by Date” only gives a few options, which seem random. • After “Advanced Search,” the user must find and click another tiny link to “Search by Date.” • “Search by Date” returns every individual page in a date range - quite a few results, given that each volume is 300 to 600 pages long. We either need a better way to filter or need to take the date off page records.
  • 26. • Have since added many more unindexed books to the original three indexed as part of the grant. We hope there will be support to index these as well. • Would like to ask historians or library staff to further index these by Municipality / District. This information is in the title but is not split out as data. Complicated because it changed over time… • Would like to add CQRs, other search mechanisms to supplement CDM search and take advantage of rich data. • PAGE TURNER!!!!! • Logical way for users to also download complete PDF of minute books…
  • 27. TEI Encoding • What it is: Not page images - take the text of a work, encode it in XML using the TEI standard, and write a Web app to output the XML file(s). • In Action: Folger Digital Texts: http://www.folgerdigitaltexts.org/ METS • What it is: An XML "wrapper" that builds a structure around other metadata records (i.e. Dublin Core page records, etc.). This structure could include such levels as chapter, page, paragraph, sentence, headline, caption, and much more. • In Action: The (CUA) Tower Online: http://tower.lib.cua.edu/ NOTE: You can encode Dublin Core records, TEI transcriptions, and more within a METS wrapper. CONTENTdm can handle METS through the Flex Loader (usually via a vendor).
  • 28. • Creating Compound Objects (Documents, Monographs, Postcards, and Picture Cubes): http://www.contentdm.com/USC/tutorials/compoundwizard.pdf • Adding Compound Objects with Tab-Delimited Text: http://www.contentdm.com/help6/objects/adding3a.asp • Clemson University documentation (more detailed instruction and uses more levels): http://libraryweb.clemson.edu/wiki/images/9/92/Using_a_tabdelimited_for_mongraphs.pdf
  • 29. CDM_LVL | CDM_LVL_NAME | TITLE | CREATOR | PAGE | DESCRIPTION | KEYWORDS | FILE NAME
       | A Very Exciting Tale | A Very Exciting Tale | Smith, Joe | | | |
      0 | A Very Exciting Tale | Front Cover | Smith, Joe | 1 | Cover of the book | fiction; excitement | js000001_0001.jpg
      1 | A Very Exciting Tale | Chapter 1 | Smith, Joe | 2-4 | Our hero wakes up | | js000001_0002.jpg
      2 | Chapter 1 | Page 2 | Smith, Joe | 2 | Joe gets out of bed. | | js000001_0002.jpg
      2 | Chapter 1 | Page 3 | Smith, Joe | 3 | Joe has breakfast. | | js000001_0003.jpg
      2 | Chapter 1 | Page 4 | Smith, Joe | 4 | Joe goes to work. | | js000001_0004.jpg