Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
WebWise 2014: Developing a Digitization Initiative at PBS (National Digital Stewardship Residency)
1. Developing a Digitization
Initiative at PBS
Lauren Work
WebWise 2014
2/11/2014
@squaredsong | lawork@pbs.org
“PBS Headquarters in Crystal City” by melanie.phung, used under CC BY
3. Project Description
1. Develop selection criteria for at-risk media
2. Create digitization workflow for
selected media
3. Policy update recommendations
4. Selection Criteria for Digitization
-Focus on institutional context
-Research & use existing tools
-Columbia University: AVDb
-University of Illinois: AvSAP
-Find,
organize & refine data
5. PBS Selection Criteria
-Title held on multiple formats
-Unique title (emphasis on orphans)
-Copyright and descriptive metadata available
-Title fits with current PBS/LC deposit policy
-Title held elsewhere
Selection criteria for digitization – lessons learnedInstitutional context = what is your institutional mission and how can you align your digital projects with that mission (if they aren’t already?). Demonstration & alignment of digital projects & preservation is important. Institutional context will also shape how you develop your selection criteria.For example, PBS has specific needs drove my particular selection criteria & may shape how you spend your time and resources: -Outright focus on a selection of media based on a temporary assignment (this could also fit ideas like grants, etc.) -Time limit, huge collection scope and inability to playback media means less focus on physical criteria and more focus on -Broadcast & distribution environment, not a library -No rights to majority of collection - Existing deposit agreement with Library of Congress shapes institutional policy -Huge collection holding 40 years of formats and material -Multiple system migrations means lost or very limited metadata -Member stations may be unaware of what they hold due to money & staff limitationsFind org & refine data:Materials, especially A/V have huge backlogs in cataloging in academic institutions & archives. PBS members stations don’t have the staff or time to do thorough inventories (though the inventory project through CPB helped a few years back)Think about what metadata schema may work best for you – PBS does not catalog as is focused on broadcast and using a trafficing system vs. a cataloging system.
Share. Figuring out what we have collectively will help everyone be able to focus on identifying unique items within their own collections.For PBS, I plan to make this documented collection overlap via my selection criteria public to aid both researchers and collection managers at various member stations, universities, and other organizations be able to reference content.