SlideShare une entreprise Scribd logo
1  sur  47
NEW STRATEGIES FOR MSS

  Bradley Daigle (@BradleyDaigle) / Mike Durbin -
                University of Virginia
What is a manuscript?
What’s a manuscript
What are we dealing with?
Complexity
Manual Processes
Workflow Challenges


Digital orphans

Access rights

Hybrid collections
EADs
bring the search to the
             user, don’t make the user
             have to understand how
             we organize descriptive
             information!




Made by archivists for archivists
New technology, new needs
Instant integration
Single infrastructure
What are we doing?
<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet href="http://ead.lib.virginia.edu/vivaead/published/document.xsl" type="text/xsl"?>
<ead xmlns="urn:isbn:1-931666-22-9" id="viu01215">
 <eadheader audience="internal" langencoding="iso639-2b" findaidstatus="edited-full-draft" scriptencoding="iso15924" dateencoding="iso8601"
countryencoding="iso3166-1" repositoryencoding="iso15511">
  <eadid publicid="PUBLIC &amp;#34;-//University of Virginia::Library::Special Collections Dept.//TEXT (US::ViU::viu01215::A Guide to the Papers
of John Dos Passos 1865-1998)//EN &amp;#34;viu01215.xml&amp;#34;" countrycode="US" mainagencycode="US-ViU">PUBLIC
        "-//University of Virginia::Library::Special Collections
        Dept.//TEXT (US::ViU::viu01215::A Guide to the Papers of
        John Dos Passos 1865-1998)//EN "viu01215.xml"</eadid>
  <filedesc>
    <titlestmt>
     <titleproper>A Guide to the Papers of John Dos Passos
       <date era="ce" calendar="gregorian">1865-1998</date></titleproper>
     <subtitle id="sort">Dos Passos, John, Papers
       <num type="collectionnumber">5950</num></subtitle>
     <author>Special Collections Staff</author>
    </titlestmt>
    <publicationstmt>
     <publisher>Special Collections, University of Virginia Library
</publisher>
How is this different?
Think about finding aids not EADs

Let the archivists focus on what they do best

IP landscapes are flexible

  human and machine actionable

Avoid the digitization / description dilemma

Not worry about variable levels of description

Optimized for digital surrogates and born digital content
Not all rainbows and unicorns
Challenges with integrating search results

Relational database could get quite large

Complex data storage model

Migration of legacy content
Looking ahead
Granular circulation

Risk Management

Create virtual collections

Alternate metadata options for description

Archival prioritization of search results
Data Model Constraints
Unknowns
 Metadata
   Format
   Publication-ready?
   Unique ids?
 Workflows to support
   Only “complete” collections ingested?
   Editing after ingestion?
   Editing happen in the repository or out of the repository?
Data Model Goals
 Allow for multiple hierarchies to describe the same resources
 Allow for metadata in various formats
 Support ingest of “finished” EADs but anticipate future edits,
replacements and reorganizations
Data Model
From Finding Aid
                                 Collection




                                Component
                                 Component
                                  Component




                   Item            Item       Item
                      Item           Item       Item
                         Item          Item       Item
Data Model
                             Collection
From Finding Aid




                             Compo
                            Component
                              Compo
                              nent
                               nent


                      Ite
                   Item          Ite
                               Item
                                 Ite        Ite
                                          Item
                                           Ite
                     Ite
                     m           m
                                 m         m
                                           m
                      m
Data Model
                             Collection               MARC
                                                      MARC
From Finding Aid                                      MARC
                                                                      From Catalog




                             Compo
                            Component
                              Compo           Container   Container
                              nent
                               nent


                      Ite
                   Item          Ite
                               Item
                                 Ite        Ite
                                          Item
                                           Ite
                     Ite
                     m           m
                                 m         m
                                           m
                      m
Data Model
                             Collection               MARC
                                                      MARC
From Finding Aid                                      MARC
                                                                      From Catalog




                             Compo
                            Component
                              Compo           Container   Container
                              nent
                               nent


                      Ite
                   Item          Ite
                               Item
                                 Ite        Ite
                                          Item
                                           Ite
                     Ite
                     m           m
                                 m         m
                                           m
                      m
Data Model
                             Collection               MARC
                                                      MARC
From Finding Aid                                      MARC
                                                                      From Catalog




                             Compo
                            Component
                              Compo           Container   Container
                              nent
                               nent


                      Ite
                   Item          Ite
                               Item
                                 Ite        Ite
                                          Item
                                           Ite
                     Ite
                     m           m
                                 m         m
                                           m
                      m
Data Model
                             Collection               MARC
                                                      MARC
From Finding Aid                                      MARC
                                                                               From Catalog




                             Compo
                            Component
                              Compo           Container     Container
                              nent
                               nent


                      Ite
                   Item          Ite
                               Item
                                 Ite        Ite
                                          Item
                                           Ite
                     Ite                                        Digitized
                     m           m
                                 m         m
                                           m
                                                                                    From Digitization and
                      m                                           Item             patron request workflow



                                                          Digitized
                                                               Digitized
                                                             File
                                                                   Digitized
                                                                  File
                                                                       File
Data Model
                                            Collectio
                                                                        MARC
                                                n
            From Finding Aid
                                                                                                        From Catalog



                                        Component
                                            Comp
                                             Comp
                                                                 Container     Container




                                              Item                                                           From Digitization
                               Item                       Item
                                Ite
                                 Ite            Ite            Ite
                                                                Ite                Digitized
                                                                                                            and patron request
                                                                                                                 workflow
                                                                                     Item


                                                                             Digitized
                                                                                  Digitized
                                                                                File
                                                                                      Digitized
                                                                                     File
                                                                                          File
                                                                                                     Digitized
                                Collectio                                                              Item
                                                        Item
                                    n
                                                          Item
From Finding Aid or
other collection
                                                             Item                              Digitized
description source                                                                                  Digitized
                                                                                                  File
                                                                                                        Digitized
                                                                                                       File
                                                                                                            File
Fedora Metadata Philosophy


“Catalog in the format that is most suited to your materials but disseminate in
the format that’s most suited to your use”
Metadata Model
                                                                R C
                                   A
                    Collection                     MARC



                                 M
                   Compo
                                 ML
                    Compo
                               X
                   Component               Container       Container
                    nent

    D
                     nent


 EA    Ite
     Item
        m
         Ite
                      Item
                         Ite        ItemIte
                                         Ite
                                         m                       Digitized
          m               m               m                        Item



                                                       Digitized File
                                                             Digitized File



   D
                                                                  Digitized File




 A
                                                                                       Digitized



E L
                                                                                         Item
      Collection                 Item
                                   Item




  M
                                        Item
                                                                             Digitized File
                                                                                   Digitized File
                                                                                        Digitized File
Dissemination Needs and Support

Finding Aids
Discovery UI Support
Discovery Index Records
Indexing
Philosophy
  Based around discovery and presentation needs
Technical Implementation
  XSLT-based Fedora Disseminator
    Pulls data from the entire RDF graph to build index records
    Reindexing would be triggered by editing or submission workflow
  Solr Index serves to cache collection structure and metadata
Discovery Interface
VIRGO
  Blacklight
    Ruby on rails
    Solr
  Custom integrations
    Fedora
    ILS (Sirsi)
    PRIMO
Development Process
Centered on User Experience
  Started with wireframes
  Included major stakeholders from the beginning
Balanced competing needs
  Archivists
    Asserts the importance of the context, collection and archival
   descriptive practice.
  Researcher
    Wants to be able to find all relevant materials across traditional silos.
  Web surfer
    Cares less about where something came from and more about being
Development Status
“Complete”
  Data model
  EAD  Fedora processing/ingest
  UI enhancements to Virgo (Blacklight)
Short term goals
  Include large volume of finding aids
  Implement robust policy support
  Refine the user interface as needed
Longer term goals
  Place robust archival description tools on top of the Fedora Data Model
Thank you!



Bradley Daigle - bradley@virginia.edu / @bradleydaigle

Mike Durbin - md5wz@virginia.edu

Contenu connexe

Dernier

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024The Digital Insurer
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 

Dernier (20)

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 

En vedette

PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at WorkGetSmarter
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...DevGAMM Conference
 

En vedette (20)

Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
 

Archival Data Presentation for Digital Library Federation

  • 1. NEW STRATEGIES FOR MSS Bradley Daigle (@BradleyDaigle) / Mike Durbin - University of Virginia
  • 2. What is a manuscript?
  • 4.
  • 5. What are we dealing with?
  • 8. Workflow Challenges Digital orphans Access rights Hybrid collections
  • 10. bring the search to the user, don’t make the user have to understand how we organize descriptive information! Made by archivists for archivists
  • 14.
  • 15.
  • 16.
  • 17. What are we doing? <?xml version="1.0" encoding="UTF-8"?> <?xml-stylesheet href="http://ead.lib.virginia.edu/vivaead/published/document.xsl" type="text/xsl"?> <ead xmlns="urn:isbn:1-931666-22-9" id="viu01215"> <eadheader audience="internal" langencoding="iso639-2b" findaidstatus="edited-full-draft" scriptencoding="iso15924" dateencoding="iso8601" countryencoding="iso3166-1" repositoryencoding="iso15511"> <eadid publicid="PUBLIC &amp;#34;-//University of Virginia::Library::Special Collections Dept.//TEXT (US::ViU::viu01215::A Guide to the Papers of John Dos Passos 1865-1998)//EN &amp;#34;viu01215.xml&amp;#34;" countrycode="US" mainagencycode="US-ViU">PUBLIC "-//University of Virginia::Library::Special Collections Dept.//TEXT (US::ViU::viu01215::A Guide to the Papers of John Dos Passos 1865-1998)//EN "viu01215.xml"</eadid> <filedesc> <titlestmt> <titleproper>A Guide to the Papers of John Dos Passos <date era="ce" calendar="gregorian">1865-1998</date></titleproper> <subtitle id="sort">Dos Passos, John, Papers <num type="collectionnumber">5950</num></subtitle> <author>Special Collections Staff</author> </titlestmt> <publicationstmt> <publisher>Special Collections, University of Virginia Library </publisher>
  • 18. How is this different? Think about finding aids not EADs Let the archivists focus on what they do best IP landscapes are flexible human and machine actionable Avoid the digitization / description dilemma Not worry about variable levels of description Optimized for digital surrogates and born digital content
  • 19. Not all rainbows and unicorns
  • 20. Challenges with integrating search results Relational database could get quite large Complex data storage model Migration of legacy content
  • 21. Looking ahead Granular circulation Risk Management Create virtual collections Alternate metadata options for description Archival prioritization of search results
  • 22. Data Model Constraints Unknowns Metadata Format Publication-ready? Unique ids? Workflows to support Only “complete” collections ingested? Editing after ingestion? Editing happen in the repository or out of the repository?
  • 23. Data Model Goals Allow for multiple hierarchies to describe the same resources Allow for metadata in various formats Support ingest of “finished” EADs but anticipate future edits, replacements and reorganizations
  • 24. Data Model From Finding Aid Collection Component Component Component Item Item Item Item Item Item Item Item Item
  • 25. Data Model Collection From Finding Aid Compo Component Compo nent nent Ite Item Ite Item Ite Ite Item Ite Ite m m m m m m
  • 26. Data Model Collection MARC MARC From Finding Aid MARC From Catalog Compo Component Compo Container Container nent nent Ite Item Ite Item Ite Ite Item Ite Ite m m m m m m
  • 27. Data Model Collection MARC MARC From Finding Aid MARC From Catalog Compo Component Compo Container Container nent nent Ite Item Ite Item Ite Ite Item Ite Ite m m m m m m
  • 28. Data Model Collection MARC MARC From Finding Aid MARC From Catalog Compo Component Compo Container Container nent nent Ite Item Ite Item Ite Ite Item Ite Ite m m m m m m
  • 29. Data Model Collection MARC MARC From Finding Aid MARC From Catalog Compo Component Compo Container Container nent nent Ite Item Ite Item Ite Ite Item Ite Ite Digitized m m m m m From Digitization and m Item patron request workflow Digitized Digitized File Digitized File File
  • 30. Data Model Collectio MARC n From Finding Aid From Catalog Component Comp Comp Container Container Item From Digitization Item Item Ite Ite Ite Ite Ite Digitized and patron request workflow Item Digitized Digitized File Digitized File File Digitized Collectio Item Item n Item From Finding Aid or other collection Item Digitized description source Digitized File Digitized File File
  • 31. Fedora Metadata Philosophy “Catalog in the format that is most suited to your materials but disseminate in the format that’s most suited to your use”
  • 32. Metadata Model R C A Collection MARC M Compo ML Compo X Component Container Container nent D nent EA Ite Item m Ite Item Ite ItemIte Ite m Digitized m m m Item Digitized File Digitized File D Digitized File A Digitized E L Item Collection Item Item M Item Digitized File Digitized File Digitized File
  • 33. Dissemination Needs and Support Finding Aids Discovery UI Support Discovery Index Records
  • 34. Indexing Philosophy Based around discovery and presentation needs Technical Implementation XSLT-based Fedora Disseminator Pulls data from the entire RDF graph to build index records Reindexing would be triggered by editing or submission workflow Solr Index serves to cache collection structure and metadata
  • 35. Discovery Interface VIRGO Blacklight Ruby on rails Solr Custom integrations Fedora ILS (Sirsi) PRIMO
  • 36. Development Process Centered on User Experience Started with wireframes Included major stakeholders from the beginning Balanced competing needs Archivists Asserts the importance of the context, collection and archival descriptive practice. Researcher Wants to be able to find all relevant materials across traditional silos. Web surfer Cares less about where something came from and more about being
  • 37.
  • 38.
  • 39.
  • 40.
  • 41.
  • 42.
  • 43.
  • 44.
  • 45.
  • 46. Development Status “Complete” Data model EAD  Fedora processing/ingest UI enhancements to Virgo (Blacklight) Short term goals Include large volume of finding aids Implement robust policy support Refine the user interface as needed Longer term goals Place robust archival description tools on top of the Fedora Data Model
  • 47. Thank you! Bradley Daigle - bradley@virginia.edu / @bradleydaigle Mike Durbin - md5wz@virginia.edu

Notes de l'éditeur

  1. \n
  2. \n
  3. \n
  4. \n
  5. Not so much about mass digitization as it is patron requests (a harder problem set)\n
  6. \n
  7. \n
  8. \n
  9. Large EADs - search and browse not integrated - separate interfaces - discover through structure\n
  10. Large EADs - search and browse not integrated - separate interfaces - discover through structure\n
  11. \n
  12. \n
  13. \n
  14. \n
  15. \n
  16. \n
  17. \n
  18. \n
  19. \n
  20. \n
  21. \n
  22. \n
  23. \n
  24. \n
  25. \n
  26. \n
  27. \n
  28. \n
  29. \n
  30. \n
  31. \n
  32. \n
  33. \n
  34. \n
  35. \n
  36. \n
  37. \n
  38. \n
  39. \n
  40. \n
  41. \n
  42. \n
  43. \n
  44. \n
  45. \n