SlideShare une entreprise Scribd logo
1  sur  23
Everyone's A Mechanic:
   Building a Simple E-
      records workflow
                          Brad Houston
      University of Wisconsin-Milwaukee
                          April 20, 2013
Are e-records worth it?
​Well, *I* think so...​



    
        ​Improved access to content
    
        ​More information about context
    
        ​Increased manipulability for research analysis

​BUT...



    
        ​Sheer volume increases opacity
    
        ​Digital preservation and "dark ages"
    
        ​Time and Money for new systems (?)
Where would we like to be with E-
records?




                     Source: JohnVW on Flickr
Where are we right now?




                Source: an0nym0n0us on Flickr
The Mechanic Metaphor

 Borrowed from a Helen Tibbo talk at
Purdue University, September 2012
    
        “In the early days of the automobile, everyone
        was a mechanic.”

 The scary implication: You have to know
enough about your process to fix things

 The exciting implication: You can make it
as simple or as complex as you need
What's standing in our way?

 ​Uncertainty re: appropriate procedures

 ​Unfamiliarity with e-records tools and systems

 ​Unfamiliarity with e-records as a medium

 ​Perceived complexity of metadata and/or
preservation requirements/systems
​The common thread: letting the perfect be
 the enemy of the good!​
Dirty Little Secret about e-records:

You already know how to do this!



    
        Accessioning: gain intellectual/physical
        control, identify potential problems
    
        Arrangement: Put files in series/other logical
        order
    
        Description: Provide access at various levels
        of the collection
    
        Preservation: Ensure the ongoing
        integrity/usability of the materials
The Local Catalyst/Example

    ​Office of the Chancellor
    
        ​Records from personal and office computers
    
        ​Various file types and formats​
    
        ​Some pre-appraisal by office staff and archivists
    
        Large volume of files– automation a must

    ​The desired end product: an AIP!
    
        Short-Term: Provide basic description and
        preservation metadata
    
        Long-Term: Prepare for ingest into future repository
​
The following worked for us, but…

 ​The tools that work for us may not be the
best for *your* needs!
    
        ​Take stock of your own e-records holdings
    
        ​Browse tool catalogs/reviews and experiment
    
        Document what you did to your files and why​

 The “Chewing Gum/Baling Wire” approach
    
        Different tools gather complementary data
        
            …But check outputs for redundancies
        
            One tool’s failure won’t bring down the whole
            thing (probably)
Accessioning and Pre-appraisal

 Goals: Establish authenticity, perform
collection overview and QC

    Duke Data Accessioner
     
         ​Quick and easy checksum generation
     
         ​Basic technical metadata for PREMIS—XML
          format

 ​DROID and/or JHOVE
     
         ​File Format Identification and validation
     
         ​DROID reporting gives overview of collection
     
         Initial triaging for preservation?
The Working Copy Watershed

 ​From this step on, minimize changes to
your originals:
    
        Create a working copy for weeding/
        arrangement
    
        ​Write-protect originals!

 Creating a disk image (*.iso, *.uif, etc.)
may be helpful for preserving fixity here
    
        See also digital forensics tools for increased
        authenticity
Appraisal and Arrangement

 ​Goals: Move files to reflect logical order,
identify/restrict confidential info

 FreeCommander (and family)
    
        ​Two-pane browsing– easy arrangement
    
        ​integrated viewer for quick appraisal

 ​Firefly SSN Finder
    
        ​Identifies Social security, credit card #s, other
         sensitive info
    
        ​Supplements, not replaces, manual inspection
Description: File Level

 Goals: extract technical/descriptive metadata
automatically; improve discovery

 ​EXIFTool
    
        Pulls embedded metadata from files (esp. photos)
    
        ​ Exports data into CSV for tabular description

 ​ReNamer
    
        Standardize file names, strip special characters
    
        ​Option to add embedded metadata to filenames
n.b. Automation is especially key for this step.
(Think MPLP!)
File-Level Description Table
Description: Collection Level

 Goals: discovery of collection as a whole

 ​No special tools necessary- Describe as you
would paper records!

 ​That said, a few EAD considerations...
    
        ​<phystech> should include hardware, OS, and
         software needed to render all formats
    
        Describe at series/folder, not file, level
    
        ​Consider IP and/or confidentiality issues if including
         digital object links
Preservation

 ​Goal: Avoid obsolescence and/or
technology failure
Digital Preservation Software Platform



    
        ​Normalizes files to preservation formats
    
        ​Logs every preservation action taken
        
            n.b. Use this *in addition* to other metadata
            gathered.
​File Storage Location



    
        ​Use stable media or network storage (backed
         up) -- i.e. *not* CDs, floppies, etc.
    
        ​Best practice: 2 onsite copies, 1 offsite copy​
XENA AIP Example
Putting it all together
 
     Determine needed metadata for
     preservation/access
      −   Delete extraneous output
 
     Collocate if you can (XSLT, etc.); document
     what's where if you can't
 
     Provide access to as much or as little
     metadata as you need
 
     Keep the originals for authenticity
      −   Access copies for everyday usage
And don’t forget…
Digital preservation, like         
                                       Click to edit the
car maintenance, is an                 outline text format
ongoing process.                       −   Second Outline
 
     Know your collections                 Level
 
     Have a monitoring plan                
                                               Third Outline
 
     Keep up with best                         Level
     practices, discussions
                                                −   Fourth
 
     Don't give up! Anything you   I hated this car…Outline to my
                                                     but it got me
     can do is helpful.            destination. (Most of the time.)
                                                    Level
                                                       
                                                          Fifth
                                                          Outline
                                                          Level
Resources
  ​AIMS    white paper on Born-Digital
 Records
  http://www.digitalcurationservices.org/aims/white-paper/

  ​Digital   Curation Centre (UK)
  http://digitalcurationexchange.org/
  ​The   Signal: LOC Digital Preservation blog
  http://blogs.loc.gov/digitalpreservation/
  Digital    Curation Exchange
  http://digitalcurationexchange.org/
  ​Practical    E-records
Thank You
             Brad Houston
      University Records Archivist
   University of Wisconsin-Milwaukee
       Libraries, Archives Dept.
          houstobn@uwm.edu
 This presentation available for download
                    at:
https://www.box.com/s/vx22f1jus8821d20z
                    y5t

Contenu connexe

Similaire à Everyone's A Mechanic

Digitization Basics for Archives and Special Collections – Part 2: Store and ...
Digitization Basics for Archives and Special Collections – Part 2: Store and ...Digitization Basics for Archives and Special Collections – Part 2: Store and ...
Digitization Basics for Archives and Special Collections – Part 2: Store and ...WiLS
 
Disk Image!...and then what? Strategies for sustainable long-term storage an...
Disk Image!...and then what?  Strategies for sustainable long-term storage an...Disk Image!...and then what?  Strategies for sustainable long-term storage an...
Disk Image!...and then what? Strategies for sustainable long-term storage an...Helen Bailey
 
Pikas Asist2007 PIM Senior Engineers Final
Pikas Asist2007 PIM Senior Engineers FinalPikas Asist2007 PIM Senior Engineers Final
Pikas Asist2007 PIM Senior Engineers FinalChristina Pikas
 
Keep Calm and Curate
Keep Calm and CurateKeep Calm and Curate
Keep Calm and CurateGarethKnight
 
Unified characterisation, please
Unified characterisation, pleaseUnified characterisation, please
Unified characterisation, pleaseAndy Jackson
 
Digital Preservation in the Wild
Digital Preservation in the WildDigital Preservation in the Wild
Digital Preservation in the WildTim Donohue
 
BP301: Q: What’s Your Second Most Valuable Asset and Nearly Doubles Every Year?
BP301: Q: What’s Your Second Most Valuable Asset and Nearly Doubles Every Year? BP301: Q: What’s Your Second Most Valuable Asset and Nearly Doubles Every Year?
BP301: Q: What’s Your Second Most Valuable Asset and Nearly Doubles Every Year? panagenda
 
File Formats for Preservation
File Formats for PreservationFile Formats for Preservation
File Formats for PreservationStephen Gray
 
Digital Forensic is a part of forensic that focuses on investigati.docx
Digital Forensic is a part of forensic that focuses on investigati.docxDigital Forensic is a part of forensic that focuses on investigati.docx
Digital Forensic is a part of forensic that focuses on investigati.docxcuddietheresa
 
Data management for TA's
Data management for TA'sData management for TA's
Data management for TA'saaroncollie
 
EPrints and the Cloud
EPrints and the CloudEPrints and the Cloud
EPrints and the CloudLeslie Carr
 
Digital Forensics in the Archive
Digital Forensics in the ArchiveDigital Forensics in the Archive
Digital Forensics in the ArchiveGarethKnight
 
Getting started in digital preservation
Getting started in digital preservationGetting started in digital preservation
Getting started in digital preservationSarah Jones
 
Digitisation Workshop Pres 2008(V1)
Digitisation Workshop Pres 2008(V1)Digitisation Workshop Pres 2008(V1)
Digitisation Workshop Pres 2008(V1)Mal Booth
 
Internal lunchtime-learning--2013 jan22--data management
Internal lunchtime-learning--2013 jan22--data managementInternal lunchtime-learning--2013 jan22--data management
Internal lunchtime-learning--2013 jan22--data managementMiles Baltrusaitis
 
Bodleian Library's DAMS system
Bodleian Library's DAMS systemBodleian Library's DAMS system
Bodleian Library's DAMS systembenosteen
 
Computer Forensics chap 3+4.DS_Store__MACOSXComputer Foren.docx
Computer Forensics chap 3+4.DS_Store__MACOSXComputer Foren.docxComputer Forensics chap 3+4.DS_Store__MACOSXComputer Foren.docx
Computer Forensics chap 3+4.DS_Store__MACOSXComputer Foren.docxmaxinesmith73660
 
Approaches to preserving digitized taxonomic data
Approaches to preserving digitized taxonomic dataApproaches to preserving digitized taxonomic data
Approaches to preserving digitized taxonomic dataChris Freeland
 

Similaire à Everyone's A Mechanic (20)

SHAREmodule2
SHAREmodule2SHAREmodule2
SHAREmodule2
 
Digitization Basics for Archives and Special Collections – Part 2: Store and ...
Digitization Basics for Archives and Special Collections – Part 2: Store and ...Digitization Basics for Archives and Special Collections – Part 2: Store and ...
Digitization Basics for Archives and Special Collections – Part 2: Store and ...
 
Disk Image!...and then what? Strategies for sustainable long-term storage an...
Disk Image!...and then what?  Strategies for sustainable long-term storage an...Disk Image!...and then what?  Strategies for sustainable long-term storage an...
Disk Image!...and then what? Strategies for sustainable long-term storage an...
 
Pikas Asist2007 PIM Senior Engineers Final
Pikas Asist2007 PIM Senior Engineers FinalPikas Asist2007 PIM Senior Engineers Final
Pikas Asist2007 PIM Senior Engineers Final
 
Keep Calm and Curate
Keep Calm and CurateKeep Calm and Curate
Keep Calm and Curate
 
Unified characterisation, please
Unified characterisation, pleaseUnified characterisation, please
Unified characterisation, please
 
Digital Preservation in the Wild
Digital Preservation in the WildDigital Preservation in the Wild
Digital Preservation in the Wild
 
BP301: Q: What’s Your Second Most Valuable Asset and Nearly Doubles Every Year?
BP301: Q: What’s Your Second Most Valuable Asset and Nearly Doubles Every Year? BP301: Q: What’s Your Second Most Valuable Asset and Nearly Doubles Every Year?
BP301: Q: What’s Your Second Most Valuable Asset and Nearly Doubles Every Year?
 
File Formats for Preservation
File Formats for PreservationFile Formats for Preservation
File Formats for Preservation
 
Digital Forensic is a part of forensic that focuses on investigati.docx
Digital Forensic is a part of forensic that focuses on investigati.docxDigital Forensic is a part of forensic that focuses on investigati.docx
Digital Forensic is a part of forensic that focuses on investigati.docx
 
Data management for TA's
Data management for TA'sData management for TA's
Data management for TA's
 
EPrints and the Cloud
EPrints and the CloudEPrints and the Cloud
EPrints and the Cloud
 
Electronic Records
Electronic RecordsElectronic Records
Electronic Records
 
Digital Forensics in the Archive
Digital Forensics in the ArchiveDigital Forensics in the Archive
Digital Forensics in the Archive
 
Getting started in digital preservation
Getting started in digital preservationGetting started in digital preservation
Getting started in digital preservation
 
Digitisation Workshop Pres 2008(V1)
Digitisation Workshop Pres 2008(V1)Digitisation Workshop Pres 2008(V1)
Digitisation Workshop Pres 2008(V1)
 
Internal lunchtime-learning--2013 jan22--data management
Internal lunchtime-learning--2013 jan22--data managementInternal lunchtime-learning--2013 jan22--data management
Internal lunchtime-learning--2013 jan22--data management
 
Bodleian Library's DAMS system
Bodleian Library's DAMS systemBodleian Library's DAMS system
Bodleian Library's DAMS system
 
Computer Forensics chap 3+4.DS_Store__MACOSXComputer Foren.docx
Computer Forensics chap 3+4.DS_Store__MACOSXComputer Foren.docxComputer Forensics chap 3+4.DS_Store__MACOSXComputer Foren.docx
Computer Forensics chap 3+4.DS_Store__MACOSXComputer Foren.docx
 
Approaches to preserving digitized taxonomic data
Approaches to preserving digitized taxonomic dataApproaches to preserving digitized taxonomic data
Approaches to preserving digitized taxonomic data
 

Plus de Brad Houston

Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsfBrad Houston
 
Data management plans
Data management plansData management plans
Data management plansBrad Houston
 
Reading the Library General Records Schedule
Reading the Library General Records ScheduleReading the Library General Records Schedule
Reading the Library General Records ScheduleBrad Houston
 
Legal Issues in Records Management
Legal Issues in Records ManagementLegal Issues in Records Management
Legal Issues in Records ManagementBrad Houston
 
Interpreting The Personnel General Records Schedule
Interpreting The Personnel General Records ScheduleInterpreting The Personnel General Records Schedule
Interpreting The Personnel General Records ScheduleBrad Houston
 
Electronic Records Management
Electronic Records ManagementElectronic Records Management
Electronic Records ManagementBrad Houston
 
Finding and Reading General Records Schedules
Finding and Reading General Records SchedulesFinding and Reading General Records Schedules
Finding and Reading General Records SchedulesBrad Houston
 
The Basics of UWM Records Management
The Basics of UWM Records ManagementThe Basics of UWM Records Management
The Basics of UWM Records ManagementBrad Houston
 
"Delete it Or Keep It?"
"Delete it Or Keep It?""Delete it Or Keep It?"
"Delete it Or Keep It?"Brad Houston
 
The Basics of UWM Records Management
The Basics of UWM Records ManagementThe Basics of UWM Records Management
The Basics of UWM Records ManagementBrad Houston
 

Plus de Brad Houston (11)

Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsf
 
Data management plans
Data management plansData management plans
Data management plans
 
Reading the Library General Records Schedule
Reading the Library General Records ScheduleReading the Library General Records Schedule
Reading the Library General Records Schedule
 
Legal Issues in Records Management
Legal Issues in Records ManagementLegal Issues in Records Management
Legal Issues in Records Management
 
Interpreting The Personnel General Records Schedule
Interpreting The Personnel General Records ScheduleInterpreting The Personnel General Records Schedule
Interpreting The Personnel General Records Schedule
 
Electronic Records Management
Electronic Records ManagementElectronic Records Management
Electronic Records Management
 
Finding and Reading General Records Schedules
Finding and Reading General Records SchedulesFinding and Reading General Records Schedules
Finding and Reading General Records Schedules
 
The Basics of UWM Records Management
The Basics of UWM Records ManagementThe Basics of UWM Records Management
The Basics of UWM Records Management
 
E-Mail Management
E-Mail ManagementE-Mail Management
E-Mail Management
 
"Delete it Or Keep It?"
"Delete it Or Keep It?""Delete it Or Keep It?"
"Delete it Or Keep It?"
 
The Basics of UWM Records Management
The Basics of UWM Records ManagementThe Basics of UWM Records Management
The Basics of UWM Records Management
 

Dernier

ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfVanessa Camilleri
 
Textual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSTextual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSMae Pangan
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQuiz Club NITW
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4JOYLYNSAMANIEGO
 
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQ-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQuiz Club NITW
 
Mental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young mindsMental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young mindsPooky Knightsmith
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmStan Meyer
 
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxBIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxSayali Powar
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxlancelewisportillo
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Projectjordimapav
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdfMr Bounab Samir
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptxDhatriParmar
 
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvRicaMaeCastro1
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWQuiz Club NITW
 
How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17Celine George
 

Dernier (20)

ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdf
 
Textual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSTextual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHS
 
Paradigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTAParadigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTA
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4
 
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQ-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
 
Mental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young mindsMental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young minds
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and Film
 
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxBIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Project
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdf
 
prashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Professionprashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Profession
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
 
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
 
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptxINCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITW
 
How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17
 

Everyone's A Mechanic

  • 1. Everyone's A Mechanic: Building a Simple E- records workflow Brad Houston University of Wisconsin-Milwaukee April 20, 2013
  • 2. Are e-records worth it? ​Well, *I* think so...​   ​Improved access to content  ​More information about context  ​Increased manipulability for research analysis ​BUT...   ​Sheer volume increases opacity  ​Digital preservation and "dark ages"  ​Time and Money for new systems (?)
  • 3. Where would we like to be with E- records? Source: JohnVW on Flickr
  • 4. Where are we right now? Source: an0nym0n0us on Flickr
  • 5. The Mechanic Metaphor  Borrowed from a Helen Tibbo talk at Purdue University, September 2012  “In the early days of the automobile, everyone was a mechanic.”  The scary implication: You have to know enough about your process to fix things  The exciting implication: You can make it as simple or as complex as you need
  • 6. What's standing in our way?  ​Uncertainty re: appropriate procedures  ​Unfamiliarity with e-records tools and systems  ​Unfamiliarity with e-records as a medium  ​Perceived complexity of metadata and/or preservation requirements/systems ​The common thread: letting the perfect be the enemy of the good!​
  • 7. Dirty Little Secret about e-records: You already know how to do this!   Accessioning: gain intellectual/physical control, identify potential problems  Arrangement: Put files in series/other logical order  Description: Provide access at various levels of the collection  Preservation: Ensure the ongoing integrity/usability of the materials
  • 8. The Local Catalyst/Example  ​Office of the Chancellor  ​Records from personal and office computers  ​Various file types and formats​  ​Some pre-appraisal by office staff and archivists  Large volume of files– automation a must  ​The desired end product: an AIP!  Short-Term: Provide basic description and preservation metadata  Long-Term: Prepare for ingest into future repository ​
  • 9. The following worked for us, but…  ​The tools that work for us may not be the best for *your* needs!  ​Take stock of your own e-records holdings  ​Browse tool catalogs/reviews and experiment  Document what you did to your files and why​  The “Chewing Gum/Baling Wire” approach  Different tools gather complementary data  …But check outputs for redundancies  One tool’s failure won’t bring down the whole thing (probably)
  • 10. Accessioning and Pre-appraisal  Goals: Establish authenticity, perform collection overview and QC  Duke Data Accessioner  ​Quick and easy checksum generation  ​Basic technical metadata for PREMIS—XML format  ​DROID and/or JHOVE  ​File Format Identification and validation  ​DROID reporting gives overview of collection  Initial triaging for preservation?
  • 11.
  • 12. The Working Copy Watershed  ​From this step on, minimize changes to your originals:  Create a working copy for weeding/ arrangement  ​Write-protect originals!  Creating a disk image (*.iso, *.uif, etc.) may be helpful for preserving fixity here  See also digital forensics tools for increased authenticity
  • 13. Appraisal and Arrangement  ​Goals: Move files to reflect logical order, identify/restrict confidential info  FreeCommander (and family)  ​Two-pane browsing– easy arrangement  ​integrated viewer for quick appraisal  ​Firefly SSN Finder  ​Identifies Social security, credit card #s, other sensitive info  ​Supplements, not replaces, manual inspection
  • 14.
  • 15. Description: File Level  Goals: extract technical/descriptive metadata automatically; improve discovery  ​EXIFTool  Pulls embedded metadata from files (esp. photos)  ​ Exports data into CSV for tabular description  ​ReNamer  Standardize file names, strip special characters  ​Option to add embedded metadata to filenames n.b. Automation is especially key for this step. (Think MPLP!)
  • 17. Description: Collection Level  Goals: discovery of collection as a whole  ​No special tools necessary- Describe as you would paper records!  ​That said, a few EAD considerations...  ​<phystech> should include hardware, OS, and software needed to render all formats  Describe at series/folder, not file, level  ​Consider IP and/or confidentiality issues if including digital object links
  • 18. Preservation  ​Goal: Avoid obsolescence and/or technology failure Digital Preservation Software Platform   ​Normalizes files to preservation formats  ​Logs every preservation action taken  n.b. Use this *in addition* to other metadata gathered. ​File Storage Location   ​Use stable media or network storage (backed up) -- i.e. *not* CDs, floppies, etc.  ​Best practice: 2 onsite copies, 1 offsite copy​
  • 20. Putting it all together  Determine needed metadata for preservation/access − Delete extraneous output  Collocate if you can (XSLT, etc.); document what's where if you can't  Provide access to as much or as little metadata as you need  Keep the originals for authenticity − Access copies for everyday usage
  • 21. And don’t forget… Digital preservation, like  Click to edit the car maintenance, is an outline text format ongoing process. − Second Outline  Know your collections Level  Have a monitoring plan  Third Outline  Keep up with best Level practices, discussions − Fourth  Don't give up! Anything you I hated this car…Outline to my but it got me can do is helpful. destination. (Most of the time.) Level  Fifth Outline Level
  • 22. Resources ​AIMS white paper on Born-Digital Records http://www.digitalcurationservices.org/aims/white-paper/ ​Digital Curation Centre (UK) http://digitalcurationexchange.org/ ​The Signal: LOC Digital Preservation blog http://blogs.loc.gov/digitalpreservation/ Digital Curation Exchange http://digitalcurationexchange.org/ ​Practical E-records
  • 23. Thank You Brad Houston University Records Archivist University of Wisconsin-Milwaukee Libraries, Archives Dept. houstobn@uwm.edu This presentation available for download at: https://www.box.com/s/vx22f1jus8821d20z y5t

Notes de l'éditeur

  1. You&apos;ve heard from Cate about getting the records into your repository, so I&apos;m going to take some time to talk about what you do with them once you&apos;ve got them. Before we get to that, however, it&apos;s worth re-considering why keeping records in electronic format is valuable, and what some of the major challenges might be.
  2. Now, of course, we&apos;d all like to have the Ferrari version of an e-records system, with all the bells and whistles that implies: Trusted Digital Repository certification, automatic monitoring for bit rot, an integrated access system, and everything else we think about when we think about the ideal digital preservation system. For most of us, however, our e-records process looks a bit more like this:
  3. Now, nobody goes out and buys this car. (Well, OK, maybe if you&apos;re a teenager and need your first set of wheels.) But this car sort of sneaks up on you-- you ignore the check engine light here, you don&apos;t fix the rust spots there, and pretty soon your bumper is falling off. It&apos;s the same thing with digital preservation-- it&apos;s easy to put the stuff into the backlog until the day you&apos;re ready to do the stuff &apos;right&apos;-- but all the while, you&apos;re moving farther and farther away from the original hardware and software environment, your files are degrading, and soon enough you get to the point where your e-records are irreparably damaged-- or unreadable altogether.
  4. Again: you cannot do this, unless you want to have the digital file equivalent of the car from slide 4. Eventually, you need to have something that gets you from point A to point B, even if you think it&apos;s ungainly or awkward. This isn&apos;t to say that you should give up on the Ferrari digipres environment altogether, but in the meantime, while you&apos;re waiting for funding or staffing to materialize for that, you can use the best qualities of that system to aspire towards.
  5. Ultimately, you are not doing anything different with e-records than you would do with analog records. There are really only two main differences: one, you have the potential to be able to provide a level of descriptive granularity that previous generations of users would have killed for; and two, it is much, much easier to lose the records to preservation problems if you aren&apos;t paying attention. Hopefully, the tools I&apos;m going to go over will help you leverage the first and avoid the second.
  6. So, this was our first sort of “crossroads” point, in that we had to decide, in the absence of a dedicated repository, what exactly we were trying to accomplish through running these records through their paces. Ultimately, we realized that collecting this metadata served a couple of purposes. Short-term, we wanted to provide the basic technical data to allow future-us to determine what these files were and how to access them. Long-term, we hope to use the metadata we&apos;re generating for when we DO get a repository to facilitate ingest. That in mind, we started testing. And testing. And testing.
  7. Because ultimately, if you&apos;re looking for a swiss-army tool for every conceivable file format, you&apos;re probably looking in vain. In our case, we mostly have been dealing with documents, presentation files, images, and a few audio files. You may have different needs, some of which you don&apos;t even know about yet, which is why it is so important to know your own collection and be familiar with the tool options before you get started.
  8. remember: use &quot;we used&quot; for emphasis and why in this case We found Duke Data Accessioner to be a useful tool both because it generated checksums before and after ingest, but also because it maintained a record of the file structure of the original documents. The XML generated provided us with a nice seed document for PREMIS preservation metadata, and the migration aspect of the tool allowed us to easily create working copies, which I&apos;ll talk about in a minute.
  9. This is an example of DROID in action. I&apos;ve highlighted an example of a file that has been identified as mischaracterized, in this case a Word Doc that has the characteristics of a wordperfect document. In this case, the author probably tried to reformat by changing the file extension, which doesn&apos;t really work at all-- more on this in a bit. But what this allows us to do is go to that file, do a quick appraisal on it, and either fix or weed it, depending on its importance.
  10. So, to go back to my car metaphor, everything you&apos;ve been doing so far has been to the original, which here represents the car body, engine, transmission, axels, etc. Now we&apos;re going to create a working copy, which allows you to preserve the authenticity of the originals while improving accessibility on the user end. Think of this as “tricking out” your car-- adding a spoiler, decals, a giant bass speaker in the trunk, etc. Had I thought of this metaphor earlier than last night, I would have put a photo of the “Pimp My Ride” guy here, but nobody&apos;s perfect.
  11. So, as noted, because you&apos;re keeping the SIP as your authenticity copy, you have a little more leeway to rearrange to reflect logical order. This is especially important in cases where no apparent order exists, such as the accessions you get where someone just threw stuff onto disks as quickly as possible. One aspect of processing that can get overlooked during this kind of MPLP processing is dealing with sensitive information. Luckily, most of what you&apos;re wanting to restrict falls into specific patterns, which is where programs like Firefly come in.
  12. Here&apos;s an example of FreeCommander at use during the arrangement process. In this case, I&apos;m using it to show a before and after folder-renaming process, but as you can see it greatly increases your ability to move files around without clicking through 8 separate windows. There&apos;s also a compare feature to make sure that all relevant files have been moved from source to destination, and you can get more in-depth information about the files from the details list, which is another quick way to appraise and/or describe the accession.
  13. Now, File level description is useful in facilitating searching across numerous documents on known quantities, such as author, date, subject, etc., but you also don&apos;t want to spend too much time on it. We&apos;ve used a couple of tools here to increase the discoverability of files within our collection. EXIFTool is very good at pulling out embedded metadata from document and common application files, especially photos. It is, however, only as good as the metadata your creators put into it, so this is an opportunity for records management to shine. We also found Renamer very useful to mass rename folders and files to include more descriptive info. Authenticity!
  14. In most cases, I prefer that tools export to XML to facilitate later ingest into a repository, but since EXIFTool spits out CSV files, we were able to use that to provide some rudimentary access to collections. This is a simple flat spreadsheet, but you can see that it allows you to quickly full-text search by name, filter by date, sort by author, title, keywords, etc. If you have any sort of facility with database design you can do a lot more with this, such as grouping by folder, etc. You also have the opportunity to massage some of the metadata en masse at this step-- I might change ces5 to Carlos Santiago for clarity in the access table, e.g.
  15. In a way, everything you&apos;ve been doing up to this point is leading up to preservation, because all that metadata you&apos;ve collected is going to allow you to have records of what your files are and what hardware and software you need to open them. If digital archeology isn&apos;t your thing, however, you might want to look into normalizing files to preservation file formats. We used NAA&apos;s DPSP because it provides a sort of all-in-one service-- it normalizes files, keeps the originals bitstreams in XML wrappers, moves the files to the location of your choice, and-- importantly-- automatically documents every action it takes.
  16. This is an example of an Archival Information Package generated by XENA, which is the normalization component of the DPSP. In this case, XENA is able to render the original file via one of its embedded viewers, but it also provides the XML technical metadata wrapper so that the file will self-document in case the XENA viewer is unavailable. The thing I like about XENA in particular is that you can export both the XML and the normalized and original copies from the XENA interface, which improves the longevity of the AIP even further.
  17. Ultimately, whether you use all or none of the tools just described, there are going to be a few things you will always want to do with your electronic records during processing to make sure they have been dealt with properly. It may seem like you&apos;re keeping a lot of copies of the same material, and you&apos;re right-- but that&apos;s really by design. Especially if you follow the advice on the preservation slide and keep the copies in separate locations, you are going to lessen your chance for loss or failure of the files in question. Think of it as insurance for that car you just spent all that money on tricking out.
  18. In the end, the “car” you&apos;re going to end up with is going to be a lot like my trusty 2000 Mazda Protege. This car had no pickup, the radio had been ripped out, there was a giant dent in the side, the AC didn&apos;t work... but crucially, I kept the systems that kept it actually running up to date, and so regardless of its lack of creature comforts, I was usually able to get from Point A to Point B. So it is with processing electronic records-- as long as you are doing your basic maintenance, you will be able to keep your e-recs going much longer than you would have otherwise, even if that Ferrari is still somewhat down the road.