2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class
1. Courtney C. Mumma, MAS/MLIS
UBC – SLAIS Digital Records Forensics ARST 556B
May 15, 2013, Vancouver, BC
Introduction to
2. What is Archivematica?
● digital preservation/curation system
● designed to maintain standards-based, long-
term access to collections of digital objects
● free and open-source (AGPLv3)
● supported by Artefactual Systems Inc.
5. The Digital Preservation Problem:
1. Rapid technological change drives constant system upgrades,
migrations and retirement of legacy technologies.
2. Incompatible, obsolete, obscure or proprietary systems and file
formats.
3. Loss or damage to bitstreams due the fragility of digital storage
media, system error, or human error.
6. The Digital Preservation Problem:
4. The overwhelming volume of digital information objects created
daily, each with many possible copies and versions.
5. The lack or loss of adequate metadata describing digital
information objects.
6. Accidental or malicious content alteration.
7. The Digital Preservation Problem:
7. Doubts about the reliability and integrity of electronic records
and the inability to vouch for their authenticity.
8. The complexity of digital information objects which requires
preservation of their content, structure, context, presentation,
behaviour as intellectual entities as well as bitstreams.
9. The lack of formally recognized organizational responsibility,
resources and enterprise architecture components that facilitate
digital curation, preservation and long-term access.
8.
9.
10. now future
bitstream
storage media
packaging
storage device
storage driver
file system
error correction operating system
application software user interface
input / output devices
metadata
find
relate / bind
authenticate
contextualize
stored
copied
protected
Accessible?
Usable?
Authentic?
compression
decryption
file format
character encoding fonts
codec
Responsible?
Architecture?
Resources?
content
context
structure
presentation
behaviour
13. Data Management
Preservation Planning
Archival Storage
Ingest
Administration
SIP
MANAGEMENT
AIP Access DIP
P
R
O
D
U
C
E
R
C
O
N
S
U
M
E
R
Open Archival Information System (OAIS)
reference model (ISO-STD 14721)
https://www.archivematica.org/wiki/OAIS_Use_Cases
https://www.archivematica.org/wiki/OAIS_Activity_Diagrams
14. What is Archivematica?
● allows users to process digital objects from ingest
to access in conformance with the ISO-OAIS
functional model
● creates high-quality, standards-compliant Archival
Information Packages (AIP)
● provides an architecture for implementing
preservation strategies
● provides a framework for evaluating and
implementing format policies
15.
16. `
web-based dashboard
monitor and control
web
server
MCP
server
micro-service
processing clients
watched
directory
success
error
fileshare
success
digital curation
micro-services
python
scripts
FOSS
tools
AIP
DIP
SIP or
transfer of
digital objects
& metadata
17.
18. The METS file
<dmdSec> (descriptive metadata)
Dublin Core XML
EAD XML
MODS XML
[whatever] XML
<amdSec> (administrative metadata)
<techMD>
PREMIS: object
<digiProvMD>
PREMIS: events
PREMIS: agents
<rightsMD>
PREMIS: rights
<fileSec> (a list of the files and their roles and relationships)
<structMap> (a representation of the physical structure of the AIP)
19. Preservation planning
● A two-pronged approach:
– Normalization on ingest
– Preservation of the original file to support future
strategies such as migration and emulation
● Normalization relies on format policies based on an
analysis of the significant characteristics of file
formats
● A format policy indicates the actions, tools and settings to
apply to a file of a particular file format (e.g. normalization to
preservation and/or access format)
21. Archivematica format policies
● Criteria for selecting default formats:
● Non-proprietary
● Freely available specifications
● Widely used/endorsed by major repositories
● No compression/lossless compression
● Tools available to write and render the format
● Format policies will change as community
standards, practices and tools evolve.
24. Archivematica Clients / Partners
● 30 – 50 users worldwide
● Active discussion list, twitter feed, website
● Courses, community participation
● Current Artefactual clients:
– UBC Library
– UofA Library
– SFU Library
– SFU Archives
– City of Vancouver Archives
– Rockefeller Archive Center
– International Monetary Fund Archives
– Columbia University Library
– Museum of Modern Art (MoMA)
– Yale University Library
25. Foundation or
Steering Committee
Governance
Coordination
Funding
Promotion
Users
Lead institutions
Funding
Development
All users
Bug reports
Enhancement requests
Code patches
Documentation
Promotion Open Source Software
Code
Knowledge
Community
Service Providers
Development
Technical Support
Hosting
Training
Promotion
Code
Time
Money
Knowledge
Code
Time
Money
Knowledge
Time
Money
Knowledge
30. Participate
➔Collaborate with the Digital Preservation Community
➔Library of Congress Tools Showcase - try out tools
and provide feedback in open forums
➔Download and provide feedback to BitCurator – digital
forensics tools for archivists
➔Leverage social media – follow digital archivists and
leading institutions and initiatives on Twitter, read
archives blogs
➔Follow @Snarkivist @Archivematica