SlideShare une entreprise Scribd logo
1  sur  29
Preservation of Research Data:
Dataverse / Archivematica Integration
Allan Bell | Associate University Librarian,
The University of British Columbia
Leanne Trimble | Data & Geospatial Librarian,
OCUL Scholars Portal
The UBC Context
University of British Columbia Digital
Preservation Strategy
● Digital Preservation Program
○ cIRcle, DSpace-based repository
○ Digitized collections in CONTENTdm
○ New and legacy born digital archival material
○ Websites (Archive-IT)
○ Soon, Abacus Dataverse, Research Data
University of British Columbia Digital
Preservation Strategy
● Use Archivematica as a tool to apply OAIS-compliant
preservation processes
● Integrate Archivematica with existing systems used
to manage digital objects
● Build internal technical and staff capacity
OAIS reference model
Archivematica
● “a free and open source digital
preservation system that is
designed to maintain standards-
based, long term access to
collections of digital objects”
http://www.archivematica.org
● micro-services provide
integrated suite of software
tools in compliance with ISO-
OAIS model
Digital Preservation Program
CiRcle (Dspace)
• Archivematica receives
submissions from DSpace
• Also have Archivematica to
DSpace workflow
Digital Preservation Program
CONTENTdm
• Master files uploaded to
Archivematica
• Archivematica produces
access versions and pushes
to CONTENTdm
Digital Preservation Program
RBSC/UA born-digital acquisition workflow
Digital Preservation Program TRAC Self
Audit
• Trustworthy Repositories Audit and Certification
(evolved into ISO 16363)
• Widely accepted criteria for assessing
trustworthiness of digital repositories
• TRAC checklist is an auditing tool to assess the
reliability, commitment and readiness of
institutions to assume long-term preservation
responsibilities
What is TRAC?
• The TRAC metrics assess three areas:
a. Organizational Infrastructure - the repository's
administrative, staffing, financial, and legal functions
b. Digital Object Management - the handling of digital
objects from ingest to access
c. Technology, Technical Infrastructure and Security - the
technology used to handle ingested objects
• These criteria represent best practices and current thinking
about the organizational and technological needs of
trustworthy digital repositories.
TRAC Compliant Repositories
Centre for Research Libraries has audited and
certified five repositories:
•Chronopolis Report
•CLOCKSS
•Hathitrust Report
•Portico Report
•Scholars Portal
Digital Preservation Program
Conclusions
• Greater comfort with and
understanding of the challenges
around archiving digitized and
born digital material
• Establishing a comprehensive
digital preservation program is
complex!
• Having tools is important, also
need policies and procedures for
certification (if desired)
Abacus Dataverse: Research Data
Management
● UBC hosted instance for four Research Universities in British
Columbia since 2014
○ Abacus DSpace launched in 2009
● 1,700 studies (more than 28,000 files)
● Actively used by researchers
● Each school has full control and added discoverability for their
data
○ Licensed data but also growing institutional research data
collections
○ Each institution has its own subnet with
■ OAI export to Summon (common Library Discovery Layer)
■ Separate Dataverses for institutional research data
The Ontario Context
OCUL & Scholars Portal
Who?
• 21 university libraries in Ontario
What?
• Collective purchasing
• Shared digital infrastructure
• Collaborative planning and
assessment
How?
*Scholars Portal*
• OCUL’s shared technology
infrastructure, housing shared
collections
More information:
http://www.ocul.on.ca/
OCUL/SP & Research Data Management
Dataverse (OCUL hosted instance)
– Hosted for OCUL since 2011
– 330 studies (about 4,000 files)
– Actively used by researchers from 7-8 institutions
– Many in social science disciplines but some in
sciences (agriculture, polar research, geophysics,
nursing…)
OCUL/SP & Research Data Management
• Services are evolving at each institution
• Still trying to get a handle on:
– RDM support services required by researchers
– RDM infrastructure requirements
– RDM costs
– Role of regional consortia in RDM services
OCUL/SP & Digital Preservation
• Trustworthy Digital Repository (TDR) certified
for electronic journal content (since 2013)
• Currently working on Ontario Library Research
Cloud (OLRC) project (2015 completion)
•Data Preservation: strong interest
National initiatives in Canada
‘Portage’
Canadian Association of Research Libraries led
project aimed at building a library-based
research data management network
2 aspects:
• Network of expertise for research data
management
• A national preservation and discovery network
for research data
National preservation network
Dataverse / Archivematica
Integration
Dataverse/Archivematica Integration
Dataverse
• Data
• Metadata (DDI &
other)
Archivematica
• Accept data and
metadata
• Perform preservation
functions
• Create Archival
Information Packages
(AIPs)
Archival
storage
?
Local Data Repository
(e.g. at SP or UBC)
Preservation Infrastructure (Portage)
Integration Middleware
• Harvest content via Dataverse API (no
SWORD client capability ATM)
• Package and submit to Archivematica
using SWORD
Project Participants
• Artefactual – Evelyn McLellan, Justin Simpson
• Dataverse – Phil Durbin, Eleni Castro (& others)
• Scholars Portal – Leanne Trimble, Alan Darnell
• UBC – Allan Bell, Eugene Barsky
• University of Alberta – Geoff Harder, Chuck
Humphrey, Larry Laliberte, Peter Binkley
• Simon Fraser University – Alex Garnett
Functional Requirements
● Develop “middleware” which can transfer
studies from Dataverse to Archivematica
- Detect newly published studies & “major” new
versions
- Harvest released studies from Dataverse
- Utilize SWORD protocol
- Submit to Archivematica
- One Dataverse study = 1 SIP = 1 AIP
Functional Requirements (2)
● Investigate Archivematica pipeline decisions
for data formats coming from Dataverse
- File format normalization?
- Connecting versions of the same dataset to one
another?
- Handling DDI (and other) metadata records?
Possible features for future stages
• Dataverse as a SWORD client
• Mechanism within Dataverse for researchers
to specify which datasets they want to target
for preservation
• Returning information from Archivematica
back to Dataverse (indication of preservation
status within Dataverse)
Next Steps
• University of Toronto procurement process
underway to contract the development work
to Artefactual
• Develop the middleware (2015)
• Recruit researchers to contribute data to ingest
(concurrent with development work)

Contenu connexe

Tendances

Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...EDINA, University of Edinburgh
 
Persistent Identifier Services and their Metadata by John Kunze
Persistent Identifier Services and their Metadata by John KunzePersistent Identifier Services and their Metadata by John Kunze
Persistent Identifier Services and their Metadata by John Kunzedatascienceiqss
 
Authority files - Jisc Digital Festival 2014
Authority files - Jisc Digital Festival 2014Authority files - Jisc Digital Festival 2014
Authority files - Jisc Digital Festival 2014Jisc
 
Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...EDINA, University of Edinburgh
 
Collaboration to Curation: The High Rise Project meets Edinburgh DataShare
Collaboration to Curation: The High Rise Project meets Edinburgh DataShareCollaboration to Curation: The High Rise Project meets Edinburgh DataShare
Collaboration to Curation: The High Rise Project meets Edinburgh DataShareEDINA, University of Edinburgh
 
Closing the scientific literature access gap with CORE - how to gain free acc...
Closing the scientific literature access gap with CORE - how to gain free acc...Closing the scientific literature access gap with CORE - how to gain free acc...
Closing the scientific literature access gap with CORE - how to gain free acc...Nancy Pontika
 
THOR Workshop - Persistent Identifier Linking
THOR Workshop - Persistent Identifier LinkingTHOR Workshop - Persistent Identifier Linking
THOR Workshop - Persistent Identifier LinkingMaaike Duine
 
THOR Workshop - Introduction
THOR Workshop - IntroductionTHOR Workshop - Introduction
THOR Workshop - IntroductionMaaike Duine
 
Making your data good enough for sharing.
Making your data good enough for sharing.Making your data good enough for sharing.
Making your data good enough for sharing.FAIRDOM
 
Report of the second FAIRDOM foundry
Report of the second FAIRDOM foundryReport of the second FAIRDOM foundry
Report of the second FAIRDOM foundryFAIRDOM
 
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...DuraSpace
 
Rots RDAP11 Data Archives in Federal Agencies
Rots RDAP11 Data Archives in Federal AgenciesRots RDAP11 Data Archives in Federal Agencies
Rots RDAP11 Data Archives in Federal AgenciesASIS&T
 
Open Repositories and Interoperability Challenges in UK
Open Repositories and Interoperability Challenges in UKOpen Repositories and Interoperability Challenges in UK
Open Repositories and Interoperability Challenges in UKEDINA, University of Edinburgh
 
Research data management free online courses, publisher policies
Research data management   free online courses, publisher policiesResearch data management   free online courses, publisher policies
Research data management free online courses, publisher policiesNikesh Narayanan
 
10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...
10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...
10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...DuraSpace
 

Tendances (20)

Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Persistent Identifier Services and their Metadata by John Kunze
Persistent Identifier Services and their Metadata by John KunzePersistent Identifier Services and their Metadata by John Kunze
Persistent Identifier Services and their Metadata by John Kunze
 
Authority files - Jisc Digital Festival 2014
Authority files - Jisc Digital Festival 2014Authority files - Jisc Digital Festival 2014
Authority files - Jisc Digital Festival 2014
 
Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...
 
Organising and Documenting Data
Organising and Documenting DataOrganising and Documenting Data
Organising and Documenting Data
 
Collaboration to Curation: The High Rise Project meets Edinburgh DataShare
Collaboration to Curation: The High Rise Project meets Edinburgh DataShareCollaboration to Curation: The High Rise Project meets Edinburgh DataShare
Collaboration to Curation: The High Rise Project meets Edinburgh DataShare
 
User engagement in research data curation
User engagement in research data curationUser engagement in research data curation
User engagement in research data curation
 
Research Data Management: Why is it important?
Research Data Management: Why is it  important?Research Data Management: Why is it  important?
Research Data Management: Why is it important?
 
Closing the scientific literature access gap with CORE - how to gain free acc...
Closing the scientific literature access gap with CORE - how to gain free acc...Closing the scientific literature access gap with CORE - how to gain free acc...
Closing the scientific literature access gap with CORE - how to gain free acc...
 
THOR Workshop - Persistent Identifier Linking
THOR Workshop - Persistent Identifier LinkingTHOR Workshop - Persistent Identifier Linking
THOR Workshop - Persistent Identifier Linking
 
THOR Workshop - Introduction
THOR Workshop - IntroductionTHOR Workshop - Introduction
THOR Workshop - Introduction
 
Making your data good enough for sharing.
Making your data good enough for sharing.Making your data good enough for sharing.
Making your data good enough for sharing.
 
Report of the second FAIRDOM foundry
Report of the second FAIRDOM foundryReport of the second FAIRDOM foundry
Report of the second FAIRDOM foundry
 
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
 
Rots RDAP11 Data Archives in Federal Agencies
Rots RDAP11 Data Archives in Federal AgenciesRots RDAP11 Data Archives in Federal Agencies
Rots RDAP11 Data Archives in Federal Agencies
 
Open Repositories and Interoperability Challenges in UK
Open Repositories and Interoperability Challenges in UKOpen Repositories and Interoperability Challenges in UK
Open Repositories and Interoperability Challenges in UK
 
Research data management free online courses, publisher policies
Research data management   free online courses, publisher policiesResearch data management   free online courses, publisher policies
Research data management free online courses, publisher policies
 
10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...
10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...
10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...
 
Researh data management
Researh data managementResearh data management
Researh data management
 

Similaire à Preservation of Research Data: Dataverse / Archivematica Integration by Allan Bell and Leanne Trimble

L&P Humphrey Stewart-Shearer-Joint Session Project ARC & Federated DMP Pilot
L&P Humphrey Stewart-Shearer-Joint Session Project ARC & Federated DMP PilotL&P Humphrey Stewart-Shearer-Joint Session Project ARC & Federated DMP Pilot
L&P Humphrey Stewart-Shearer-Joint Session Project ARC & Federated DMP PilotCASRAI
 
A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...Jenny Mitcham
 
From Box to Hydra via Archivematica
From Box to Hydra via ArchivematicaFrom Box to Hydra via Archivematica
From Box to Hydra via ArchivematicaJisc RDM
 
Institutional Repositories.pptx
Institutional Repositories.pptxInstitutional Repositories.pptx
Institutional Repositories.pptxSheejamolMathew
 
"Filling the digital preservation gap" with Archivematica
"Filling the digital preservation gap" with Archivematica"Filling the digital preservation gap" with Archivematica
"Filling the digital preservation gap" with ArchivematicaJenny Mitcham
 
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareResearch Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareHistoric Environment Scotland
 
Transforming University Research - Mar 2006
Transforming University Research - Mar 2006Transforming University Research - Mar 2006
Transforming University Research - Mar 2006Jill Patrick
 
suresh oclc (3).pptx
suresh oclc (3).pptxsuresh oclc (3).pptx
suresh oclc (3).pptxSURESHTOMAR3
 
The workflows for the ingest of digital objects into a repository/digital l...
The workflows for the ingest of  digital objects into a repository/digital l...The workflows for the ingest of  digital objects into a repository/digital l...
The workflows for the ingest of digital objects into a repository/digital l...Hong (Jenny) Jing
 
Providing First World Library services By using Koha, DSpace, vufind and Drupal
Providing First World Library services By using  Koha, DSpace, vufind and DrupalProviding First World Library services By using  Koha, DSpace, vufind and Drupal
Providing First World Library services By using Koha, DSpace, vufind and DrupalNur Ahammad
 
Implementing Open Source DAM at HIstoric New England
Implementing Open Source DAM at HIstoric New EnglandImplementing Open Source DAM at HIstoric New England
Implementing Open Source DAM at HIstoric New Englanddwig
 
Using Archivemedia to preserve research data
Using Archivemedia to preserve research dataUsing Archivemedia to preserve research data
Using Archivemedia to preserve research dataARDC
 
MetadataTheory: Introduction to Repositories (8th of 10)
MetadataTheory: Introduction to Repositories (8th of 10)MetadataTheory: Introduction to Repositories (8th of 10)
MetadataTheory: Introduction to Repositories (8th of 10)Nikos Palavitsinis, PhD
 
Project update: A collaborative approach to "filling the digital preservation...
Project update: A collaborative approach to "filling the digital preservation...Project update: A collaborative approach to "filling the digital preservation...
Project update: A collaborative approach to "filling the digital preservation...Jenny Mitcham
 
Data workshop preso
Data workshop preso Data workshop preso
Data workshop preso Doug Moncur
 
Capture All the URLs: First Steps in Web Archiving
Capture All the URLs: First Steps in Web ArchivingCapture All the URLs: First Steps in Web Archiving
Capture All the URLs: First Steps in Web ArchivingKristen Yarmey
 
"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with ArchivematicaJenny Mitcham
 

Similaire à Preservation of Research Data: Dataverse / Archivematica Integration by Allan Bell and Leanne Trimble (20)

L&P Humphrey Stewart-Shearer-Joint Session Project ARC & Federated DMP Pilot
L&P Humphrey Stewart-Shearer-Joint Session Project ARC & Federated DMP PilotL&P Humphrey Stewart-Shearer-Joint Session Project ARC & Federated DMP Pilot
L&P Humphrey Stewart-Shearer-Joint Session Project ARC & Federated DMP Pilot
 
A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...
 
From Box to Hydra via Archivematica
From Box to Hydra via ArchivematicaFrom Box to Hydra via Archivematica
From Box to Hydra via Archivematica
 
Institutional Repositories.pptx
Institutional Repositories.pptxInstitutional Repositories.pptx
Institutional Repositories.pptx
 
Research Data Management at The University of Edinburgh
Research Data Management at The University of EdinburghResearch Data Management at The University of Edinburgh
Research Data Management at The University of Edinburgh
 
"Filling the digital preservation gap" with Archivematica
"Filling the digital preservation gap" with Archivematica"Filling the digital preservation gap" with Archivematica
"Filling the digital preservation gap" with Archivematica
 
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareResearch Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
 
Transforming University Research - Mar 2006
Transforming University Research - Mar 2006Transforming University Research - Mar 2006
Transforming University Research - Mar 2006
 
Ppls mvm2
Ppls mvm2Ppls mvm2
Ppls mvm2
 
suresh oclc (3).pptx
suresh oclc (3).pptxsuresh oclc (3).pptx
suresh oclc (3).pptx
 
The workflows for the ingest of digital objects into a repository/digital l...
The workflows for the ingest of  digital objects into a repository/digital l...The workflows for the ingest of  digital objects into a repository/digital l...
The workflows for the ingest of digital objects into a repository/digital l...
 
Providing First World Library services By using Koha, DSpace, vufind and Drupal
Providing First World Library services By using  Koha, DSpace, vufind and DrupalProviding First World Library services By using  Koha, DSpace, vufind and Drupal
Providing First World Library services By using Koha, DSpace, vufind and Drupal
 
Implementing Open Source DAM at HIstoric New England
Implementing Open Source DAM at HIstoric New EnglandImplementing Open Source DAM at HIstoric New England
Implementing Open Source DAM at HIstoric New England
 
Using Archivemedia to preserve research data
Using Archivemedia to preserve research dataUsing Archivemedia to preserve research data
Using Archivemedia to preserve research data
 
MetadataTheory: Introduction to Repositories (8th of 10)
MetadataTheory: Introduction to Repositories (8th of 10)MetadataTheory: Introduction to Repositories (8th of 10)
MetadataTheory: Introduction to Repositories (8th of 10)
 
Project update: A collaborative approach to "filling the digital preservation...
Project update: A collaborative approach to "filling the digital preservation...Project update: A collaborative approach to "filling the digital preservation...
Project update: A collaborative approach to "filling the digital preservation...
 
Data workshop preso
Data workshop preso Data workshop preso
Data workshop preso
 
Capture All the URLs: First Steps in Web Archiving
Capture All the URLs: First Steps in Web ArchivingCapture All the URLs: First Steps in Web Archiving
Capture All the URLs: First Steps in Web Archiving
 
EDINA Serials UKLA SafeNet
EDINA Serials UKLA SafeNetEDINA Serials UKLA SafeNet
EDINA Serials UKLA SafeNet
 
"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica
 

Plus de datascienceiqss

Citing Data in Journal Articles using JATS by Deborah A. Lapeyre
Citing Data in Journal Articles using JATS by Deborah A. LapeyreCiting Data in Journal Articles using JATS by Deborah A. Lapeyre
Citing Data in Journal Articles using JATS by Deborah A. Lapeyredatascienceiqss
 
Big Data Repository for Structural Biology: Challenges and Opportunities by P...
Big Data Repository for Structural Biology: Challenges and Opportunities by P...Big Data Repository for Structural Biology: Challenges and Opportunities by P...
Big Data Repository for Structural Biology: Challenges and Opportunities by P...datascienceiqss
 
iRODS/Dataverse Project by Jonathan Crabtree
iRODS/Dataverse Project by Jonathan CrabtreeiRODS/Dataverse Project by Jonathan Crabtree
iRODS/Dataverse Project by Jonathan Crabtreedatascienceiqss
 
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinai
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinaiDataTags: Sharing Privacy Sensitive Data by Michael Bar-sinai
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinaidatascienceiqss
 
DataTags: Sharing Privacy Sensitive Data by Latanya Sweeney
DataTags: Sharing Privacy Sensitive Data by Latanya SweeneyDataTags: Sharing Privacy Sensitive Data by Latanya Sweeney
DataTags: Sharing Privacy Sensitive Data by Latanya Sweeneydatascienceiqss
 
Data Analysis in Dataverse & Visualization of Datasets on Historical Maps by ...
Data Analysis in Dataverse & Visualization of Datasets on Historical Maps by ...Data Analysis in Dataverse & Visualization of Datasets on Historical Maps by ...
Data Analysis in Dataverse & Visualization of Datasets on Historical Maps by ...datascienceiqss
 
Geospatial Data Visualization: WorldMap Integration by Raman Prasad
Geospatial Data Visualization: WorldMap Integration by Raman PrasadGeospatial Data Visualization: WorldMap Integration by Raman Prasad
Geospatial Data Visualization: WorldMap Integration by Raman Prasaddatascienceiqss
 
Sharing Data Through Plots with Plotly by Alex Johnson
Sharing Data Through Plots with Plotly by Alex JohnsonSharing Data Through Plots with Plotly by Alex Johnson
Sharing Data Through Plots with Plotly by Alex Johnsondatascienceiqss
 
TwoRavens: A Graphical, Browser-Based Statistical Interface for Data Reposito...
TwoRavens: A Graphical, Browser-Based Statistical Interface for Data Reposito...TwoRavens: A Graphical, Browser-Based Statistical Interface for Data Reposito...
TwoRavens: A Graphical, Browser-Based Statistical Interface for Data Reposito...datascienceiqss
 
MIT Libraries Dataverse by Katherine McNeill
MIT Libraries Dataverse by Katherine McNeillMIT Libraries Dataverse by Katherine McNeill
MIT Libraries Dataverse by Katherine McNeilldatascienceiqss
 
American Journal of Political Science & The Odum Institute: Promoting Researc...
American Journal of Political Science & The Odum Institute: Promoting Researc...American Journal of Political Science & The Odum Institute: Promoting Researc...
American Journal of Political Science & The Odum Institute: Promoting Researc...datascienceiqss
 
Political Analysis Dataverse by Jonathan N. Katz
Political Analysis Dataverse by Jonathan N. KatzPolitical Analysis Dataverse by Jonathan N. Katz
Political Analysis Dataverse by Jonathan N. Katzdatascienceiqss
 
Data in Brief and Dataverse: Incentivizing Authors to Share Data by Paige Sha...
Data in Brief and Dataverse: Incentivizing Authors to Share Data by Paige Sha...Data in Brief and Dataverse: Incentivizing Authors to Share Data by Paige Sha...
Data in Brief and Dataverse: Incentivizing Authors to Share Data by Paige Sha...datascienceiqss
 
Data FAIRport Skunkworks: Common Repository Access Via Meta-Metadata Descript...
Data FAIRport Skunkworks: Common Repository Access Via Meta-Metadata Descript...Data FAIRport Skunkworks: Common Repository Access Via Meta-Metadata Descript...
Data FAIRport Skunkworks: Common Repository Access Via Meta-Metadata Descript...datascienceiqss
 
Contributing Code to Dataverse by Gustavo Durand
Contributing Code to Dataverse by Gustavo DurandContributing Code to Dataverse by Gustavo Durand
Contributing Code to Dataverse by Gustavo Duranddatascienceiqss
 
Dataverse 4.0 UX by Elizabeth Quigley
Dataverse 4.0 UX by Elizabeth QuigleyDataverse 4.0 UX by Elizabeth Quigley
Dataverse 4.0 UX by Elizabeth Quigleydatascienceiqss
 
Towards a common deposit api (the dataverse example) Elizabeth Quigley + Phil...
Towards a common deposit api (the dataverse example) Elizabeth Quigley + Phil...Towards a common deposit api (the dataverse example) Elizabeth Quigley + Phil...
Towards a common deposit api (the dataverse example) Elizabeth Quigley + Phil...datascienceiqss
 

Plus de datascienceiqss (17)

Citing Data in Journal Articles using JATS by Deborah A. Lapeyre
Citing Data in Journal Articles using JATS by Deborah A. LapeyreCiting Data in Journal Articles using JATS by Deborah A. Lapeyre
Citing Data in Journal Articles using JATS by Deborah A. Lapeyre
 
Big Data Repository for Structural Biology: Challenges and Opportunities by P...
Big Data Repository for Structural Biology: Challenges and Opportunities by P...Big Data Repository for Structural Biology: Challenges and Opportunities by P...
Big Data Repository for Structural Biology: Challenges and Opportunities by P...
 
iRODS/Dataverse Project by Jonathan Crabtree
iRODS/Dataverse Project by Jonathan CrabtreeiRODS/Dataverse Project by Jonathan Crabtree
iRODS/Dataverse Project by Jonathan Crabtree
 
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinai
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinaiDataTags: Sharing Privacy Sensitive Data by Michael Bar-sinai
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinai
 
DataTags: Sharing Privacy Sensitive Data by Latanya Sweeney
DataTags: Sharing Privacy Sensitive Data by Latanya SweeneyDataTags: Sharing Privacy Sensitive Data by Latanya Sweeney
DataTags: Sharing Privacy Sensitive Data by Latanya Sweeney
 
Data Analysis in Dataverse & Visualization of Datasets on Historical Maps by ...
Data Analysis in Dataverse & Visualization of Datasets on Historical Maps by ...Data Analysis in Dataverse & Visualization of Datasets on Historical Maps by ...
Data Analysis in Dataverse & Visualization of Datasets on Historical Maps by ...
 
Geospatial Data Visualization: WorldMap Integration by Raman Prasad
Geospatial Data Visualization: WorldMap Integration by Raman PrasadGeospatial Data Visualization: WorldMap Integration by Raman Prasad
Geospatial Data Visualization: WorldMap Integration by Raman Prasad
 
Sharing Data Through Plots with Plotly by Alex Johnson
Sharing Data Through Plots with Plotly by Alex JohnsonSharing Data Through Plots with Plotly by Alex Johnson
Sharing Data Through Plots with Plotly by Alex Johnson
 
TwoRavens: A Graphical, Browser-Based Statistical Interface for Data Reposito...
TwoRavens: A Graphical, Browser-Based Statistical Interface for Data Reposito...TwoRavens: A Graphical, Browser-Based Statistical Interface for Data Reposito...
TwoRavens: A Graphical, Browser-Based Statistical Interface for Data Reposito...
 
MIT Libraries Dataverse by Katherine McNeill
MIT Libraries Dataverse by Katherine McNeillMIT Libraries Dataverse by Katherine McNeill
MIT Libraries Dataverse by Katherine McNeill
 
American Journal of Political Science & The Odum Institute: Promoting Researc...
American Journal of Political Science & The Odum Institute: Promoting Researc...American Journal of Political Science & The Odum Institute: Promoting Researc...
American Journal of Political Science & The Odum Institute: Promoting Researc...
 
Political Analysis Dataverse by Jonathan N. Katz
Political Analysis Dataverse by Jonathan N. KatzPolitical Analysis Dataverse by Jonathan N. Katz
Political Analysis Dataverse by Jonathan N. Katz
 
Data in Brief and Dataverse: Incentivizing Authors to Share Data by Paige Sha...
Data in Brief and Dataverse: Incentivizing Authors to Share Data by Paige Sha...Data in Brief and Dataverse: Incentivizing Authors to Share Data by Paige Sha...
Data in Brief and Dataverse: Incentivizing Authors to Share Data by Paige Sha...
 
Data FAIRport Skunkworks: Common Repository Access Via Meta-Metadata Descript...
Data FAIRport Skunkworks: Common Repository Access Via Meta-Metadata Descript...Data FAIRport Skunkworks: Common Repository Access Via Meta-Metadata Descript...
Data FAIRport Skunkworks: Common Repository Access Via Meta-Metadata Descript...
 
Contributing Code to Dataverse by Gustavo Durand
Contributing Code to Dataverse by Gustavo DurandContributing Code to Dataverse by Gustavo Durand
Contributing Code to Dataverse by Gustavo Durand
 
Dataverse 4.0 UX by Elizabeth Quigley
Dataverse 4.0 UX by Elizabeth QuigleyDataverse 4.0 UX by Elizabeth Quigley
Dataverse 4.0 UX by Elizabeth Quigley
 
Towards a common deposit api (the dataverse example) Elizabeth Quigley + Phil...
Towards a common deposit api (the dataverse example) Elizabeth Quigley + Phil...Towards a common deposit api (the dataverse example) Elizabeth Quigley + Phil...
Towards a common deposit api (the dataverse example) Elizabeth Quigley + Phil...
 

Dernier

Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...RKavithamani
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 

Dernier (20)

Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 

Preservation of Research Data: Dataverse / Archivematica Integration by Allan Bell and Leanne Trimble

  • 1. Preservation of Research Data: Dataverse / Archivematica Integration Allan Bell | Associate University Librarian, The University of British Columbia Leanne Trimble | Data & Geospatial Librarian, OCUL Scholars Portal
  • 3. University of British Columbia Digital Preservation Strategy ● Digital Preservation Program ○ cIRcle, DSpace-based repository ○ Digitized collections in CONTENTdm ○ New and legacy born digital archival material ○ Websites (Archive-IT) ○ Soon, Abacus Dataverse, Research Data
  • 4. University of British Columbia Digital Preservation Strategy ● Use Archivematica as a tool to apply OAIS-compliant preservation processes ● Integrate Archivematica with existing systems used to manage digital objects ● Build internal technical and staff capacity
  • 6. Archivematica ● “a free and open source digital preservation system that is designed to maintain standards- based, long term access to collections of digital objects” http://www.archivematica.org ● micro-services provide integrated suite of software tools in compliance with ISO- OAIS model
  • 7. Digital Preservation Program CiRcle (Dspace) • Archivematica receives submissions from DSpace • Also have Archivematica to DSpace workflow
  • 8. Digital Preservation Program CONTENTdm • Master files uploaded to Archivematica • Archivematica produces access versions and pushes to CONTENTdm
  • 9. Digital Preservation Program RBSC/UA born-digital acquisition workflow
  • 10. Digital Preservation Program TRAC Self Audit • Trustworthy Repositories Audit and Certification (evolved into ISO 16363) • Widely accepted criteria for assessing trustworthiness of digital repositories • TRAC checklist is an auditing tool to assess the reliability, commitment and readiness of institutions to assume long-term preservation responsibilities
  • 11. What is TRAC? • The TRAC metrics assess three areas: a. Organizational Infrastructure - the repository's administrative, staffing, financial, and legal functions b. Digital Object Management - the handling of digital objects from ingest to access c. Technology, Technical Infrastructure and Security - the technology used to handle ingested objects • These criteria represent best practices and current thinking about the organizational and technological needs of trustworthy digital repositories.
  • 12. TRAC Compliant Repositories Centre for Research Libraries has audited and certified five repositories: •Chronopolis Report •CLOCKSS •Hathitrust Report •Portico Report •Scholars Portal
  • 13. Digital Preservation Program Conclusions • Greater comfort with and understanding of the challenges around archiving digitized and born digital material • Establishing a comprehensive digital preservation program is complex! • Having tools is important, also need policies and procedures for certification (if desired)
  • 14. Abacus Dataverse: Research Data Management ● UBC hosted instance for four Research Universities in British Columbia since 2014 ○ Abacus DSpace launched in 2009 ● 1,700 studies (more than 28,000 files) ● Actively used by researchers ● Each school has full control and added discoverability for their data ○ Licensed data but also growing institutional research data collections ○ Each institution has its own subnet with ■ OAI export to Summon (common Library Discovery Layer) ■ Separate Dataverses for institutional research data
  • 16. OCUL & Scholars Portal Who? • 21 university libraries in Ontario What? • Collective purchasing • Shared digital infrastructure • Collaborative planning and assessment How? *Scholars Portal* • OCUL’s shared technology infrastructure, housing shared collections More information: http://www.ocul.on.ca/
  • 17. OCUL/SP & Research Data Management Dataverse (OCUL hosted instance) – Hosted for OCUL since 2011 – 330 studies (about 4,000 files) – Actively used by researchers from 7-8 institutions – Many in social science disciplines but some in sciences (agriculture, polar research, geophysics, nursing…)
  • 18. OCUL/SP & Research Data Management • Services are evolving at each institution • Still trying to get a handle on: – RDM support services required by researchers – RDM infrastructure requirements – RDM costs – Role of regional consortia in RDM services
  • 19. OCUL/SP & Digital Preservation • Trustworthy Digital Repository (TDR) certified for electronic journal content (since 2013) • Currently working on Ontario Library Research Cloud (OLRC) project (2015 completion) •Data Preservation: strong interest
  • 21. ‘Portage’ Canadian Association of Research Libraries led project aimed at building a library-based research data management network 2 aspects: • Network of expertise for research data management • A national preservation and discovery network for research data
  • 24. Dataverse/Archivematica Integration Dataverse • Data • Metadata (DDI & other) Archivematica • Accept data and metadata • Perform preservation functions • Create Archival Information Packages (AIPs) Archival storage ? Local Data Repository (e.g. at SP or UBC) Preservation Infrastructure (Portage) Integration Middleware • Harvest content via Dataverse API (no SWORD client capability ATM) • Package and submit to Archivematica using SWORD
  • 25. Project Participants • Artefactual – Evelyn McLellan, Justin Simpson • Dataverse – Phil Durbin, Eleni Castro (& others) • Scholars Portal – Leanne Trimble, Alan Darnell • UBC – Allan Bell, Eugene Barsky • University of Alberta – Geoff Harder, Chuck Humphrey, Larry Laliberte, Peter Binkley • Simon Fraser University – Alex Garnett
  • 26. Functional Requirements ● Develop “middleware” which can transfer studies from Dataverse to Archivematica - Detect newly published studies & “major” new versions - Harvest released studies from Dataverse - Utilize SWORD protocol - Submit to Archivematica - One Dataverse study = 1 SIP = 1 AIP
  • 27. Functional Requirements (2) ● Investigate Archivematica pipeline decisions for data formats coming from Dataverse - File format normalization? - Connecting versions of the same dataset to one another? - Handling DDI (and other) metadata records?
  • 28. Possible features for future stages • Dataverse as a SWORD client • Mechanism within Dataverse for researchers to specify which datasets they want to target for preservation • Returning information from Archivematica back to Dataverse (indication of preservation status within Dataverse)
  • 29. Next Steps • University of Toronto procurement process underway to contract the development work to Artefactual • Develop the middleware (2015) • Recruit researchers to contribute data to ingest (concurrent with development work)