Poster RDAP13: A Workflow for Depositing to a Research Data Repository: A Case Study for Archiving Publication Data


Betsy Gunia, David Fearon, Benjamin Brosius, Tim DiLauro. JHU Data Management Services, Johns Hopkins University Sheridan Libraries. "A Workflow for Depositing to a Research Data Repository: A Case Study for Archiving Publication Data." Research Data Access & Preservation Summit 2013, Baltimore, MD, April 4, 2013. #rdap13


An Example Workflow for Depositing to a Research Data Repository:
A Case Study for Archiving Publication Data
Betsy Gunia, David Fearon, Benjamin Brosius, Tim DiLauro | JHU Data Management Services | Johns Hopkins University Sheridan Libraries | datamanagement@jhu.edu
Data
•Pilot project with two graduating doctoral students
•Biomedical engineering field. Largely image data
•Data already published, which differs from our usual
service model of working with researchers at the
beginning of their project
JHU Data Archive
•Used alpha-release of Data Conservancy software [1]
•Discipline-agnostic; treats data as primary objects
•A collection of data may have an associated
metadata file, structured or unstructured
•Not yet publicly accessible
Understanding Research
•Met with students for initial overview of research
•Read publications to map data products and the activities
that created them
•The resulting data flow diagram (Fig. 1) gave us a framework
to organize the data and to check whether all data were
included (students could not locate all their data)
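The mapping step above can be sketched in code. This is only an illustration of the idea, not the tooling used in the pilot: the figure labels, data product names, and the `missing_products` helper are all hypothetical, and it assumes each data product corresponds to a file or folder directly under one root directory.

```python
from pathlib import Path

# Hypothetical map from each publication figure to the data products it
# depends on; the names are illustrative, not taken from the pilot project.
DATA_FLOW = {
    "pub_fig_1": ["raw_images", "segmentation_masks"],
    "pub_fig_2": ["segmentation_masks", "volume_measurements.csv"],
}


def missing_products(data_flow, root):
    """Return {figure: [products not found under root]}, omitting complete figures."""
    root = Path(root)
    missing = {}
    for figure, products in data_flow.items():
        absent = [name for name in products if not (root / name).exists()]
        if absent:
            missing[figure] = absent
    return missing
```

A non-empty result gives a concrete list of items to ask the researchers about in the next meeting.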
Organizing Data
•Completed several in-depth meetings with students
•Created new folders and subfolders with students
present, and moved files to appropriate location
•Discussed data content, instrument(s) used, and file
naming conventions used, if any
•Experimented with directory structures based on
publication figures or research methods. Students
and advisor decided that organizing by figure was
more useful for data reuse
•Did not rename files due to time constraints and
lack of consistency in filenames
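A minimal sketch of organizing files by figure, under stated assumptions: `organize_by_figure` and the `figure_of` mapping are hypothetical, filenames are kept unchanged (as in the pilot), the target folder lies outside the source folder, and files are copied rather than moved so the originals stay untouched while the layout is still being discussed.

```python
import shutil
from pathlib import Path


def organize_by_figure(source_dir, target_dir, figure_of):
    """Copy each file into target_dir/<figure>/, keeping original filenames.

    figure_of maps a filename to a figure label (a hypothetical, hand-built
    mapping). Files without an entry are returned so they can be discussed
    with the researcher instead of being silently dropped.
    """
    source_dir, target_dir = Path(source_dir), Path(target_dir)
    unassigned = []
    for path in sorted(source_dir.rglob("*")):
        if not path.is_file():
            continue
        figure = figure_of.get(path.name)
        if figure is None:
            unassigned.append(path.name)
            continue
        dest_dir = target_dir / figure
        dest_dir.mkdir(parents=True, exist_ok=True)
        dest = dest_dir / path.name
        if dest.exists():
            # Two source files share a name; stop rather than overwrite one.
            raise FileExistsError(dest)
        shutil.copy2(path, dest)  # copy, leaving the originals in place
    return unassigned
```

Returning the unassigned files, rather than guessing a location for them, mirrors the practice of sorting files with the students present.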
Packaging
•Used BagIt (v. 0.97) and TAR as the packaging format
•Used MD5 checksums for data (payload) and tag files
•Created a documentation folder for our unstructured
metadata (Fig. 2), which we treated as a tag file and
not part of the payload
•One “bag” per publication
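The packaging steps above can be sketched with the Python standard library alone. This is an illustration under assumptions, not the procedure or tooling used in the pilot: the function names are hypothetical, the bag folder is assumed to already contain a `data/` payload folder and a `documentation/` folder, and only the required `bagit.txt` declaration plus the MD5 payload and tag manifests are written (no `bag-info.txt`).

```python
import hashlib
import tarfile
from pathlib import Path


def md5_of(path):
    """Return the hex MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def write_manifest(bag_dir, files, manifest_name):
    """Write '<md5>  <path relative to the bag>' lines for the given files."""
    lines = [f"{md5_of(f)}  {f.relative_to(bag_dir).as_posix()}\n" for f in sorted(files)]
    (bag_dir / manifest_name).write_text("".join(lines), encoding="utf-8")


def make_bag(bag_dir):
    """Add BagIt tag files to bag_dir, then serialize it as <bag_dir>.tar."""
    bag_dir = Path(bag_dir)
    (bag_dir / "bagit.txt").write_text(
        "BagIt-Version: 0.97\nTag-File-Character-Encoding: UTF-8\n", encoding="utf-8"
    )
    payload = [p for p in (bag_dir / "data").rglob("*") if p.is_file()]
    write_manifest(bag_dir, payload, "manifest-md5.txt")
    # Everything outside data/ counts as a tag file, including documentation/.
    tags = [
        p for p in bag_dir.rglob("*")
        if p.is_file()
        and p.relative_to(bag_dir).parts[0] != "data"
        and p.name != "tagmanifest-md5.txt"
    ]
    write_manifest(bag_dir, tags, "tagmanifest-md5.txt")
    tar_path = bag_dir.parent / (bag_dir.name + ".tar")
    with tarfile.open(tar_path, "w") as tar:
        tar.add(bag_dir, arcname=bag_dir.name)
    return tar_path
```

Calling `make_bag` once per publication folder would match the one-bag-per-publication choice. An established BagIt library may be preferable in production; the sketch only makes the file layout explicit.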
Conclusions
•Researchers find it hard to recall details about their data
after a few years. This pilot project reinforced the importance
of working with scientists early in their research, which is
our usual service model.
•Because of time constraints and the limits of the students'
recollection, our metadata creation was limited to folder and
file documentation (Fig. 2).
•Closely reading and mapping the students' research was central
to being able to ask them relevant questions about the data.
•The BagIt specification worked well for packaging.
Future Work
This pilot project began the process of formalizing our archiving
processes, but we have much more to do. The Data Conservancy
software will gain functionality over the coming years, which
will shape how our archiving process evolves. For example, we
currently cannot hide deposited data in the JHU Data Archive;
however, researchers may want to transfer data to us before their
project is complete and ready for public access. We also need to
develop rigorous processes for maintaining the integrity of the
data during the often significant alterations required to make
archived datasets useful to others.
Figure 1. Example of a data flow diagram.
Figure 2. Example of unstructured metadata: folder and file documentation.
[1] http://dataconservancy.org/software/
Copyright © 2013, by JHU Data Management Services
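One way to approach the integrity concern raised under Future Work is to compare file contents before and after a reorganization, ignoring where each file ended up. The sketch below is an assumption about how such a check might look, not an existing JHU process; `content_digests` and `compare_trees` are hypothetical names, and MD5 is used only to match the checksums chosen for the bags.

```python
import hashlib
from collections import Counter
from pathlib import Path


def content_digests(root):
    """Count MD5 digests of all files under root, regardless of path or name."""
    counts = Counter()
    for path in Path(root).rglob("*"):
        if path.is_file():
            # read_bytes loads the whole file; chunked reads suit large images better.
            counts[hashlib.md5(path.read_bytes()).hexdigest()] += 1
    return counts


def compare_trees(before, after):
    """Return (lost, added): digests whose counts dropped or rose after reorganizing."""
    old, new = content_digests(before), content_digests(after)
    return old - new, new - old
```

If both returned counters are empty, every file in the original tree has a byte-identical counterpart in the reorganized tree. A non-empty `lost` counter points to content that was dropped or altered along the way.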
