SlideShare une entreprise Scribd logo
1  sur  14
Data-PASS:
How collaborative preservation works
Micah Altman, Harvard University

IASSIST 2010, Ithaca New York
                                       1
What’s next?

 ✤   What is Data-PASS?

 ✤   Challenges of preserving scientific evidence

 ✤   Converging trends

 ✤   Benefits of institutional collaboration

 ✤   Evolving structure of collaboration

 ✤   Services and infrastructure

                                                   2
How collaborative preservation works.
Collaborators and Co-Conspirators

 ✤   Margaret Adams, Caroline Arms, Ed Bachman, Nitin Borwankar,
     Adam Buchbinder, Ken Bollen, Bryan Beecher, Steve Burling,
     Jonathan Crabtree, Darrell Donakowski, Myron Gutmann, Gary
     King, Patrick King, Jared Lyle, Marc Maynard, Amy Pienta, Lois
     Timms-Ferrarra, Copeland Young.

 ✤   Research Support
     Thanks to the Library of Congress (PA#NDP03-1), the National
     Science Foundation (DMS-0835500, SES 0112072), IMLS
     (LG-05-09-0041-09), the Harvard University Library, the Institute for
     Quantitative Social Science, the Harvard-MIT Data Center, and the
     Murray Research Archive.
                                                                             3
How collaborative preservation works.
What is Data-PASS?

 ✤   Data-PASS is a broad-based partnership of data archives dedicated to acquiring and preserving data at-
     risk of being lost to the social science research community.

 ✤   Data-PASS partners have rescued thousands of data sets and created the largest catalog of social science
     data in existence.

 ✤   Data-PASS partners collaborate to

     ✤   identify and promote good archival practices,

     ✤   seek out at-risk research data,

     ✤   build preservation infrastructure,

     ✤   and mutually safeguard collections.

 ✤   Our current initiatives include:

     ✤   improving data citation practices,

     ✤   automatic policy-based archival replication
                                                                                                                4
How collaborative preservation works.
Challenges of Preserving
    Scientific Evidence
✤   Scientists expectations are changing

    ✤    Movements toward open access and open data

    ✤    Specialized workflow systems

    ✤    Diversity of approaches to managing replication and community data

✤   Scientific change creates technical challenges:

    ✤    Forms, formats, and research workflows change

    ✤    Data is not self-documenting

    ✤    Intellectual property & privacy law are evolving

    ✤    Resources to deal with these changes are limited

✤   Much of the empirical base of science becomes lost                                          Source: Wikimedia Commons



    ✤    Journal articles & books are only summaries

    ✤    Full replication is expensive or impossible

    ✤    This slows scientific progress:
         cooked results, publication bias, citation authority distortion, challenges of meta-
         analysis
                                                                                                                            5
How collaborative preservation works.
Converging trends in preservation
 ✤   Standardized criteria for evaluating trustworthiness of archives

     ✤   TRAC; NARA TDR; Drambora

 ✤   Collaborative stewardship by memory institutions

     ✤   Meta-Archive, CLOCKSS, COPUL, PeDALS, ADN, Chronopolis

 ✤   Technology for replication and verification

     ✤   Solutions developed within the library/archival community:
         LOCKSS, IRODS, ACE, Duraspace

     ✤   Commercial HPC and Cloud solutions:
         Hadoop, Crashplan, Mozy, AWS, etc.

     ✤   P2P sharing:
         freenet, gnunet, Taho-LAFS
                                                                        6
How collaborative preservation works.
Benefits of Collaboration
  "Nothing new that is really interesting comes without collaboration" -- James Watson



 ✤
     General Benefits
     ✤   Exposure to funding opportunities; collection development leads
     ✤   Division of labor in tracking law, technology, information science
     ✤   Combined experience in preservation practice
 ✤   Data-PASS Focus*
     ✤
         Expanded discoverability of collections
         ✤
             Reach new audiences
         ✤
             Holdings across the joint collection are more complete
         ✤
             Virtual collections can be built from slices of the joint collection
     ✤
         Development and advocacy of archival good practices
         ✤
             (Current initiative: outreach to professional associations in support of data citation)
     ✤
         Insurance against institutional and technological failure                                         7
How collaborative preservation works.             * And the museum of obsolete data storage technologies
How Collaborative Stewardship acts as
 Insurance Against Preservation Failure
 ✤   Collaborative replication & stewardship can substantially
     mitigate preservation risk from:

     ✤   External threats to institution failure:

         ✤   funding loss; attacks;
             legal regime change;
             mission drift

     ✤   Institutional failure:

         ✤   Unintentional curatorial modification;
             Loss of institutional knowledge;
             Change in mission

 ✤   And also reduce preservation risk from:

     ✤   Media failure (from storage & media characteristics);
         Software & hardware infrastructure failures
                                                                 8
How collaborative preservation works.
Shared Infrastructure
 ✤   Shared infrastructure can

     ✤   reduce costs

     ✤   reduce risk

     ✤   coordinate operations

     ✤   validate shared standards

 ✤   Data-PASS Shared Infrastructure

     ✤   Shared Catalog

     ✤   Policy-Driven Distributed Replication
         (in development)

     ✤   The Dataverse Network
         (overlapping infrastructure)
                                                 9
How collaborative preservation works.
Shared Catalog
 ✤   Unified Discovery

     ✤   Simple & fielded search

     ✤   Virtual collection across entire catalog

     ✤   Browse by subject, data, source

 ✤   Metadata delivery

     ✤   Descriptive study, file, and variable
         information

     ✤   Provenance & rights metadata

     ✤   Human, OAI, Z39.50 interfaces

 ✤   Layered Services

     ✤   Data reformatting for delivery

     ✤   On-line analysis
                                                    10
How collaborative preservation works.
The Dataverse Network ®
                       For Organizations                                              For Scholars




✤     Dataverses are Data-PASS ready -- all dataverses can provide:   ✤   The Dataverse Network System is Open-Source and
      ✤    DDI (2.x) metadata export (intuitive form-based entry)     ✤   Creating a Dataverse requires no software.
      ✤    Catalog access through OAI-PMH (and Z39.50)                ✤   IQSS & MRA host an open DVN and offer no-cost
      ✤    LOCKSS compatibility                                           permanent storage:

      ✤    Version control (new); Terms of use metadata; Flexible                  http://dvn.iq.harvard.edu
           contributor-curator-editor workflows
                                                                                                                            11
How   collaborative preservation works. (better) self-archiving
           -- ideal for “living collections” &
Policy-Driven Distributed Replication

 ✤   Policy Based

     ✤   Preservation requirements shape policy

     ✤   Policy drives replication rules

     ✤   Auditing demonstrates conformance with
         preservation requirements

 ✤   Copies are distributed

     ✤   Across space

     ✤   Among institutions

     ✤   Across time (version history retained)

 ✤   Commitments scaled to participant resources

     ✤   Collection size

     ✤   Technology
                                                   12
How collaborative preservation works.
Structure of Collaboration
Areas of collaboration...                                 Steps to participation
 ✤   Partnership agreements                                 Partners agree to...

     agreement on good practice;                             ✤   Publishing metadata
     permission to preserve;
     partners offer to accept data transfer if archive fails ✤   Use of replication system

 ✤   Coordinated operations                                  ✤   Good archival practice
                                                                 (TRAC compliance not required)
     shared leads;
     regular communication;
                                                             ✤   Transfer protocols
     collegial review available
                                                            Partners use the following technlogies
 ✤   Shared good practice                                    ✤   Light-weight protocols:
                                                                 OAI-PMH + DDI 2-lite +
     metadata; preservation; confidentiality
                                                                 HTTP harvestable data
 ✤   Circle of gifts norm                                    ✤   Software:
                                                                 Could use a hosted dataverse or;
     in-kind effort & resource;                                  install open source OAI-PMH server, etc.
     contributions are voluntary & proportional
                                                             ✤   No fear - we can help!                     13
More Questions?



 ✤   Know of research data at risk of loss?

 ✤   Need help preserving your research data?

 ✤   Want more visibility and protection for your collections ?

                               http://data-pass.org
                           data-pass@icpsr.umich.edu
                                                                  14
How collaborative preservation works.

Contenu connexe

Similaire à Data-PASS: How Collaborative Presentation Works

Research methods group accelarating impact by sharing data
Research methods group  accelarating impact by sharing dataResearch methods group  accelarating impact by sharing data
Research methods group accelarating impact by sharing data
World Agroforestry (ICRAF)
 

Similaire à Data-PASS: How Collaborative Presentation Works (20)

The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Research data spring: a consortial approach to RDM within SaS
Research data spring: a consortial approach to RDM within SaSResearch data spring: a consortial approach to RDM within SaS
Research data spring: a consortial approach to RDM within SaS
 
Long Term Preservation Dale Peters
Long Term Preservation Dale PetersLong Term Preservation Dale Peters
Long Term Preservation Dale Peters
 
Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing data
 
Creating a sustainable business model for a digital repository: the Dryad exp...
Creating a sustainable business model for a digital repository: the Dryad exp...Creating a sustainable business model for a digital repository: the Dryad exp...
Creating a sustainable business model for a digital repository: the Dryad exp...
 
The Dark Side of Digital Preservation: Distributed Digital Preservation
The Dark Side of Digital Preservation: Distributed Digital PreservationThe Dark Side of Digital Preservation: Distributed Digital Preservation
The Dark Side of Digital Preservation: Distributed Digital Preservation
 
Graham Pryor
Graham PryorGraham Pryor
Graham Pryor
 
DC101 UWE
DC101 UWEDC101 UWE
DC101 UWE
 
Supporting Libraries in Leading the Way in Research Data Management
Supporting Libraries in Leading the Way in Research Data ManagementSupporting Libraries in Leading the Way in Research Data Management
Supporting Libraries in Leading the Way in Research Data Management
 
Accelerating your research with Microsoft Azure
Accelerating your research with Microsoft AzureAccelerating your research with Microsoft Azure
Accelerating your research with Microsoft Azure
 
Introduction to research data management
Introduction to research data managementIntroduction to research data management
Introduction to research data management
 
FAIR BioData Management
FAIR BioData ManagementFAIR BioData Management
FAIR BioData Management
 
Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of...
Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of...Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of...
Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of...
 
Auditing Distributed Preservation Networks
Auditing Distributed Preservation Networks Auditing Distributed Preservation Networks
Auditing Distributed Preservation Networks
 
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShareScottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
 
Rebecca Grant - Archiving and Digital Preservation (Figshare Fest)
Rebecca Grant - Archiving and Digital Preservation (Figshare Fest)Rebecca Grant - Archiving and Digital Preservation (Figshare Fest)
Rebecca Grant - Archiving and Digital Preservation (Figshare Fest)
 
Converged IT and Data Commons
Converged IT and Data CommonsConverged IT and Data Commons
Converged IT and Data Commons
 
(Open) Research Data Management in H2020 (ISERD – Tel Aviv, Oct 31, 2016)
(Open) Research Data Management in H2020 (ISERD – Tel Aviv, Oct 31, 2016)(Open) Research Data Management in H2020 (ISERD – Tel Aviv, Oct 31, 2016)
(Open) Research Data Management in H2020 (ISERD – Tel Aviv, Oct 31, 2016)
 
Research methods group accelarating impact by sharing data
Research methods group  accelarating impact by sharing dataResearch methods group  accelarating impact by sharing data
Research methods group accelarating impact by sharing data
 

Plus de Micah Altman

SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
Micah Altman
 
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-NotsCreative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
Micah Altman
 
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
Micah Altman
 
Making Decisions in a World Awash in Data: We’re going to need a different bo...
Making Decisions in a World Awash in Data: We’re going to need a different bo...Making Decisions in a World Awash in Data: We’re going to need a different bo...
Making Decisions in a World Awash in Data: We’re going to need a different bo...
Micah Altman
 

Plus de Micah Altman (20)

Selecting efficient and reliable preservation strategies
Selecting efficient and reliable preservation strategiesSelecting efficient and reliable preservation strategies
Selecting efficient and reliable preservation strategies
 
Well-Being - A Sunset Conversation
Well-Being - A Sunset ConversationWell-Being - A Sunset Conversation
Well-Being - A Sunset Conversation
 
Matching Uses and Protections for Government Data Releases: Presentation at t...
Matching Uses and Protections for Government Data Releases: Presentation at t...Matching Uses and Protections for Government Data Releases: Presentation at t...
Matching Uses and Protections for Government Data Releases: Presentation at t...
 
Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019
Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019
Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019
 
Well-being A Sunset Conversation
Well-being A Sunset ConversationWell-being A Sunset Conversation
Well-being A Sunset Conversation
 
Can We Fix Peer Review
Can We Fix Peer ReviewCan We Fix Peer Review
Can We Fix Peer Review
 
Academy Owned Peer Review
Academy Owned Peer ReviewAcademy Owned Peer Review
Academy Owned Peer Review
 
Redistricting in the US -- An Overview
Redistricting in the US -- An OverviewRedistricting in the US -- An Overview
Redistricting in the US -- An Overview
 
A Future for Electoral Districting
A Future for Electoral DistrictingA Future for Electoral Districting
A Future for Electoral Districting
 
A History of the Internet :Scott Bradner’s Program on Information Science Talk
A History of the Internet :Scott Bradner’s Program on Information Science Talk  A History of the Internet :Scott Bradner’s Program on Information Science Talk
A History of the Internet :Scott Bradner’s Program on Information Science Talk
 
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
 
Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...
Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...
Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...
 
Utilizing VR and AR in the Library Space:
Utilizing VR and AR in the Library Space:Utilizing VR and AR in the Library Space:
Utilizing VR and AR in the Library Space:
 
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-NotsCreative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
 
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
 
Ndsa 2016 opening plenary
Ndsa 2016 opening plenaryNdsa 2016 opening plenary
Ndsa 2016 opening plenary
 
Making Decisions in a World Awash in Data: We’re going to need a different bo...
Making Decisions in a World Awash in Data: We’re going to need a different bo...Making Decisions in a World Awash in Data: We’re going to need a different bo...
Making Decisions in a World Awash in Data: We’re going to need a different bo...
 
Software Repositories for Research-- An Environmental Scan
Software Repositories for Research-- An Environmental ScanSoftware Repositories for Research-- An Environmental Scan
Software Repositories for Research-- An Environmental Scan
 
The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...
The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...
The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...
 
Gary Price, MIT Program on Information Science
Gary Price, MIT Program on Information ScienceGary Price, MIT Program on Information Science
Gary Price, MIT Program on Information Science
 

Dernier

The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 

Dernier (20)

ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Role Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptxRole Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptx
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 

Data-PASS: How Collaborative Presentation Works

  • 1. Data-PASS: How collaborative preservation works Micah Altman, Harvard University IASSIST 2010, Ithaca New York 1
  • 2. What’s next? ✤ What is Data-PASS? ✤ Challenges of preserving scientific evidence ✤ Converging trends ✤ Benefits of institutional collaboration ✤ Evolving structure of collaboration ✤ Services and infrastructure 2 How collaborative preservation works.
  • 3. Collaborators and Co-Conspirators ✤ Margaret Adams, Caroline Arms, Ed Bachman, Nitin Borwankar, Adam Buchbinder, Ken Bollen, Bryan Beecher, Steve Burling, Jonathan Crabtree, Darrell Donakowski, Myron Gutmann, Gary King, Patrick King, Jared Lyle, Marc Maynard, Amy Pienta, Lois Timms-Ferrarra, Copeland Young. ✤ Research Support Thanks to the Library of Congress (PA#NDP03-1), the National Science Foundation (DMS-0835500, SES 0112072), IMLS (LG-05-09-0041-09), the Harvard University Library, the Institute for Quantitative Social Science, the Harvard-MIT Data Center, and the Murray Research Archive. 3 How collaborative preservation works.
  • 4. What is Data-PASS? ✤ Data-PASS is a broad-based partnership of data archives dedicated to acquiring and preserving data at- risk of being lost to the social science research community. ✤ Data-PASS partners have rescued thousands of data sets and created the largest catalog of social science data in existence. ✤ Data-PASS partners collaborate to ✤ identify and promote good archival practices, ✤ seek out at-risk research data, ✤ build preservation infrastructure, ✤ and mutually safeguard collections. ✤ Our current initiatives include: ✤ improving data citation practices, ✤ automatic policy-based archival replication 4 How collaborative preservation works.
  • 5. Challenges of Preserving Scientific Evidence ✤ Scientists expectations are changing ✤ Movements toward open access and open data ✤ Specialized workflow systems ✤ Diversity of approaches to managing replication and community data ✤ Scientific change creates technical challenges: ✤ Forms, formats, and research workflows change ✤ Data is not self-documenting ✤ Intellectual property & privacy law are evolving ✤ Resources to deal with these changes are limited ✤ Much of the empirical base of science becomes lost Source: Wikimedia Commons ✤ Journal articles & books are only summaries ✤ Full replication is expensive or impossible ✤ This slows scientific progress: cooked results, publication bias, citation authority distortion, challenges of meta- analysis 5 How collaborative preservation works.
  • 6. Converging trends in preservation ✤ Standardized criteria for evaluating trustworthiness of archives ✤ TRAC; NARA TDR; Drambora ✤ Collaborative stewardship by memory institutions ✤ Meta-Archive, CLOCKSS, COPUL, PeDALS, ADN, Chronopolis ✤ Technology for replication and verification ✤ Solutions developed within the library/archival community: LOCKSS, IRODS, ACE, Duraspace ✤ Commercial HPC and Cloud solutions: Hadoop, Crashplan, Mozy, AWS, etc. ✤ P2P sharing: freenet, gnunet, Taho-LAFS 6 How collaborative preservation works.
  • 7. Benefits of Collaboration "Nothing new that is really interesting comes without collaboration" -- James Watson ✤ General Benefits ✤ Exposure to funding opportunities; collection development leads ✤ Division of labor in tracking law, technology, information science ✤ Combined experience in preservation practice ✤ Data-PASS Focus* ✤ Expanded discoverability of collections ✤ Reach new audiences ✤ Holdings across the joint collection are more complete ✤ Virtual collections can be built from slices of the joint collection ✤ Development and advocacy of archival good practices ✤ (Current initiative: outreach to professional associations in support of data citation) ✤ Insurance against institutional and technological failure 7 How collaborative preservation works. * And the museum of obsolete data storage technologies
  • 8. How Collaborative Stewardship acts as Insurance Against Preservation Failure ✤ Collaborative replication & stewardship can substantially mitigate preservation risk from: ✤ External threats to institution failure: ✤ funding loss; attacks; legal regime change; mission drift ✤ Institutional failure: ✤ Unintentional curatorial modification; Loss of institutional knowledge; Change in mission ✤ And also reduce preservation risk from: ✤ Media failure (from storage & media characteristics); Software & hardware infrastructure failures 8 How collaborative preservation works.
  • 9. Shared Infrastructure ✤ Shared infrastructure can ✤ reduce costs ✤ reduce risk ✤ coordinate operations ✤ validate shared standards ✤ Data-PASS Shared Infrastructure ✤ Shared Catalog ✤ Policy-Driven Distributed Replication (in development) ✤ The Dataverse Network (overlapping infrastructure) 9 How collaborative preservation works.
  • 10. Shared Catalog ✤ Unified Discovery ✤ Simple & fielded search ✤ Virtual collection across entire catalog ✤ Browse by subject, data, source ✤ Metadata delivery ✤ Descriptive study, file, and variable information ✤ Provenance & rights metadata ✤ Human, OAI, Z39.50 interfaces ✤ Layered Services ✤ Data reformatting for delivery ✤ On-line analysis 10 How collaborative preservation works.
  • 11. The Dataverse Network ® For Organizations For Scholars ✤ Dataverses are Data-PASS ready -- all dataverses can provide: ✤ The Dataverse Network System is Open-Source and ✤ DDI (2.x) metadata export (intuitive form-based entry) ✤ Creating a Dataverse requires no software. ✤ Catalog access through OAI-PMH (and Z39.50) ✤ IQSS & MRA host an open DVN and offer no-cost ✤ LOCKSS compatibility permanent storage: ✤ Version control (new); Terms of use metadata; Flexible http://dvn.iq.harvard.edu contributor-curator-editor workflows 11 How collaborative preservation works. (better) self-archiving -- ideal for “living collections” &
  • 12. Policy-Driven Distributed Replication ✤ Policy Based ✤ Preservation requirements shape policy ✤ Policy drives replication rules ✤ Auditing demonstrates conformance with preservation requirements ✤ Copies are distributed ✤ Across space ✤ Among institutions ✤ Across time (version history retained) ✤ Commitments scaled to participant resources ✤ Collection size ✤ Technology 12 How collaborative preservation works.
  • 13. Structure of Collaboration Areas of collaboration... Steps to participation ✤ Partnership agreements Partners agree to... agreement on good practice; ✤ Publishing metadata permission to preserve; partners offer to accept data transfer if archive fails ✤ Use of replication system ✤ Coordinated operations ✤ Good archival practice (TRAC compliance not required) shared leads; regular communication; ✤ Transfer protocols collegial review available Partners use the following technlogies ✤ Shared good practice ✤ Light-weight protocols: OAI-PMH + DDI 2-lite + metadata; preservation; confidentiality HTTP harvestable data ✤ Circle of gifts norm ✤ Software: Could use a hosted dataverse or; in-kind effort & resource; install open source OAI-PMH server, etc. contributions are voluntary & proportional ✤ No fear - we can help! 13
  • 14. More Questions? ✤ Know of research data at risk of loss? ✤ Need help preserving your research data? ✤ Want more visibility and protection for your collections ? http://data-pass.org data-pass@icpsr.umich.edu 14 How collaborative preservation works.

Notes de l'éditeur

  1. \n
  2. \n
  3. \n
  4. \n
  5. \n
  6. \n
  7. \n
  8. \n
  9. \n
  10. \n
  11. \n
  12. \n
  13. \n
  14. \n