Data Papers in the Network Era


MacKenzie Smith
Research Director, MIT Libraries
WHY DATA SHARING
IS IMPORTANT
Some rights reserved by mod as hell
“The NIH expects and supports the timely release and sharing of final research data from NIH-supported studies for use by other researchers.”

“Starting with the October 1, 2003 receipt date, investigators submitting an NIH application seeking $500,000 or more in direct costs in any single year are expected to include a plan for data sharing or state why data sharing is not possible.”
Application Guide for NIH and Other PHS Agencies
“Investigators are expected to share with other researchers, at no more than incremental cost and within a reasonable time, the primary data, samples, physical collections and other supporting materials created or gathered in the course of work under NSF grants. Grantees* are expected to encourage and facilitate such sharing.”

* Grantee = Research University or similar
“Proposals must include a supplementary document of no more than two pages labeled ‘Data Management Plan’”
including…
policies for access and sharing including provisions for appropriate protection of privacy, confidentiality, security, intellectual property, or other rights or requirements;
policies and provisions for re-use, re-distribution, and the production of derivatives;

NSF Grant Proposal Guide, January 2011
Some rights reserved by NASA Goddard Space Flight Center
WHY DATA SHARING
IS DIFFICULT
some rights reserved by jamieca
Some rights reserved by DARTProject
REUSABLE DATA IS…


structured, versioned and well-documented
formatted for long-term access
archived (backed up and secure)
findable and citable
legally unrestricted or clear usage policy
CONSIDER THE DATA PAPER
Data Paper

“a formal publication whose primary purpose is to expose and describe data, as opposed to analyze and draw conclusions from it.”

http://neurocommons.org/report/data-publication.pdf
“The objective of the Journal is to provide critically evaluated physical and chemical property
data, fully documented as to the original sources and the criteria used for
evaluation, preferably with uncertainty analysis.”
FIG. 1. Temperature and pressure ranges of the experimental thermal conductivity data for normal hydrogen.

J. Phys. Chem. Ref. Data 40, 033101 (2011)
© 2011 American Institute of Physics
1. Organize peer review, establish quality-control measures

2. Create citable entity

3. Establish cross-linking mechanisms with traditional papers, to enforce separation of concerns (methodology vs analysis)

4. Specify required documentation to make data re-usable, re-purposable

5. Apply standard interoperable legal license (CC0 or PDDL with normative attribution or CC-BY with URI attribution)

6. Ensure archiving strategy in place
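
As a rough illustration only, the six steps above could be treated as a checklist that a data paper record either passes or fails. The sketch below assumes a hypothetical `DataPaperRecord` structure and a hypothetical `missing_requirements` helper; none of these names come from the talk or from any existing publishing system, and the license codes are just short labels for the licenses named in step 5.

```python
from dataclasses import dataclass, field

# Short labels for the licenses named in step 5 (labels are this sketch's own).
ACCEPTED_LICENSES = {"CC0", "PDDL", "CC-BY"}


@dataclass
class DataPaperRecord:
    """Hypothetical record for one data paper; field names are illustrative."""
    peer_reviewed: bool = False                         # step 1
    identifier: str = ""                                # step 2: citable entity
    linked_papers: list = field(default_factory=list)   # step 3: cross-links
    documentation: str = ""                             # step 4
    license: str = ""                                   # step 5
    archive_location: str = ""                          # step 6


def missing_requirements(record):
    """Return the checklist steps that the record does not yet satisfy."""
    missing = []
    if not record.peer_reviewed:
        missing.append("peer review")
    if not record.identifier:
        missing.append("citable identifier")
    if not record.linked_papers:
        missing.append("cross-links to traditional papers")
    if not record.documentation:
        missing.append("documentation")
    if record.license not in ACCEPTED_LICENSES:
        missing.append("standard license")
    if not record.archive_location:
        missing.append("archiving strategy")
    return missing


if __name__ == "__main__":
    # Placeholder values, not a real dataset or identifier.
    draft = DataPaperRecord(peer_reviewed=True, identifier="example-id", license="CC0")
    for step in missing_requirements(draft):
        print("still needed:", step)
```

A real workflow would likely check each step in more depth (for example, that the identifier resolves), which this sketch does not attempt.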
DATA PUBLISHING
INFRASTRUCTURE
WEB IDENTIFIERS



I2 (Institutional Identifiers)
WEB DATA STANDARDS
WEB VISUALIZATION TOOLS
ROLES AND RESPONSIBILITIES?
ROLES

Publishers: Springer, Nature, BMC, PLoS, WoS
Scholarly Societies: APS, ACS, ACM, Sage Commons
Institutions: Libraries, IT Centers, Research Admin
Research Groups
Funders: Governments, Foundations
Tech Companies: e.g. Microsoft, Oracle, Mendeley, Zotero
Data Centers: institutional, disciplinary, commercial
Some rights reserved by The University of Iowa Libraries
Portrait of Ada Lovelace. Some rights reserved by Aristocrat
THANK YOU
