The value of data curation as
part of the publishing process
Varsha Khodiyar, PhD
Biocuration 2019
Antarctica meltdown could double sea level rise
Data curation as part of publishing / 10th April 2019
A brief history of data curation at Springer Nature
• Scientific Data launched May 2014
• Novel manuscript format, the Data Descriptor
• Focus on data generation and data peer review
• Machine-readable metadata file generated by in-house curators (ISA-Tab format) for each published Data Descriptor
www.nature.com/scientificdata
Data Descriptors have human and machine
understandable components
Human-readable representation of the study, i.e. the article (HTML & PDF)
Machine-accessible representation of the study, i.e. metadata
Human-readable summary of the metadata
Output from Scientific Data’s curation process
Machine-readable overview of how sources and samples were turned into the digital data outputs.
The curator captures key dataset characteristics using ontology terms:
• Type of study
• What was measured
• How it was measured
• Any independent variables
• Sample characteristics e.g.
- Species
- Geographical location
- Environment type
scientificdata.isa-explorer.org
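The characteristics listed above can be pictured as a small structured record. The following is only a minimal sketch: the class names, field names and the "EX:" identifiers are hypothetical placeholders, not real ontology accessions, and the output is plain JSON rather than the ISA-Tab format that the curators actually produce.

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass(frozen=True)
class OntologyTerm:
    """A human-readable label paired with an ontology identifier."""
    label: str
    term_id: str  # the "EX:" IDs used below are made-up placeholders


@dataclass
class DatasetSummary:
    """Key dataset characteristics a curator might record for one study."""
    study_type: OntologyTerm
    measurement: OntologyTerm   # what was measured
    technology: OntologyTerm    # how it was measured
    independent_variables: list[OntologyTerm] = field(default_factory=list)
    sample_characteristics: dict[str, OntologyTerm] = field(default_factory=dict)

    def to_json(self) -> str:
        """Serialise the summary so that software, not only people, can read it."""
        return json.dumps(asdict(self), indent=2, sort_keys=True)


# Illustrative values only; none of these describe a real published dataset.
summary = DatasetSummary(
    study_type=OntologyTerm("observational study", "EX:0000001"),
    measurement=OntologyTerm("sea surface temperature", "EX:0000002"),
    technology=OntologyTerm("temperature sensor", "EX:0000003"),
    independent_variables=[OntologyTerm("sampling depth", "EX:0000004")],
    sample_characteristics={
        "species": OntologyTerm("Homo sapiens", "EX:0000005"),
        "geographical_location": OntologyTerm("Antarctica", "EX:0000006"),
        "environment_type": OntologyTerm("marine biome", "EX:0000007"),
    },
)

print(summary.to_json())
```

Pairing each label with an identifier is what makes the record searchable across datasets: two curators may word a label differently, but the identifier stays the same.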
Publishing a data paper with Scientific Data
• Deposit data in an appropriate repository
• Draft a manuscript based on the template
• Submit your manuscript
• Peer review of the manuscript
• Revise the manuscript as required
• Make any changes requested by the data curators
• The Data Descriptor is published
A brief history of data curation at Springer Nature
• Research Data Support service (RDS) launched April 2017
• Expansion of data curation practice to other Springer Nature journals
• Provide support and advice on research data sharing for authors and editors
• Promote best practice for sharing research data associated with a publication
www.springernature.com/la/authors/research-data
Springer Nature Research Data Support
To help authors and journals follow good practice in sharing and archiving research data, we provide optional data deposition and curation services.
• Researchers submit their data files securely
• The Research Data team curates the data and metadata
• The data are published and linked to the author’s paper
More information is available on our website:
http://www.springernature.com/gb/group/data-policy/data-support-services
Example of curation output from Research Data Support
• Comprehensive description including the data context of the study and data gathering method
• Altmetrics provide information on downloads and citations
• Relevant categories and keywords added to enhance discoverability of the data
• Dataset assigned a DOI
• Licence to clarify reuse conditions
Source: https://doi.org/10.6084/m9.figshare.5259415
Example author feedback report
Checks carried out by the curation team
• Most appropriate repository used?
• Data and metadata at the repository consistent with manuscript?
• Terms of use and terms of access for the data consistent with manuscript?
• Terms of data use consistent with journal policy?
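In practice these checks are judgments made by human curators. Purely as an illustration of how the four questions fit together, they could be sketched as comparisons over a submission record. Everything below is an assumption made for the example: the `Submission` fields, the `run_checks` function, and the repository and licence values passed in.

```python
from dataclasses import dataclass


@dataclass
class Submission:
    """Hypothetical record of what the manuscript and the repository each state."""
    repository: str
    manuscript_dataset_title: str
    repository_dataset_title: str
    manuscript_licence: str
    repository_licence: str


def run_checks(sub, recommended_repositories, licences_allowed_by_journal):
    """Return a list of issues; an empty list means nothing was flagged."""
    issues = []
    # 1. Most appropriate repository used?
    if sub.repository not in recommended_repositories:
        issues.append("Repository is not on the recommended list for this data type")
    # 2. Data and metadata at the repository consistent with the manuscript?
    if (sub.manuscript_dataset_title.strip().lower()
            != sub.repository_dataset_title.strip().lower()):
        issues.append("Dataset title differs between manuscript and repository")
    # 3. Terms of use and access consistent with the manuscript?
    if sub.manuscript_licence != sub.repository_licence:
        issues.append("Licence stated in manuscript differs from repository licence")
    # 4. Terms of data use consistent with journal policy?
    if sub.repository_licence not in licences_allowed_by_journal:
        issues.append("Repository licence is not permitted by journal policy")
    return issues


# Illustrative inputs only.
recommended = {"figshare"}
allowed = {"CC0", "CC BY 4.0"}

clean = Submission("figshare", "Ice core data", "ice core data ", "CC0", "CC0")
problem = Submission("personal website", "Ice core data", "Ice cores v2",
                     "CC BY 4.0", "All rights reserved")

print(run_checks(clean, recommended, allowed))
for issue in run_checks(problem, recommended, allowed):
    print("-", issue)
```

A real curation workflow would compare far more than titles and licence strings; the sketch only shows that each question amounts to a comparison between the manuscript, the repository record and the journal policy.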
Possible outcomes of curation
• Addition of missing information
• Error correction
• Suggestions to increase FAIRness
• Improvements to manuscript tables, text or figures to aid understanding and reuse of the work
• Data access or data license conditions updated at repository or manuscript to aid accessibility
• Repository metadata improved to aid dataset discoverability
• Improvements to file names and/or file structure at the repository to aid understanding and reuse of the work
• Manuscript text
• Manuscript figure
• Manuscript table
• Data files at the repository
Curation outcomes at Scientific Data (study period March 2018 to March 2019)
• 77% of manuscripts - no issues identified
• 23% of manuscripts - at least 1 issue identified and resolved
• 10% of manuscripts - errors identified and resolved
RDS curation outcomes (study period March 2018 to March 2019)
In 55% of RDS curation jobs, the curator suggested updates to the repository-hosted data files
• Sensitive data removed
• Missing data added
• License conditions updated
• File format & naming improved
• Mandated data moved to specialist repositories
• Supplementary Information moved to repository
• Opaque language clarified
We encourage the use of community endorsed ontologies,
standards and repositories where possible
springernature.com/gp/authors/research-data-policy/repositories/12327124
The Springer Nature research data curators
• Joseph Salter, Development Editor
• Tristan Matthews, Assistant Research Data Editor
• Graham Smith, Senior Research Data Editor
• Rebecca Grant, Research Data Manager
• Alexandra Philiastides, Assistant Research Data Editor
• Varsha Khodiyar, Data Curation Manager
Summary: Curation as part of the publishing process
• Springer Nature has had at least one research data curator since the launch of Scientific Data in 2014.
• Since 2017, data curation has been available as a separate service for increasing numbers of Springer Nature authors and editors.
• The Research Data team has built up significant expertise in the area of data publishing.
• Our curators are able to identify and help resolve both minor and major issues prior to articles and data being made public.
• Our curators increase the FAIRness of published research data.
- We focus on increasing the Findability and Accessibility of data and metadata in our curation processes.
- We encourage our authors to increase the Interoperability and Reusability of their data and metadata by using community ontologies for metadata and community research data infrastructure where this exists.
The story behind the image
Antarctica meltdown could double sea level rise
Researchers at Pennsylvania State University have been considering how quickly glacial ice melt in Antarctica would raise sea levels. By updating models with new discoveries and comparing them with past sea-level rise events, they predict that a melting Antarctica could raise oceans by more than 3 feet by the end of the century if greenhouse gas emissions continue unabated, roughly doubling previous total sea-level rise estimates. Rising seas could put many of the world’s coastlines underwater or at risk of flooding and storm surges.
Varsha Khodiyar, PhD
Data Curation Manager
varsha.khodiyar@nature.com
@varsha_khodiyar
go.nature.com/ResearchDataServices
researchdata.springernature.com
researchdata@springernature.com
nature.com/scientificdata
scientificdata@nature.com
@scientificdata

More Related Content

What's hot

EPSRC research data expectations and PURE for datasets
EPSRC research data expectations and PURE for datasetsEPSRC research data expectations and PURE for datasets
EPSRC research data expectations and PURE for datasetsEDINA, University of Edinburgh
 
Publishing perspectives on data management & future directions
Publishing perspectives on data management & future directionsPublishing perspectives on data management & future directions
Publishing perspectives on data management & future directionsARDC
 
RDA P16 - Repository Selection Criteria - Funders IG Breakout 8
RDA P16 - Repository Selection Criteria - Funders IG Breakout 8 RDA P16 - Repository Selection Criteria - Funders IG Breakout 8
RDA P16 - Repository Selection Criteria - Funders IG Breakout 8 Peter McQuilton
 
A National Approach to Open Data in Ireland: Publishers and Research Data Man...
A National Approach to Open Data in Ireland: Publishers and Research Data Man...A National Approach to Open Data in Ireland: Publishers and Research Data Man...
A National Approach to Open Data in Ireland: Publishers and Research Data Man...Rebecca Grant
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Historic Environment Scotland
 
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017ARDC
 
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...ARDC
 
Introduction to PANGAEA & EURO-BASIN Data Management, by Janine Felden
Introduction to PANGAEA & EURO-BASIN Data Management, by Janine FeldenIntroduction to PANGAEA & EURO-BASIN Data Management, by Janine Felden
Introduction to PANGAEA & EURO-BASIN Data Management, by Janine FeldenDTU - Technical University of Denmark
 
EPSRC Policy Compliance: What researchers need to know
EPSRC Policy Compliance: What researchers need to knowEPSRC Policy Compliance: What researchers need to know
EPSRC Policy Compliance: What researchers need to knowHistoric Environment Scotland
 
Repository Fringe 2016 - Survey Documentation and Analysis
Repository Fringe 2016 - Survey Documentation and AnalysisRepository Fringe 2016 - Survey Documentation and Analysis
Repository Fringe 2016 - Survey Documentation and AnalysisEDINA, University of Edinburgh
 
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...ASIS&T
 
Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries?Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries? Robin Rice
 
056-Science Europe Draft Proposal for a Sceince Europe position statement on ...
056-Science Europe Draft Proposal for a Sceince Europe position statement on ...056-Science Europe Draft Proposal for a Sceince Europe position statement on ...
056-Science Europe Draft Proposal for a Sceince Europe position statement on ...innovationoecd
 
RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...
RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...
RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...ASIS&T
 
Standardising research data policies, research data network
Standardising research data policies, research data networkStandardising research data policies, research data network
Standardising research data policies, research data networkJisc RDM
 
Data management plan format
Data management plan formatData management plan format
Data management plan formatWouter Gerritsma
 
Gold, silver, bronze - research data network
Gold, silver, bronze - research data networkGold, silver, bronze - research data network
Gold, silver, bronze - research data networkJisc RDM
 
Increasing research impact: the national data registry - Alex Ball - Jisc Dig...
Increasing research impact: the national data registry - Alex Ball - Jisc Dig...Increasing research impact: the national data registry - Alex Ball - Jisc Dig...
Increasing research impact: the national data registry - Alex Ball - Jisc Dig...Jisc
 

What's hot (20)

EPSRC research data expectations and PURE for datasets
EPSRC research data expectations and PURE for datasetsEPSRC research data expectations and PURE for datasets
EPSRC research data expectations and PURE for datasets
 
Publishing perspectives on data management & future directions
Publishing perspectives on data management & future directionsPublishing perspectives on data management & future directions
Publishing perspectives on data management & future directions
 
RDA P16 - Repository Selection Criteria - Funders IG Breakout 8
RDA P16 - Repository Selection Criteria - Funders IG Breakout 8 RDA P16 - Repository Selection Criteria - Funders IG Breakout 8
RDA P16 - Repository Selection Criteria - Funders IG Breakout 8
 
A National Approach to Open Data in Ireland: Publishers and Research Data Man...
A National Approach to Open Data in Ireland: Publishers and Research Data Man...A National Approach to Open Data in Ireland: Publishers and Research Data Man...
A National Approach to Open Data in Ireland: Publishers and Research Data Man...
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...
 
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
 
Hawkins "Implementation of the CONSER Standard Record"
Hawkins "Implementation of the CONSER Standard Record"Hawkins "Implementation of the CONSER Standard Record"
Hawkins "Implementation of the CONSER Standard Record"
 
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
 
Introduction to PANGAEA & EURO-BASIN Data Management, by Janine Felden
Introduction to PANGAEA & EURO-BASIN Data Management, by Janine FeldenIntroduction to PANGAEA & EURO-BASIN Data Management, by Janine Felden
Introduction to PANGAEA & EURO-BASIN Data Management, by Janine Felden
 
EPSRC Policy Compliance: What researchers need to know
EPSRC Policy Compliance: What researchers need to knowEPSRC Policy Compliance: What researchers need to know
EPSRC Policy Compliance: What researchers need to know
 
Repository Fringe 2016 - Survey Documentation and Analysis
Repository Fringe 2016 - Survey Documentation and AnalysisRepository Fringe 2016 - Survey Documentation and Analysis
Repository Fringe 2016 - Survey Documentation and Analysis
 
Strasser "Effective data management and its role in open research"
Strasser "Effective data management and its role in open research"Strasser "Effective data management and its role in open research"
Strasser "Effective data management and its role in open research"
 
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
 
Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries?Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries?
 
056-Science Europe Draft Proposal for a Sceince Europe position statement on ...
056-Science Europe Draft Proposal for a Sceince Europe position statement on ...056-Science Europe Draft Proposal for a Sceince Europe position statement on ...
056-Science Europe Draft Proposal for a Sceince Europe position statement on ...
 
RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...
RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...
RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...
 
Standardising research data policies, research data network
Standardising research data policies, research data networkStandardising research data policies, research data network
Standardising research data policies, research data network
 
Data management plan format
Data management plan formatData management plan format
Data management plan format
 
Gold, silver, bronze - research data network
Gold, silver, bronze - research data networkGold, silver, bronze - research data network
Gold, silver, bronze - research data network
 
Increasing research impact: the national data registry - Alex Ball - Jisc Dig...
Increasing research impact: the national data registry - Alex Ball - Jisc Dig...Increasing research impact: the national data registry - Alex Ball - Jisc Dig...
Increasing research impact: the national data registry - Alex Ball - Jisc Dig...
 

Similar to The value of data curation as part of the publishing process

re3data - Registry of Research Data Repositories
re3data -  Registry of Research Data Repositoriesre3data -  Registry of Research Data Repositories
re3data - Registry of Research Data RepositoriesHeinz Pampel
 
re3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositoriesre3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data RepositoriesHeinz Pampel
 
Open Data and Institutional Repositories
Open Data and Institutional RepositoriesOpen Data and Institutional Repositories
Open Data and Institutional RepositoriesRobin Rice
 
Records professionals and Research Data - a new role?
Records professionals and Research Data - a new role?Records professionals and Research Data - a new role?
Records professionals and Research Data - a new role?Rebecca Grant
 
Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...EDINA, University of Edinburgh
 
Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...Robin Rice
 
Ucla july 2018 natasha simons
Ucla july 2018 natasha simonsUcla july 2018 natasha simons
Ucla july 2018 natasha simonsARDC
 
Rebecca Grant - Publishers and RDM
Rebecca Grant - Publishers and RDMRebecca Grant - Publishers and RDM
Rebecca Grant - Publishers and RDMdri_ireland
 
Research Integrity Advisor and Data Management
Research Integrity Advisor and Data ManagementResearch Integrity Advisor and Data Management
Research Integrity Advisor and Data ManagementARDC
 
Edinburgh DataShare: Tackling research data in a DSpace institutional repository
Edinburgh DataShare: Tackling research data in a DSpace institutional repositoryEdinburgh DataShare: Tackling research data in a DSpace institutional repository
Edinburgh DataShare: Tackling research data in a DSpace institutional repositoryRobin Rice
 
GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...
GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...
GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...GrahamSmith646206
 
Rscd 2018 Journal policies - natasha simons
Rscd 2018 Journal policies - natasha simonsRscd 2018 Journal policies - natasha simons
Rscd 2018 Journal policies - natasha simonsARDC
 
Journal Data Sharing Policies rscd2018
Journal Data Sharing Policies rscd2018Journal Data Sharing Policies rscd2018
Journal Data Sharing Policies rscd2018SusanMRob
 
Adding valuethroughdatacuration
Adding valuethroughdatacurationAdding valuethroughdatacuration
Adding valuethroughdatacurationAPLICwebmaster
 
Birgit Schmidt: RDA for Libraries from an International Perspective
Birgit Schmidt: RDA for Libraries from an International PerspectiveBirgit Schmidt: RDA for Libraries from an International Perspective
Birgit Schmidt: RDA for Libraries from an International Perspectivedri_ireland
 
20230513taibif-datapaper-tutorial_en.pdf.pdf
20230513taibif-datapaper-tutorial_en.pdf.pdf20230513taibif-datapaper-tutorial_en.pdf.pdf
20230513taibif-datapaper-tutorial_en.pdf.pdfjhujyunjhang
 
The challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can helpThe challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can helpVarsha Khodiyar
 

Similar to The value of data curation as part of the publishing process (20)

re3data - Registry of Research Data Repositories
re3data -  Registry of Research Data Repositoriesre3data -  Registry of Research Data Repositories
re3data - Registry of Research Data Repositories
 
re3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositoriesre3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositories
 
Open Data and Institutional Repositories
Open Data and Institutional RepositoriesOpen Data and Institutional Repositories
Open Data and Institutional Repositories
 
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLANINCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
 
Records professionals and Research Data - a new role?
Records professionals and Research Data - a new role?Records professionals and Research Data - a new role?
Records professionals and Research Data - a new role?
 
Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...
 
Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...
 
Ucla july 2018 natasha simons
Ucla july 2018 natasha simonsUcla july 2018 natasha simons
Ucla july 2018 natasha simons
 
Rebecca Grant - Publishers and RDM
Rebecca Grant - Publishers and RDMRebecca Grant - Publishers and RDM
Rebecca Grant - Publishers and RDM
 
Research Integrity Advisor and Data Management
Research Integrity Advisor and Data ManagementResearch Integrity Advisor and Data Management
Research Integrity Advisor and Data Management
 
Edinburgh DataShare: Tackling research data in a DSpace institutional repository
Edinburgh DataShare: Tackling research data in a DSpace institutional repositoryEdinburgh DataShare: Tackling research data in a DSpace institutional repository
Edinburgh DataShare: Tackling research data in a DSpace institutional repository
 
GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...
GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...
GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...
 
Rscd 2018 Journal policies - natasha simons
Rscd 2018 Journal policies - natasha simonsRscd 2018 Journal policies - natasha simons
Rscd 2018 Journal policies - natasha simons
 
Journal Data Sharing Policies rscd2018
Journal Data Sharing Policies rscd2018Journal Data Sharing Policies rscd2018
Journal Data Sharing Policies rscd2018
 
Preparing Your Research Material for the Future - 2015-02-23 - Humanities Div...
Preparing Your Research Material for the Future - 2015-02-23 - Humanities Div...Preparing Your Research Material for the Future - 2015-02-23 - Humanities Div...
Preparing Your Research Material for the Future - 2015-02-23 - Humanities Div...
 
Adding valuethroughdatacuration
Adding valuethroughdatacurationAdding valuethroughdatacuration
Adding valuethroughdatacuration
 
Birgit Schmidt: RDA for Libraries from an International Perspective
Birgit Schmidt: RDA for Libraries from an International PerspectiveBirgit Schmidt: RDA for Libraries from an International Perspective
Birgit Schmidt: RDA for Libraries from an International Perspective
 
20230513taibif-datapaper-tutorial_en.pdf.pdf
20230513taibif-datapaper-tutorial_en.pdf.pdf20230513taibif-datapaper-tutorial_en.pdf.pdf
20230513taibif-datapaper-tutorial_en.pdf.pdf
 
The challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can helpThe challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can help
 
Introduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD StudentsIntroduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD Students
 

More from Varsha Khodiyar

Digital transformation to enable a FAIR approach for health data science
Digital transformation to enable a FAIR approach for health data scienceDigital transformation to enable a FAIR approach for health data science
Digital transformation to enable a FAIR approach for health data scienceVarsha Khodiyar
 
Lessons from the UK: Data access, patient trust & real-world impact with heal...
Lessons from the UK: Data access, patient trust & real-world impact with heal...Lessons from the UK: Data access, patient trust & real-world impact with heal...
Lessons from the UK: Data access, patient trust & real-world impact with heal...Varsha Khodiyar
 
COVID-19 variants, vaccines and tests
COVID-19 variants, vaccines and testsCOVID-19 variants, vaccines and tests
COVID-19 variants, vaccines and testsVarsha Khodiyar
 
COVID-19 variants and vaccines
COVID-19 variants and vaccinesCOVID-19 variants and vaccines
COVID-19 variants and vaccinesVarsha Khodiyar
 
Data citation and sharing during article publication
Data citation and sharing during article publicationData citation and sharing during article publication
Data citation and sharing during article publicationVarsha Khodiyar
 
The importance of research data repositories
The importance of research data repositoriesThe importance of research data repositories
The importance of research data repositoriesVarsha Khodiyar
 
What role can publishers play in the open data ecosystem?
What role can publishers play in the open data ecosystem?What role can publishers play in the open data ecosystem?
What role can publishers play in the open data ecosystem?Varsha Khodiyar
 
Five essentials factors for unlocking the potential for Open Research Data
Five essentials factors for unlocking the potential for Open Research Data Five essentials factors for unlocking the potential for Open Research Data
Five essentials factors for unlocking the potential for Open Research Data Varsha Khodiyar
 
Preparing your data for sharing and publishing
Preparing your data for sharing and publishingPreparing your data for sharing and publishing
Preparing your data for sharing and publishingVarsha Khodiyar
 
Facilitating good research data management practice as part of scholarly publ...
Facilitating good research data management practice as part of scholarly publ...Facilitating good research data management practice as part of scholarly publ...
Facilitating good research data management practice as part of scholarly publ...Varsha Khodiyar
 
Practical challenges for researchers in data sharing
Practical challenges for researchers in data sharingPractical challenges for researchers in data sharing
Practical challenges for researchers in data sharingVarsha Khodiyar
 
Update from Data policy standardisation and implementation IG
Update from Data policy standardisation and implementation IGUpdate from Data policy standardisation and implementation IG
Update from Data policy standardisation and implementation IGVarsha Khodiyar
 
Data peer review workshop
Data peer review workshopData peer review workshop
Data peer review workshopVarsha Khodiyar
 
Peer Reviewing Data: experiences from a data journal
Peer Reviewing Data: experiences from a data journalPeer Reviewing Data: experiences from a data journal
Peer Reviewing Data: experiences from a data journalVarsha Khodiyar
 
Data Publishing and Institutional Repositories
Data Publishing and Institutional RepositoriesData Publishing and Institutional Repositories
Data Publishing and Institutional RepositoriesVarsha Khodiyar
 
Gaining credit for sharing research data
Gaining credit for sharing research dataGaining credit for sharing research data
Gaining credit for sharing research dataVarsha Khodiyar
 
Data sharing as part of the research workflow
Data sharing as part of the research workflowData sharing as part of the research workflow
Data sharing as part of the research workflowVarsha Khodiyar
 
Data sharing as part of the research ecosystem
Data sharing as part of the research ecosystemData sharing as part of the research ecosystem
Data sharing as part of the research ecosystemVarsha Khodiyar
 
Workflows for Publishing Data; Scientific Data's experience as an early adopter
Workflows for Publishing Data; Scientific Data's experience as an early adopterWorkflows for Publishing Data; Scientific Data's experience as an early adopter
Workflows for Publishing Data; Scientific Data's experience as an early adopterVarsha Khodiyar
 
Clinical Data Publishing at Scientific Data
Clinical Data Publishing at Scientific DataClinical Data Publishing at Scientific Data
Clinical Data Publishing at Scientific DataVarsha Khodiyar
 

The value of data curation as part of the publishing process

  • 1. The value of data curation as part of the publishing process
      Varsha Khodiyar, PhD
      Biocuration 2019
      Antarctica meltdown could double sea level rise
  • 2. A brief history of data curation at Springer Nature
      Data curation as part of publishing / 10th April 2019
      • Scientific Data launched May 2014
      • Novel manuscript format, the Data Descriptor
      • Focus on data generation and data peer review
      • Machine-readable metadata file generated by in-house curators (ISA-Tab format) for each published Data Descriptor
      www.nature.com/scientificdata
  • 3. Data Descriptors have human and machine understandable components
      • Human-readable representation of the study, i.e. the article (HTML & PDF)
  • 4. Data Descriptors have human and machine understandable components
      • Machine-accessible representation of the study, i.e. metadata
      • Human-readable summary of the metadata
  • 5. Output from Scientific Data’s curation process
      Machine-readable overview of how sources and samples were turned into the digital data outputs.
      Curator captures key dataset characteristics using ontology terms:
      • Type of study
      • What was measured
      • How it was measured
      • Any independent variables
      • Sample characteristics, e.g.
          - Species
          - Geographical location
          - Environment type
      scientificdata.isa-explorer.org
  • 6. Publishing a data paper with Scientific Data
      1. Deposit data in an appropriate repository
      2. Draft a manuscript based on the template
      3. Submit your manuscript
      4. Peer review of the manuscript
      5. Revise the manuscript as required
      6. Make any changes requested by the data curators
      7. The Data Descriptor is published
  • 7. A brief history of data curation at Springer Nature
      • Research Data Support service (RDS) launched April 2017
      • Expansion of data curation practice to other Springer Nature journals
      • Provide support and advice on research data sharing, for authors and editors
      • Promote best practice for sharing research data associated with a publication
      www.springernature.com/la/authors/research-data
  • 8. Springer Nature Research Data Support
      To help authors and journals follow good practice in sharing and archiving research data, we provide optional data deposition and curation services.
      1. Researchers submit their data files securely
      2. The Research Data team curates the data and metadata
      3. The data are published and linked to the author’s paper
      More information is available on our website: http://www.springernature.com/gb/group/data-policy/data-support-services
  • 9. Example of curation output from Research Data Support
      • Comprehensive description, including the data context of the study and the data gathering method
      • Altmetrics provide information on downloads and citations
      • Relevant categories and keywords added to enhance discoverability of the data
      • Dataset assigned a DOI
      • Licence to clarify reuse conditions
      Source: https://doi.org/10.6084/m9.figshare.5259415
  • 10. Example author feedback report
  • 11. Checks carried out by the curation team
      • Most appropriate repository used?
      • Data and metadata at the repository consistent with manuscript?
      • Terms of use and terms of access for the data consistent with manuscript?
      • Terms of data use consistent with journal policy?
  • 12. Possible outcomes of curation
      • Addition of missing information
      • Error correction
      • Suggestions to increase FAIRness
      • Improvements to manuscript tables, text or figures to aid understanding and reuse of the work
      • Data access or data license conditions updated at repository or manuscript to aid accessibility
      • Repository metadata improved to aid dataset discoverability
      • Improvements to file names and/or file structure at the repository to aid understanding and reuse of the work
      Diagram labels: Manuscript text, Manuscript figure, Manuscript table, Data files at the repository
  • 13. Curation outcomes at Scientific Data (study period March 2018 to March 2019)
      • 77% of manuscripts - no issues identified
      • 23% of manuscripts - at least 1 issue identified and resolved
      • 10% of manuscripts - errors identified and resolved
  • 14. RDS curation outcomes (study period March 2018 to March 2019)
      In 55% of RDS curation jobs, the curator suggested updates to the repository-hosted data files.
      • Sensitive data removed
      • Missing data added
      • License conditions updated
      • File format & naming improved
      • Mandated data moved to specialist repositories
      • Supplementary Information moved to repository
      • Opaque language clarified
  • 15. We encourage the use of community-endorsed ontologies, standards and repositories where possible
  • 16. We encourage the use of community-endorsed ontologies, standards and repositories where possible
      springernature.com/gp/authors/research-data-policy/repositories/12327124
  • 17. The Springer Nature research data curators
      • Joseph Salter, Development Editor
      • Tristan Matthews, Assistant Research Data Editor
      • Graham Smith, Senior Research Data Editor
      • Rebecca Grant, Research Data Manager
      • Alexandra Philiastides, Assistant Research Data Editor
      • Varsha Khodiyar, Data Curation Manager
  • 18. Summary: Curation as part of the publishing process
      • Springer Nature has had at least one research data curator since the launch of Scientific Data in 2014.
      • Since 2017, data curation has been available as a separate service for increasing numbers of Springer Nature authors and editors.
      • The Research Data team has built up significant expertise in the area of data publishing.
      • Our curators are able to identify and help resolve both minor and major issues prior to articles and data being made public.
      • Our curators increase the FAIRness of published research data.
      • We focus on increasing the Findability and Accessibility of data and metadata in our curation processes.
      • We encourage our authors to increase the Interoperability and Reusability of their data and metadata, by using community ontologies for metadata and community research data infrastructure where this exists.
  • 19. The story behind the image: Antarctica meltdown could double sea level rise
      Researchers at Pennsylvania State University have been considering how quickly a glacial ice melt in Antarctica would raise sea levels. By updating models with new discoveries and comparing them with past sea-level rise events, they predict that a melting Antarctica could raise oceans by more than 3 feet by the end of the century if greenhouse gas emissions continue unabated, roughly doubling previous total sea-level rise estimates. Rising seas could put many of the world’s coastlines underwater or at risk of flooding and storm surges.
      Varsha Khodiyar, PhD, Data Curation Manager
      varsha.khodiyar@nature.com
      @varsha_khodiyar
      go.nature.com/ResearchDataServices
      researchdata.springernature.com
      researchdata@springernature.com
      nature.com/scientificdata
      scientificdata@nature.com
      @scientificdata
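
Slide 5 lists the dataset characteristics a curator captures with ontology terms. As a rough illustration only, the Python sketch below shows one way such a record might be held in memory. It is not Scientific Data's curation tooling and not the ISA-Tab format: the class names, field names and the `EX:` identifiers are all hypothetical placeholders invented for this example.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass(frozen=True)
class OntologyTerm:
    label: str
    term_id: str  # placeholder identifier, NOT a real ontology accession


@dataclass
class DatasetCharacteristics:
    # The three single-valued characteristics named on slide 5.
    study_type: Optional[OntologyTerm] = None
    measured: Optional[OntologyTerm] = None   # what was measured
    method: Optional[OntologyTerm] = None     # how it was measured
    # Optional, possibly multi-valued characteristics.
    independent_variables: List[OntologyTerm] = field(default_factory=list)
    sample_characteristics: Dict[str, OntologyTerm] = field(default_factory=dict)

    def uncaptured(self) -> List[str]:
        """Return the names of single-valued characteristics still empty."""
        single_valued = {
            "study_type": self.study_type,
            "measured": self.measured,
            "method": self.method,
        }
        return [name for name, term in single_valued.items() if term is None]


# Hypothetical record with the method not yet curated.
record = DatasetCharacteristics(
    study_type=OntologyTerm("example study type", "EX:0000001"),
    measured=OntologyTerm("example measurement", "EX:0000002"),
    sample_characteristics={
        "species": OntologyTerm("example species", "EX:0000003"),
    },
)
print(record.uncaptured())
```

Keeping each characteristic as a label plus identifier pair, instead of free text, is what would let a record like this be compared or searched across datasets. Which ontologies supply the identifiers is left open here.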