Presentation given at Biocuration 2019 Session 5 (Data standards and ontologies: Making data FAIR)
Abstract:
Since 2016, academic publishers including Springer Nature, Elsevier and Taylor & Francis have been providing standard research data policies to journal authors, reflecting key aspects of the FAIR Principles’ practical applications: sharing data in repositories, using persistent identifiers and citing data appropriately. In spite of the rise of FAIR and good data management practice, recent surveys found that nearly 60% of researchers had never heard of the FAIR Principles, and 46% are not sure how to organise their data in a presentable and useful way. In this presentation we will analyse the results of a white paper which assessed the key challenges faced by researchers in sharing their data, and discuss current initiatives and approaches to support researchers to adopt good data sharing practice.
These include the roll-out of research data policies since 2016, as well as the launch of a Helpdesk service which has provided support to authors and allowed the research data team to capture more granular information on the challenges they face in sharing their data. We will also discuss the development of a third-party curation service which assists authors in depositing their data into appropriate repositories, and drafting data availability statements.
Finally we will assess the impacts of some of these interventions, including an analysis of data availability statements and an overview of the methods authors are currently using to share their data, and how these align with FAIR.
VIRUSES structure and classification ppt by Dr.Prince C P
New approaches to data management: supporting FAIR data sharing at Springer Nature
1. New approaches to data
management: supporting FAIR
data sharing at Springer Nature
Varsha Khodiyar, PhD
Biocuration 2019
IllustrationinspiredbytheworkofJohnMaynardKeynes
2. 1
New approaches to data management: supporting FAIR data sharing at Springer Nature
Springer Nature is a leading research, educational and professional publisher, providing
quality content to our communities through a range of innovative platforms, products
and services.
Home to brands including Springer, Nature Research, BioMed Central, Palgrave
Macmillan and Scientific American.
As the leading open access publisher, we see the rise of open research in all its
manifestations as one of the major forces reshaping the way that researchers
communicate and collaborate to advance the pace and quality of discovery.
Our focus is on investing in and creating tools, services or training that help the research
community to understand and utilise new ideas and concepts.
3. 2
New approaches to data management: supporting FAIR data sharing at Springer Nature
Practical Challenges for Researchers
in Data Sharing white paper
A global survey of nearly 8000
researchers
Stuart, David; Baynes, Grace; Hrynaszkiewicz, Iain; Allin, Katie;
Penny, Dan; Lucraft, Mithu; Astell, Mathias (2018): Whitepaper:
Practical challenges for researchers in data sharing
https://doi.org/10.6084/m9.figshare.5975011.v1
4. 3
New approaches to data management: supporting FAIR data sharing at Springer Nature
Private sharing of data is more common than public sharing
of data
Global levels of data sharing:
• Poland – 76% (highest)
• Germany – 75%
• UK – 58%
• USA – 55%
5. 4
New approaches to data management: supporting FAIR data sharing at Springer Nature
6. 5
New approaches to data management: supporting FAIR data sharing at Springer Nature
Are researchers aware of FAIR?
The State of Open Data Report 2018,
https://doi.org/10.6084/m9.figshare.7195058.v2
7. 6
New approaches to data management: supporting FAIR data sharing at Springer Nature
1. Data journals at Springer Nature
8. 7
New approaches to data management: supporting FAIR data sharing at Springer Nature
2. Springer Nature standardized journal data policies
• Launched data policy standardisation initiative in 2016
• More than 1,500 (~60%) Springer Nature journals have adopted a standard
research data policy
• All policies support the use of community specific standards, mandates and
repositories
• All policies and journals promote data citation
• Similar initiatives since introduced by other publishers
Standardising and harmonising research data policy in scholarly publishing
Iain Hrynaszkiewicz, Aliaksandr Birukou, Mathias Astell, Sowmya Swaminathan, Amye
Kenall, Varsha Khodiyar
International Journal of Digital Curation 2017 https://doi.org/10.2218/ijdc.v12i1.531
9. 8
New approaches to data management: supporting FAIR data sharing at Springer Nature
How do journal policies improve FAIRness of published
data?
A journal policy may be the first time that a researcher has been
encouraged (or required) to share their research data openly
The policies always recommend that data are shared in suitable
repositories rather than made available on request or uploaded as
supplementary material
The policies encourage any data used to be cited in the reference list,
to allow it to be found and accessed more easily
Mandatory aspects of the policy are checked before the article is
published
10. 9
New approaches to data management: supporting FAIR data sharing at Springer Nature
3. Research Data Helpdesk
Queries are answered within two business days
Run by members of the Springer Nature Research Data team
Expertise in data curation and management, archiving and
digital preservation, copyright and licensing, Open Access
publishing
Always encourage best practices, e.g. the use of
community repositories for specific data types
Email: researchdata@springernature.com
http://www.springernature.com/gp/group/data-policy/helpdesk
11. 10
New approaches to data management: supporting FAIR data sharing at Springer Nature
2018 queries to our Helpdesk
59
33
33
24
5
3
3
2
1
Repositories and depositing data
Policy compliance
Research Data Support
Data availability and access
Other
Data citation
Policy implementation
Copyright and licensing
Data types
0 10 20 30 40 50 60 70
n = 163
12. 11
New approaches to data management: supporting FAIR data sharing at Springer Nature
4. Research Data Support service
• Research Data Support
service (RDS) launched April
2017
• Provide support and advice
on research data sharing, for
authors and editors
• Promote best practice for
sharing research data
associated with a publication
www.springernature.com/la/authors/research-data
13. 12
New approaches to data management: supporting FAIR data sharing at Springer Nature
Analysing DAS across published articles allows us to assess how authors choose
to share their data
Analyzing the impact via data availability statements (DAS)
“The datasets generated during and/or analysed
during the current study are available in the [NAME]
repository, [PERSISTENT WEB LINK TO DATASETS].”
“The datasets generated during and/or analysed
during the current study are available from the
corresponding author on reasonable request.”
“All data generated or analysed during this study are
included in this published article (and its
supplementary information files).”
14. 13
New approaches to data management: supporting FAIR data sharing at Springer Nature
Analysing Data Availability at Nature journals
An analysis of data sharing at Nature Research journals
(including Nature, Nature Ecology and Evolution, Nature Human Behaviour, Nature
Microbiology, Nature Plants, Nature Medicine, Nature Methods, Nature Biotechnology,
Nature Genetics, Nature Chemical Biology, Nature Immunology, Nature Structural and
Molecular Biology, Nature Astronomy, Nature Chemistry, Nature Climate Change, Nature
Materials, Nature Energy, Nature Nanotechnology, Nature Biomedical Engineering and
Nature Photonics)
Data availability statements were coded to assess what methods authors are using to
share the data that accompanies their published articles.
The disciplinary focus of each journal was also captured
15. 14
New approaches to data management: supporting FAIR data sharing at Springer Nature
Coding the Data Availability Statements
Type 1 stated that the data is available from the author on request.
Type 2 stated that the data had been included in the manuscript or its supplementary
material.
Type 3 stated that some or all of the data is publicly available, for example in a
repository.
Type 4 stated that figure source data was included with the manuscript. This is a method
of data sharing used by some authors in a subset of Nature journals that publish life
sciences research. Some journals encourage authors to provide the source data behind
their figures/plots as spreadsheets.
Type 1 statement
16. 15
New approaches to data management: supporting FAIR data sharing at Springer Nature
Results of the analysis
Statement type by discipline
21
46
7
116
23
13
2
28
31
86
23
35
6
4
0
20
40
60
80
100
120
140
160
180
200
Chemistry and
applied science
Life sciences Multidisciplinary Physical sciences
Type 4 statement
Type 3 statement
Type 2 statement
Type 1 statement
17. 16
New approaches to data management: supporting FAIR data sharing at Springer Nature
Does using Research Data Support encourage authors to
link to their data?
An analysis of nearly 150 papers published by authors who received
Research Data Support
108
12
1
0
20
40
60
80
100
120
Yes No N/A
Number of authors,
n=122
Was a data DOI included in the author’s published paper?
18. 17
New approaches to data management: supporting FAIR data sharing at Springer Nature
5. Incentives for data sharing: Open data badges pilot
Currently piloting on BMC Microbiology
Does the application of badges to published
papers which share their data openly:
• Affect levels of data sharing by authors?
• Affect reader engagement with articles and
their supporting data, and perceptions of
article quality?
And can we assess the benefits and costs of
consistently assessing and awarding open data
badges for a Springer Nature journal?
19. 18
New approaches to data management: supporting FAIR data sharing at Springer Nature
Open data badges pilot: BMC Microbiology
Read a blogpost on the project here:
https://tinyurl.com/y8atpy8a
20. 19
New approaches to data management: supporting FAIR data sharing at Springer Nature
Summary: Publisher interventions to increase data FAIRness
• Publisher policies for data sharing can help to encourage data sharing, even
in disciplines where it is not currently common practice.
• Researchers may not be familiar with the FAIR principles, but they can still
be supported to increase the FAIRness of their data.
• Different levels of support may be required, from advice to hands-on data
curation.
• Analysis of data sharing methods can be used over time to assess changes
in practice.
• Providing hands on support leads to good data sharing practice by authors.
21. 20
New approaches to data management: supporting FAIR data sharing at Springer Nature
20
The story behind the image
John Maynard Keynes (1883–1946)
John Maynard Keynes was a British economist who
revolutionised the theory and practice of macroeconomics,
reformed economics and had a profound influence on
economic policy. This illustration represents the Keynesian
model which shows that in a monetary economy it is
possible to have periods of high unemployment unless
governments use active monetary and fiscal policy to
stimulate aggregate demand.
Varsha Khodiyar, PhD
Data Curation Manager
varsha.khodiyar@nature.com
@varsha_khodiyar
go.nature.com/ResearchDataServices
researchdata.springernature.com
researchdata@springernature.com
With thanks to Rebecca Grant for
pulling together this slide deck!