http://kulibrarians.g.hatena.ne.jp/kulibrarians/20170222
Presentation by Marta Teperek (University of Cambridge)
- Introduction to research data management (2016)
CC BY 4.0
1. OSC
Office of Scholarly Communication
Introduction to research data
management
Rosie Higman and Marta Teperek
Office of Scholarly Communication
Cambridge University Library
28th October 2016
2. OSC
Aims of today’s workshop:
1. Provide you with more confidence in data and information
management
2. Provide you with information about existing resources
3. Provide you with contact details which you can use whenever
you need advice
4. You can make it as interactive as you want – please use your
electronic devices
http://bit.ly/20161028RDM
3. OSC
Some housekeeping information:
1. Today: mixture of activities and us talking
2. You can ask questions any time you want
3. There is no fire alarm testing
4. Slides are available at:
http://bit.ly/20161028RDM
4. OSC
Plan for today:
We will cover:
1. Introduction
2. Backup and file sharing
3. How to organise your data well
4. Data sharing
5. …how to avoid problems => data management plans
Ask questions at any time
5. OSC
To start with…
• Do you have any questions about data or
information management that you hope
will be addressed during this workshop?
7. OSC
‘Data’ can mean a lot…
And many, many others:
- Raw instrument readings
- Processed data
- Analysed data
- Genomic data
- Microscopic photos, western
blot images and measurement
- Spreadsheets
- Videos
- Surveys and interviews
- Field notes
- Lab books
- Physical samples
- Protocols
9. OSC Disastrous data loss…
Credit: Peter Murray-Rust,
http://blogs.ch.cam.ac.uk/pmr/2011/08/01/why-you-need-a-data-management-plan/, August 2011, CC-BY
Department of Chemistry,
University of Cambridge
11. OSC
• your laptop got stolen
• your lab burnt
• you lost your USB stick
• your portable hard drive got damaged
• data from your Dropbox/Googledrive account
disappeared
How much of your data would you lose
if…?
13. OSC
Store at home!
Your
departmental
server
At least 2 backups at 2 locations:
Free software to manage backups (there is plenty of free software):
http://www.2brightsparks.com/download-syncbackfree.html
Copy ASAP!
15. OSC File sharing
• Google Drive… - caution!
– Do not use cloud storage to store restricted
data
• E-mail
• Website (Moodle, Confluence, Basecamp
etc.)
• FTP/SFTP
• University of Cambridge solutions
– OneDrive
– Dropbox
17. OSC
Cloud services comparison
Space included 1 TB Unlimited 15GB
Price (per
annum)
Free £55-66 Free (1 TB is
£95.88)
File history Yes 30 days Yes
Support available Webpages from UIS Webpages from UIS Unsupported
Where are files
stored?
Within EU Within EU Anywhere
Live editing Yes – dependent on
browser
Only with an Office 365
licence
Yes
What types of
data can be
shared?
Unclassified, Cambridge
only, or confidential data
Unclassified, Cambridge
only, or confidential data
Unclassified or
public data
23. OSC Data organisation:
• Consistent
• Meaningful to you and your colleagues
• Allow you to find files easily
• Would you be able to easily find your
data?
27. OSC How do you organise your physical samples?
Vincent Gaggioli
28. OSC Physical samples need to be managed as well
Marta Teperek, PhD physical sample organisation
29. OSC Physical samples need to be managed as well
• Create maps of your samples
– can be simple Excel spreadsheets
– and keep them up to date!
• Reference your samples:
– dates in notebooks
– supplier’s name/code
• Add any relevant notes
30. OSC File naming conventions – do they matter?
Copyright: http://10pm.com/
***
***
31. OSC File naming conventions – do they matter?
In 3 years time would you know what these are?
40. OSC
Research relies on the principle that we share
our findings
Ideas and results need to be shared to move
human knowledge forward
Open Research is fundamental to being an academic
41. OSC Science relies on the principle that we share our findings
From Dr Eric Turner:
https://figshare.com/articles/Peer_review_After_Results_are_Known_Are_we_PA
RKing_the_Cart_Before_the_Horse_/3381379
FDA record of clinical trials with 12 antidepressants:
Only positive results published
42. OSC
p-value 0.05: who is going to publish their results?
Non-positive results need to be shared not to waste time and resources
43. OSC Selfish reason: share once and don’t be bothered
or
http://dx.doi.org/10.1101/gr.201541.115
44. OSC Selfish reason: share once and don’t be bothered
http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE75164
45. OSC
Get access to shared data
https://researchdata.jiscinvolve.org/wp/2016/02/04/932/
46. OSC Protection against misconduct
http://dx.doi.org/10.1038/nature12968
http://dx.doi.org/10.1038/nature12969
49. OSC
Less time wasted
Less time wasted
http://dx.doi.org/10.1083/jcb.20070205310112013r
http://dx.doi.org/10.1016/j.devcel.2008.11.006
50. OSC
2010
Start of the PhD
2011
1 year of PhD gone
results not
reproduced
2012
2 years of PhD gone
results not reproduced
Looking for the original
data…
Less time wasted
54. OSC
• It took 6 years from the time of the original
publication (2007) to the final retraction
(2013)
• Time & resources wasted because data
was not available
(not to mention people’s careers!)
Less time wasted
56. OSC
Send us your paper within 3 months of acceptance, or before signing a copyright form
or choosing an Open Access option – whichever comes first.
Making your publication Open Access
www.openaccess.cam.ac.
uk
57. OSC Open Access to research data
“Publicly funded research data are a public good (…),
which should be made openly available with as few
restrictions as possible…”
www.rcuk.ac.uk/research/datapolicy
59. OSC How to share
data?
• Store data for (at least) 10 years
• Describe your data
• Deposit your data in suitable data repositories and add a link to your data in your
publication
• NCBI/GEO: http://www.ncbi.nlm.nih.gov/geo/
• For sensitive data:
• UK Data Service: reshare.ukdataservice.ac.uk/ or EGA:
www.ebi.ac.uk/ega/home
• Or other repositories (including Cambridge repository):
www.data.cam.ac.uk/repository
60. OSC Resources for working with personal/sensitive data
• University Ethics website
• Our website (University resources)
• MRC guidelines
• ESRC guidelines
72. OSC
Attribution
Must acknowledge
the author of the
work
Non-
commercial
Only the original
author can make
money
No-
derivatives
Cannot
change/remix the
work
ShareAlike
New creations must
be shared under the
same rules
Creative Commons Licenses
73. OSC Creative Commons Licenses
CC BY-NC-ND
CC BY-NC-SA
Attribution-Non-Commercial-
NoDerivatives Others can download and
share the work as long as the author is
credited, they are not changed in any way and
are not used commercially
Attribution-NonCommercial-ShareAlike
Distribute, remix, tweak and build upon the
work non-commercially as long as the author
is credited for the original creation and the
new work is licensed under identical terms
Attribution NoDerivatives Redistribution,
commercial and non-commercial, as long as
the work is unchanged and the original author
credited
CC BY-ND
74. OSC Creative Commons Licenses
Attribution-Non-Commercial Remix, tweak
and build upon a work non-commercially. New
works must acknowledge the original author
CC BY-NC
Attribution ShareAlike Remix, tweak and
build upon the work, even commercially as
long as the original author is credited and the
new creation licensed under identical terms
CC BY-SA
Attribution Distribute, remix, tweak and
build upon the work as long as the author is
credited for the original creation
CC-BY
83. OSC
• You have 2 mins to write your own data
plan
• Fill in just the top section and leave the
‘Comments’ section blank
Data management plan:
84. OSC
• Work in pairs and exchange your
plans
• You have 2 mins to write down
comments on each other’s plans
Data management plan:
85. OSC
• Now you have 2 minutes to exchange
feedback
Data management plan:
86. OSC
• You have created your first data management plan with
comments from a peer-reviewer
• You can get a simple example of a data management
plan at the end
(examples of DMPs supporting grant applications are at
http://www.data.cam.ac.uk/DMPsupport)
Data management plan:
87. OSC Today’s summary:
We have covered the following:
1. Backup and file sharing
2. How to organise your data well?
3. Data sharing
4. How to avoid problems: data management plans