SlideShare une entreprise Scribd logo
1  sur  32
Introduction to Research Data Management
                                  For postgraduate Students

                  University of Northampton, 20th February 2013




                                        Marieke Guy
                                   DCC, University of Bath
                                    m.guy@ukoln.ac.uk
                                                                                                                      Funded by:
    This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 2.5 UK: Scotland
    License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/2.5/scotland/ ; or,
    (b) send a letter to Creative Commons, 543 Howard Street, 5th Floor, San Francisco, California, 94105, USA.
Today’s Talk…

Will consider:
• What are data
• What is the research process
• What is research data management and why does it matter
• How you can start thinking about good research data
  management
   We hope you leave able to explain why research data
   management is important and what roles postgraduate
   students / researchers play.
What are data?
• The lowest level of abstraction from which information and
  knowledge are derived
• Research data are collected, observed or created, for the
  purposes of analysis to produce and validate original
  research results
• Both analogue and digital materials are 'data'

• Digital data can be:
   • created in a digital form ("born digital")
   • converted to a digital form (digitised)
What is the research process?




           Research
            Process




                           •Research360
What is research data management?
• Data curation is “the active
  management and appraisal
  of data
• Over the lifecycle of
  scholarly and scientific
  interest”
• Data have importance as
  the evidential base
• of scholarly conclusions
• Curation is part of good
  research practice
Why carry out RDM?
Managing your data properly:
                                               Sharing data can
• Can prevent data loss
                                               increase citation
• Makes researching easier
• Helps with validation of results and
  research integrity
• Offers new research opportunities and
  collaborations
• Is inline with the UON policy released in
  June 2011                                 More citations: 69% ↑

• Is important for UON’s reputation         (Piwowar, 2007 in PLoS)

• Is part of good research practice
Data lifecycle
                                                  1. What data will you
                                                     produce?
           5.
      Preservation
                                  1.              2. How will you organise the
                                Create
       & Re-Use
                                                     data?
                                                  3. Can you/others understand
                                                     the data
    4.
                                          2.
Publication
& Deposit
                                     Active Use
                                                  4. What data will be deposited
                                                     and where?
                     3.
                Documentation                     5. Who will be interested in
                                                     re-using the data?
Conceptualise and plan

Activities
• define a research question and design your methodology
• bid for funding (incl. data management and sharing plans)
• plan data creation (capture methods, standards, formats)
• data management plans!

Roles
PGR student, supervisory team, sponsors / funding bodies, IT,
research governance, ethics panel

Decisions made now have an impact on every other stage of the
lifecycle, so it is worth getting things right from the start!
1. What data will you produce?

                                                  • What type of data will
           5.
                                                    you produce?
                                  1.
      Preservation
                                Create
       & Re-Use
                                                  • What types of file
                                                    format?

    4.
                                                  • How easy is it to create or
                                          2.
Publication
& Deposit
                                     Active Use     reproduce?

                                                  • Who owns it and is
                     3.
                Documentation                       responsible for it?
Data can take many forms
• Notebooks & lab books
• Instrument measurements
• Experimental observations
                                           •http://www.flickr.com/photos/charleswelch/3
                                                            597432481//
• Still images, video & audio
• Consent forms                              •http://www.google.co.uk/imgres?
                                            q=illumina+bgi&hl=en&client=firefox-
                                                a&hs=Jl2&rls=org.mozilla:en-

• Text corpuses                                   GB:official&biw=1366&bih




• Models & software
• Survey results & interview transcripts
Data types
        Data Type                     Value                     Example
Observational data          Usually irreplaceable      Sensor readings,
captured around the time                               telemetry, survey results,
of the event                                           neuro-images
Experimental data from      Often reproducible but can Gene sequence,
lab equipment               be expensive               chromatograms, toroid
                                                       magnetic field readings
Simulation data generated   Model and metadata         Climate models, economic
from test models            (inputs) more important    models
                            than output data.

                            Large modules can take a
                            lot of computer time to
                            reproduce
Derived or compiled data    Reproducible               Text and data mining,
                            (but very expensive)       compiled databases, 3D
                                                       models
Who owns or is responsible for
           your data?
Ownership
• Data ownership is complex, often defined on a case-by-case
  basis
• May be dependent on individual contractual agreements
• Contracts define needs of the University, staff, students,
  funders, collaborators

Management
• Contact the University of Northampton ethics department
Responsibilities for data
In practice - Everyone plays their part

If you’re generating and using data, you should:
   • Comply with guidelines from your group, department, faculty, collaborators
   • Make sure your data is securely stored and backed up
   • Describe your data so that you/others can understand it in future

If you’re managing a project, you should:
   • Be fully aware of funder, collaborator and publisher requirements
   • Ensure you have access to group data
   • Assess what should be published and/or archived


• More info: http://www.data-archive.ac.uk/create-manage
2. How will you look after your data?


           5.
                                  1.
      Preservation
       & Re-Use
                                Create
                                                  • Is your data safe?

                                                  • Is your data organised?
    4.
Publication
                                          2.
                                     Active Use   • Can you find your data?
& Deposit



                     3.
                Documentation
Storage and Security
 3… 2… 1… Backup!
    at least 3 copies of a file
    on at least 2 different media
    with at least 1 offsite

 Test file recovery
    At set up time and on a regular basis

 Access
    Protect your hardware
    If sensitive use file encryption
    Keep passwords safe (e.g. Keypass)
    At least 2 people should have access to your data
More info: http://www.data-archive.ac.uk/create-manage/storage
Storage & Security – Back up options
       Media                  Advantages                     Disadvantages
 CDs or DVDs         • Useful for quick restore in   •   Static capture of data
                       the event of minor disaster   •   Not built to last
                                                     •   Vulnerable to theft
                                                     •   Physical loss of media
 External hard       • Dynamic capture of data       •   Must store securely and
 drives, including   • Useful for quick restore in       remotely to original copy
 dropbox               the event of minor disaster   •   Vulnerable to theft
                                                     •   Must use file encryption if
                                                         sensitive
 Northampton         • Resilient backup              •   Lack of offline access
 server                                              •   Must have a Northampton
                     • TUNDRA2 will replace              account
 Digital scans of    • Can carry out using campus    • Manipulation of page
 lab books             printer                         content difficult
Can you find your data?
A Clear Directory Structure
• Top level folder and substructure
File Version Control
• Discard obsolete versions if no longer needed after making
   backups
• Manage using: File naming (see below), version control software
   (e.g. Git, Mercurial, SVN)
File Naming Conventions
• Record any naming conventions or abbreviations used e.g.
   [Experiment]_[Reagent]_[Instrument]_[YYYYMMDD].dat
• Date/time stamp or use a separate ID (e.g. v1) for each version
More info: http://www.jiscdigitalmedia.ac.uk/crossmedia/advice/
   choosing-a-file-name/
3. Documenting data

                                                  • Do you still understand
                                                    your older work?
           5.
                                  1.
      Preservation
                                Create
       & Re-Use
                                                  • Is the file structure /
                                                    naming understandable
                                                    to others?
    4.
                                          2.
Publication
& Deposit
                                     Active Use
                                                  • Which data will be kept?

                     3.                           • Which data can be
                Documentation
                                                    discarded?
Understanding your data
• Students:
   • Will you be able to write up your methods at the end of
     your studies?
• Project leads:
   • Will you be able to respond to reviewers comments?
   • Will you be able to find the information you need for final
     project reports?
• Can you reproduce your work if you need to?
• What information would someone else need to replicate
  your work?
Understanding your data
Do you know how you generated your data?
   • Equipment or software used
   • Experimental protocol
   • Other things included in (e.g.) a lab notebook
   • Can reference a published article, if it covers everything
Are you able to give credit to external sources of data?
   • Include details of where the data are held, identified & accessed
   • Cite a publication describing the data
   • Cite the data itself e.g.
Metadata
 • Contextual information for data is called metadata — literally
   data about data
 • Data repositories & archives require some generic metadata,
   e.g. author, title, publication date
 • For data to be useful, it will also need subject-specific
   metadata e.g. reagent names, experimental conditions,
   population demographic
 • Record contextual information in a text file (such as a ‘read
   me’ file) in the same directory as the data e.g.
        • codes for categorical survey responses
        • ‘999 indicates a dummy value in the data’
More info: http://www.data-archive.ac.uk/create-manage/document
3.What data will be deposited
                  & where?
                                                  • Are you expected to
                                                    share your data?
           5.
                                  1.
      Preservation
       & Re-Use
                                Create
                                                  • Are you allowed to
                                                    share your data?

                                                  • Define the core data set
    4.
Publication
                                          2.
                                     Active Use
                                                    of the project
& Deposit


                                                  • Which data will be
                     3.
                Documentation                       included in your
                                                    publication / thesis?
Data Sharing – Why share your
             data?
• Share with your future self – avoid repeating research!

• Promote your research – get cited!

• Enable new discoveries

• Replication

• Store your data in a reliable archive

• Comply with funding requirements
Requirements to share your data
• Some journal publishers have a policy on data availability.
• Most UK funders now expect research data to be made
  publically available – UON policy
• Are you making any of your data available as supplementary
  information?
• Is there sufficient information with the data so that it can be
  understood and reused?

                declaration
    data are a public good and          Code of good research conduct
    should be openly available      data should be preserved and accessible
                                                 for 10 years +
         Funders’ data policies        Common principles on data policy
www.dcc.ac.uk/resources/policy-and-     www.rcuk.ac.uk/research/Pages/
legal/funders-data-policies                      DataPolicy.aspx
Restrictions on sharing your data

Are there privacy requirements from the funders or commercial
partners? e.g. personal data, high security data
   • You might not have the right to share data collected from
     other sources
   • It depends upon whether those data were licensed and
     have terms of use
   • Most databases are licensed and prohibit redistribution of
     data without permission
   • If you are uncertain as to your rights to disseminate data,
     check with the ethics department
How to share your data
• Deposit in a data repository eg. GenBank, UKDA
• Data can be licensed
• Culture of data sharing: can make available your data under a CC-
  BY or CC0 declaration to make this explicit
       • CC-BY license permits reuse but requires attribution
       • CC0 declaration is a waiver of copyright.
       • Note that laws about data vary in different countries.

• You may have rights to first use or to commercial exploit data
  How to license research data:
  http://www.dcc.ac.uk/resources/how-guides/license-research-
5. Preservation and reuse

                                                  • How long will your data
           5.
                                                    be reusable for?
                                  1.
      Preservation
                                Create
       & Re-Use
                                                  • Do you need to prepare
                                                    your data for long term
                                                    archive?
    4.
                                          2.
Publication
                                     Active Use
& Deposit
                                                  • Which data do you need
                                                    to keep?
                     3.
                Documentation
Data retention and archiving
How permanent are the data?
  • Short term (e.g. 3-5 years)
  • Long term (e.g. 10 years)
  • Indefinite

Should discarded data be destroyed?
   • Keep all versions? Just final version? First and last?

What are the re-processing costs?
  • Keep only software and protocol/methodology information

Are there tools/software needed to create, process or visualise
the data? Archive these with your data
Selection and appraisal
 Make a start on selection and appraisal from as early a point as possible.

 Plan for what you think you’ll need to keep to support your research findings. What is
  the minimum you’ll need to support your findings over time?
 Know who you are keeping it the data for and what you want them to be able do with
  it (is for yourself only, or for other people too?). This may affect the way you keep it
  and what you keep.
 Conversely, know what you need to dispose of. Destruction is often vital to ensure
  compliance with legal requirements.
 Appraise for the here and now but with an eye to the future.

 Think about resources required / available. These will affect you selection and
  appraisal decisions.
 Look for relationships with other data sets in your archive/repository as part of the
  appraisal process (i.e. does the dataset augment another collection significantly?).
 Some funders stipulate that you must identify whether the data exists already. This
  process might highlight additional datasets that your new research might augment
  significantly.
File formats for long term access
 •   Unencrypted
 •   Uncompressed
 •   Non-proprietary/patent-encumbered
 •   Open, documented standard
 •   Standard representation (ASCII, Unicode)
          Type               Recommended                Avoid for data sharing
  Tabular data      CSV, TSV, SPSS portable           Excel
  Text              Plain text, HTML, RTF             Word
                    PDF/A only if layout matters
  Media             Container: MP4, Ogg               Quicktime
                    Codec: Theora, Dirac, FLAC        H264
  Images            TIFF, JPEG2000, PNG               GIF, JPG
  Structured data   XML, RDF                          RDBMS

Further examples: http://www.data-archive.ac.uk/create-manage/format/formats-table
Summary
• Data management is important at all stages of a project
• There are tools available to help you
• Keep your data safe: Back up your data and test your back-
  ups
• Keep your data organised
   • Find it – good formats and file names
   • Understand it - check documentation and metadata
• Consider publishing your data so that you can get recognition
  for your work
• Help is available at:
  http://researchsupporthub.northampton.ac.uk/contact-us/
Thanks - any questions?

                  Acknowledgements:
Thanks to Research360, DCC staff, UK Data Archive, Mantra
             (University of Edinburgh) for slides

Contenu connexe

Tendances

Research topic analysis
Research topic analysisResearch topic analysis
Research topic analysisHarve Abella
 
Searching Pubmed and Medline
Searching Pubmed and MedlineSearching Pubmed and Medline
Searching Pubmed and MedlineAshley D'Andrea
 
MYP General Grade 1-7 Descriptors
MYP General Grade 1-7 DescriptorsMYP General Grade 1-7 Descriptors
MYP General Grade 1-7 Descriptorsalisonyangkis
 
Triangulation: An Approach to establish Credibility and Dependability of Qual...
Triangulation: An Approach to establish Credibility and Dependability of Qual...Triangulation: An Approach to establish Credibility and Dependability of Qual...
Triangulation: An Approach to establish Credibility and Dependability of Qual...sankarprasadmohanty
 
Scientific integrity
Scientific integrityScientific integrity
Scientific integrityscromartie123
 
Data privacy impact assessment
Data privacy impact assessmentData privacy impact assessment
Data privacy impact assessmentStephen Owen
 
Research ethics & scientific misconduct
Research ethics & scientific misconductResearch ethics & scientific misconduct
Research ethics & scientific misconductDr Ghaiath Hussein
 
Case Study in Qualitative Research
 Case Study in Qualitative Research Case Study in Qualitative Research
Case Study in Qualitative ResearchROOHASHAHID1
 
Ethics In Research
Ethics In ResearchEthics In Research
Ethics In ResearchGrant Heller
 
Grounded theory research method
Grounded theory   research methodGrounded theory   research method
Grounded theory research methodMarie Faith Cayas
 
Qualitative codes and coding
Qualitative codes and coding Qualitative codes and coding
Qualitative codes and coding Heather Ford
 
Scientific publications and peer review ethics
Scientific publications and   peer review ethicsScientific publications and   peer review ethics
Scientific publications and peer review ethicsAboul Ella Hassanien
 
How to conduct systematic literature review
How to conduct systematic literature reviewHow to conduct systematic literature review
How to conduct systematic literature reviewKashif Hussain
 
Scientific report writing 3 results and discussion
Scientific report writing 3  results and discussionScientific report writing 3  results and discussion
Scientific report writing 3 results and discussionJacklyn Kong
 

Tendances (20)

Research methodology
Research methodologyResearch methodology
Research methodology
 
Research Ethics
Research EthicsResearch Ethics
Research Ethics
 
Budgeting study
Budgeting studyBudgeting study
Budgeting study
 
Research topic analysis
Research topic analysisResearch topic analysis
Research topic analysis
 
Searching Pubmed and Medline
Searching Pubmed and MedlineSearching Pubmed and Medline
Searching Pubmed and Medline
 
MYP General Grade 1-7 Descriptors
MYP General Grade 1-7 DescriptorsMYP General Grade 1-7 Descriptors
MYP General Grade 1-7 Descriptors
 
Triangulation: An Approach to establish Credibility and Dependability of Qual...
Triangulation: An Approach to establish Credibility and Dependability of Qual...Triangulation: An Approach to establish Credibility and Dependability of Qual...
Triangulation: An Approach to establish Credibility and Dependability of Qual...
 
Scientific integrity
Scientific integrityScientific integrity
Scientific integrity
 
Data privacy impact assessment
Data privacy impact assessmentData privacy impact assessment
Data privacy impact assessment
 
Research Methods: Searches & Systematic Reviews
Research Methods: Searches & Systematic ReviewsResearch Methods: Searches & Systematic Reviews
Research Methods: Searches & Systematic Reviews
 
Research ethics & scientific misconduct
Research ethics & scientific misconductResearch ethics & scientific misconduct
Research ethics & scientific misconduct
 
Case Study in Qualitative Research
 Case Study in Qualitative Research Case Study in Qualitative Research
Case Study in Qualitative Research
 
Research design
Research design Research design
Research design
 
Apply Risk Management to Computerized and Automated Systems
Apply Risk Management to Computerized and Automated SystemsApply Risk Management to Computerized and Automated Systems
Apply Risk Management to Computerized and Automated Systems
 
Ethics In Research
Ethics In ResearchEthics In Research
Ethics In Research
 
Grounded theory research method
Grounded theory   research methodGrounded theory   research method
Grounded theory research method
 
Qualitative codes and coding
Qualitative codes and coding Qualitative codes and coding
Qualitative codes and coding
 
Scientific publications and peer review ethics
Scientific publications and   peer review ethicsScientific publications and   peer review ethics
Scientific publications and peer review ethics
 
How to conduct systematic literature review
How to conduct systematic literature reviewHow to conduct systematic literature review
How to conduct systematic literature review
 
Scientific report writing 3 results and discussion
Scientific report writing 3  results and discussionScientific report writing 3  results and discussion
Scientific report writing 3 results and discussion
 

Similaire à Introduction to Research Data Management for postgraduate students

Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycleMarieke Guy
 
University of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchersUniversity of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchersJez Cope
 
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅kulibrarians
 
Research data lifecycle diagram
Research data lifecycle diagramResearch data lifecycle diagram
Research data lifecycle diagramSteven Cracknell
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...Projeto RCAAP
 
Data Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersData Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersRebekah Cummings
 
Support Your Data, Kyoto University
Support Your Data, Kyoto UniversitySupport Your Data, Kyoto University
Support Your Data, Kyoto UniversityStephanie Simms
 
e-Science, Research Data and Libaries
e-Science, Research Data and Libariese-Science, Research Data and Libaries
e-Science, Research Data and LibariesRob Grim
 
Data Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach DataData Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach Datacunera
 
Conquering Chaos in the Age of Networked Science: Research Data Management
Conquering Chaos in the Age of Networked Science: Research Data ManagementConquering Chaos in the Age of Networked Science: Research Data Management
Conquering Chaos in the Age of Networked Science: Research Data ManagementKathryn Houk
 
Introduction to digital curation
Introduction to digital curationIntroduction to digital curation
Introduction to digital curationGarethKnight
 
Creating a Data Management Plan
Creating a Data Management PlanCreating a Data Management Plan
Creating a Data Management PlanKristin Briney
 
Preventing data loss
Preventing data lossPreventing data loss
Preventing data lossIUPUI
 
Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016IzzyChad
 
Guy avoiding-dat apocalypse
Guy avoiding-dat apocalypseGuy avoiding-dat apocalypse
Guy avoiding-dat apocalypseENUG
 
Introduction to research data management
Introduction to research data managementIntroduction to research data management
Introduction to research data managementopl10
 
Planning for Research Data Management
Planning for Research Data ManagementPlanning for Research Data Management
Planning for Research Data Managementdancrane_open
 

Similaire à Introduction to Research Data Management for postgraduate students (20)

Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycle
 
University of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchersUniversity of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchers
 
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
 
What is-rdm
What is-rdmWhat is-rdm
What is-rdm
 
Research data lifecycle diagram
Research data lifecycle diagramResearch data lifecycle diagram
Research data lifecycle diagram
 
Managing your research data
Managing your research dataManaging your research data
Managing your research data
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
Data Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersData Management for Undergraduate Researchers
Data Management for Undergraduate Researchers
 
Support Your Data, Kyoto University
Support Your Data, Kyoto UniversitySupport Your Data, Kyoto University
Support Your Data, Kyoto University
 
e-Science, Research Data and Libaries
e-Science, Research Data and Libariese-Science, Research Data and Libaries
e-Science, Research Data and Libaries
 
Data Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach DataData Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach Data
 
Conquering Chaos in the Age of Networked Science: Research Data Management
Conquering Chaos in the Age of Networked Science: Research Data ManagementConquering Chaos in the Age of Networked Science: Research Data Management
Conquering Chaos in the Age of Networked Science: Research Data Management
 
Introduction to digital curation
Introduction to digital curationIntroduction to digital curation
Introduction to digital curation
 
Creating a Data Management Plan
Creating a Data Management PlanCreating a Data Management Plan
Creating a Data Management Plan
 
Preventing data loss
Preventing data lossPreventing data loss
Preventing data loss
 
Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016
 
Guy avoiding-dat apocalypse
Guy avoiding-dat apocalypseGuy avoiding-dat apocalypse
Guy avoiding-dat apocalypse
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Introduction to research data management
Introduction to research data managementIntroduction to research data management
Introduction to research data management
 
Planning for Research Data Management
Planning for Research Data ManagementPlanning for Research Data Management
Planning for Research Data Management
 

Plus de Marieke Guy

Ways to ensure “buy in” from the academics in the transition to digitised ass...
Ways to ensure “buy in” from the academics in the transition to digitised ass...Ways to ensure “buy in” from the academics in the transition to digitised ass...
Ways to ensure “buy in” from the academics in the transition to digitised ass...Marieke Guy
 
Assessing for a World Beyond Assessment
Assessing for a World Beyond AssessmentAssessing for a World Beyond Assessment
Assessing for a World Beyond AssessmentMarieke Guy
 
The blandness is its formulaic style’: insights to help understand the impact...
The blandness is its formulaic style’: insights to help understand the impact...The blandness is its formulaic style’: insights to help understand the impact...
The blandness is its formulaic style’: insights to help understand the impact...Marieke Guy
 
Redesigning assessments for a world with artificial intelligence
Redesigning assessments for a world with artificial intelligenceRedesigning assessments for a world with artificial intelligence
Redesigning assessments for a world with artificial intelligenceMarieke Guy
 
Closing remarks: Assessment with Phill Dawson
Closing remarks: Assessment with Phill DawsonClosing remarks: Assessment with Phill Dawson
Closing remarks: Assessment with Phill DawsonMarieke Guy
 
The UCL assessment journey
The UCL assessment journeyThe UCL assessment journey
The UCL assessment journeyMarieke Guy
 
The UCL lockdown browser pilot
The UCL lockdown browser pilotThe UCL lockdown browser pilot
The UCL lockdown browser pilotMarieke Guy
 
Assessment in the time of change
Assessment in the time of changeAssessment in the time of change
Assessment in the time of changeMarieke Guy
 
Digital Assessment Team 2022 - a day in the life.pptx
Digital Assessment Team 2022 - a day in the life.pptxDigital Assessment Team 2022 - a day in the life.pptx
Digital Assessment Team 2022 - a day in the life.pptxMarieke Guy
 
The Digital Assessment Marathon 
The Digital Assessment Marathon The Digital Assessment Marathon 
The Digital Assessment Marathon Marieke Guy
 
Inspired assessments
Inspired assessmentsInspired assessments
Inspired assessmentsMarieke Guy
 
Designing alternative assessments
Designing alternative assessmentsDesigning alternative assessments
Designing alternative assessmentsMarieke Guy
 
MCQs_ The joys of making your mind up.pdf
MCQs_ The joys of making your mind up.pdfMCQs_ The joys of making your mind up.pdf
MCQs_ The joys of making your mind up.pdfMarieke Guy
 
Rubrics_ removing the glitch in the assessment matrix (1).pdf
Rubrics_ removing the glitch in the assessment matrix (1).pdfRubrics_ removing the glitch in the assessment matrix (1).pdf
Rubrics_ removing the glitch in the assessment matrix (1).pdfMarieke Guy
 
Making your mind up: Formalising the evaluation of learning technologies 
Making your mind up: Formalising the evaluation of learning technologies Making your mind up: Formalising the evaluation of learning technologies 
Making your mind up: Formalising the evaluation of learning technologies Marieke Guy
 
Video assessment recipes
Video assessment recipesVideo assessment recipes
Video assessment recipesMarieke Guy
 
Alternative assessments
Alternative assessmentsAlternative assessments
Alternative assessmentsMarieke Guy
 
Connect more: Digital Culture forum - A thousand things, a thousand times
Connect more: Digital Culture forum - A thousand things, a thousand timesConnect more: Digital Culture forum - A thousand things, a thousand times
Connect more: Digital Culture forum - A thousand things, a thousand timesMarieke Guy
 
The Certainty of Uncertainty: Transnational Online Pivot in China
The Certainty of Uncertainty: Transnational Online Pivot in ChinaThe Certainty of Uncertainty: Transnational Online Pivot in China
The Certainty of Uncertainty: Transnational Online Pivot in ChinaMarieke Guy
 
The Transnational Online Pivot: A Case Study Exploring Online Delivery in China
The Transnational Online Pivot: A Case Study Exploring Online Delivery in ChinaThe Transnational Online Pivot: A Case Study Exploring Online Delivery in China
The Transnational Online Pivot: A Case Study Exploring Online Delivery in ChinaMarieke Guy
 

Plus de Marieke Guy (20)

Ways to ensure “buy in” from the academics in the transition to digitised ass...
Ways to ensure “buy in” from the academics in the transition to digitised ass...Ways to ensure “buy in” from the academics in the transition to digitised ass...
Ways to ensure “buy in” from the academics in the transition to digitised ass...
 
Assessing for a World Beyond Assessment
Assessing for a World Beyond AssessmentAssessing for a World Beyond Assessment
Assessing for a World Beyond Assessment
 
The blandness is its formulaic style’: insights to help understand the impact...
The blandness is its formulaic style’: insights to help understand the impact...The blandness is its formulaic style’: insights to help understand the impact...
The blandness is its formulaic style’: insights to help understand the impact...
 
Redesigning assessments for a world with artificial intelligence
Redesigning assessments for a world with artificial intelligenceRedesigning assessments for a world with artificial intelligence
Redesigning assessments for a world with artificial intelligence
 
Closing remarks: Assessment with Phill Dawson
Closing remarks: Assessment with Phill DawsonClosing remarks: Assessment with Phill Dawson
Closing remarks: Assessment with Phill Dawson
 
The UCL assessment journey
The UCL assessment journeyThe UCL assessment journey
The UCL assessment journey
 
The UCL lockdown browser pilot
The UCL lockdown browser pilotThe UCL lockdown browser pilot
The UCL lockdown browser pilot
 
Assessment in the time of change
Assessment in the time of changeAssessment in the time of change
Assessment in the time of change
 
Digital Assessment Team 2022 - a day in the life.pptx
Digital Assessment Team 2022 - a day in the life.pptxDigital Assessment Team 2022 - a day in the life.pptx
Digital Assessment Team 2022 - a day in the life.pptx
 
The Digital Assessment Marathon 
The Digital Assessment Marathon The Digital Assessment Marathon 
The Digital Assessment Marathon 
 
Inspired assessments
Inspired assessmentsInspired assessments
Inspired assessments
 
Designing alternative assessments
Designing alternative assessmentsDesigning alternative assessments
Designing alternative assessments
 
MCQs_ The joys of making your mind up.pdf
MCQs_ The joys of making your mind up.pdfMCQs_ The joys of making your mind up.pdf
MCQs_ The joys of making your mind up.pdf
 
Rubrics_ removing the glitch in the assessment matrix (1).pdf
Rubrics_ removing the glitch in the assessment matrix (1).pdfRubrics_ removing the glitch in the assessment matrix (1).pdf
Rubrics_ removing the glitch in the assessment matrix (1).pdf
 
Making your mind up: Formalising the evaluation of learning technologies 
Making your mind up: Formalising the evaluation of learning technologies Making your mind up: Formalising the evaluation of learning technologies 
Making your mind up: Formalising the evaluation of learning technologies 
 
Video assessment recipes
Video assessment recipesVideo assessment recipes
Video assessment recipes
 
Alternative assessments
Alternative assessmentsAlternative assessments
Alternative assessments
 
Connect more: Digital Culture forum - A thousand things, a thousand times
Connect more: Digital Culture forum - A thousand things, a thousand timesConnect more: Digital Culture forum - A thousand things, a thousand times
Connect more: Digital Culture forum - A thousand things, a thousand times
 
The Certainty of Uncertainty: Transnational Online Pivot in China
The Certainty of Uncertainty: Transnational Online Pivot in ChinaThe Certainty of Uncertainty: Transnational Online Pivot in China
The Certainty of Uncertainty: Transnational Online Pivot in China
 
The Transnational Online Pivot: A Case Study Exploring Online Delivery in China
The Transnational Online Pivot: A Case Study Exploring Online Delivery in ChinaThe Transnational Online Pivot: A Case Study Exploring Online Delivery in China
The Transnational Online Pivot: A Case Study Exploring Online Delivery in China
 

Dernier

SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingTeacherCyreneCayanan
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024Janet Corral
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...PsychoTech Services
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 

Dernier (20)

SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 

Introduction to Research Data Management for postgraduate students

  • 1. Introduction to Research Data Management For postgraduate Students University of Northampton, 20th February 2013 Marieke Guy DCC, University of Bath m.guy@ukoln.ac.uk Funded by: This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 2.5 UK: Scotland License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/2.5/scotland/ ; or, (b) send a letter to Creative Commons, 543 Howard Street, 5th Floor, San Francisco, California, 94105, USA.
  • 2. Today’s Talk… Will consider: • What are data • What is the research process • What is research data management and why does it matter • How you can start thinking about good research data management We hope you leave able to explain why research data management is important and what roles postgraduate students / researchers play.
  • 3. What are data? • The lowest level of abstraction from which information and knowledge are derived • Research data are collected, observed or created, for the purposes of analysis to produce and validate original research results • Both analogue and digital materials are 'data' • Digital data can be: • created in a digital form ("born digital") • converted to a digital form (digitised)
  • 4. What is the research process? Research Process •Research360
  • 5. What is research data management? • Data curation is “the active management and appraisal of data • Over the lifecycle of scholarly and scientific interest” • Data have importance as the evidential base • of scholarly conclusions • Curation is part of good research practice
  • 6. Why carry out RDM? Managing your data properly: Sharing data can • Can prevent data loss increase citation • Makes researching easier • Helps with validation of results and research integrity • Offers new research opportunities and collaborations • Is inline with the UON policy released in June 2011 More citations: 69% ↑ • Is important for UON’s reputation (Piwowar, 2007 in PLoS) • Is part of good research practice
  • 7. Data lifecycle 1. What data will you produce? 5. Preservation 1. 2. How will you organise the Create & Re-Use data? 3. Can you/others understand the data 4. 2. Publication & Deposit Active Use 4. What data will be deposited and where? 3. Documentation 5. Who will be interested in re-using the data?
  • 8. Conceptualise and plan Activities • define a research question and design your methodology • bid for funding (incl. data management and sharing plans) • plan data creation (capture methods, standards, formats) • data management plans! Roles PGR student, supervisory team, sponsors / funding bodies, IT, research governance, ethics panel Decisions made now have an impact on every other stage of the lifecycle, so it is worth getting things right from the start!
  • 9. 1. What data will you produce? • What type of data will 5. you produce? 1. Preservation Create & Re-Use • What types of file format? 4. • How easy is it to create or 2. Publication & Deposit Active Use reproduce? • Who owns it and is 3. Documentation responsible for it?
  • 10. Data can take many forms • Notebooks & lab books • Instrument measurements • Experimental observations •http://www.flickr.com/photos/charleswelch/3 597432481// • Still images, video & audio • Consent forms •http://www.google.co.uk/imgres? q=illumina+bgi&hl=en&client=firefox- a&hs=Jl2&rls=org.mozilla:en- • Text corpuses GB:official&biw=1366&bih • Models & software • Survey results & interview transcripts
  • 11. Data types Data Type Value Example Observational data Usually irreplaceable Sensor readings, captured around the time telemetry, survey results, of the event neuro-images Experimental data from Often reproducible but can Gene sequence, lab equipment be expensive chromatograms, toroid magnetic field readings Simulation data generated Model and metadata Climate models, economic from test models (inputs) more important models than output data. Large modules can take a lot of computer time to reproduce Derived or compiled data Reproducible Text and data mining, (but very expensive) compiled databases, 3D models
  • 12. Who owns or is responsible for your data? Ownership • Data ownership is complex, often defined on a case-by-case basis • May be dependent on individual contractual agreements • Contracts define needs of the University, staff, students, funders, collaborators Management • Contact the University of Northampton ethics department
  • 13. Responsibilities for data In practice - Everyone plays their part If you’re generating and using data, you should: • Comply with guidelines from your group, department, faculty, collaborators • Make sure your data is securely stored and backed up • Describe your data so that you/others can understand it in future If you’re managing a project, you should: • Be fully aware of funder, collaborator and publisher requirements • Ensure you have access to group data • Assess what should be published and/or archived • More info: http://www.data-archive.ac.uk/create-manage
  • 14. 2. How will you look after your data? 5. 1. Preservation & Re-Use Create • Is your data safe? • Is your data organised? 4. Publication 2. Active Use • Can you find your data? & Deposit 3. Documentation
  • 15. Storage and Security 3… 2… 1… Backup! at least 3 copies of a file on at least 2 different media with at least 1 offsite Test file recovery At set up time and on a regular basis Access Protect your hardware If sensitive use file encryption Keep passwords safe (e.g. Keypass) At least 2 people should have access to your data More info: http://www.data-archive.ac.uk/create-manage/storage
  • 16. Storage & Security – Back up options Media Advantages Disadvantages CDs or DVDs • Useful for quick restore in • Static capture of data the event of minor disaster • Not built to last • Vulnerable to theft • Physical loss of media External hard • Dynamic capture of data • Must store securely and drives, including • Useful for quick restore in remotely to original copy dropbox the event of minor disaster • Vulnerable to theft • Must use file encryption if sensitive Northampton • Resilient backup • Lack of offline access server • Must have a Northampton • TUNDRA2 will replace account Digital scans of • Can carry out using campus • Manipulation of page lab books printer content difficult
  • 17. Can you find your data? A Clear Directory Structure • Top level folder and substructure File Version Control • Discard obsolete versions if no longer needed after making backups • Manage using: File naming (see below), version control software (e.g. Git, Mercurial, SVN) File Naming Conventions • Record any naming conventions or abbreviations used e.g. [Experiment]_[Reagent]_[Instrument]_[YYYYMMDD].dat • Date/time stamp or use a separate ID (e.g. v1) for each version More info: http://www.jiscdigitalmedia.ac.uk/crossmedia/advice/ choosing-a-file-name/
  • 18. 3. Documenting data • Do you still understand your older work? 5. 1. Preservation Create & Re-Use • Is the file structure / naming understandable to others? 4. 2. Publication & Deposit Active Use • Which data will be kept? 3. • Which data can be Documentation discarded?
  • 19. Understanding your data • Students: • Will you be able to write up your methods at the end of your studies? • Project leads: • Will you be able to respond to reviewers comments? • Will you be able to find the information you need for final project reports? • Can you reproduce your work if you need to? • What information would someone else need to replicate your work?
  • 20. Understanding your data Do you know how you generated your data? • Equipment or software used • Experimental protocol • Other things included in (e.g.) a lab notebook • Can reference a published article, if it covers everything Are you able to give credit to external sources of data? • Include details of where the data are held, identified & accessed • Cite a publication describing the data • Cite the data itself e.g.
  • 21. Metadata • Contextual information for data is called metadata — literally data about data • Data repositories & archives require some generic metadata, e.g. author, title, publication date • For data to be useful, it will also need subject-specific metadata e.g. reagent names, experimental conditions, population demographic • Record contextual information in a text file (such as a ‘read me’ file) in the same directory as the data e.g. • codes for categorical survey responses • ‘999 indicates a dummy value in the data’ More info: http://www.data-archive.ac.uk/create-manage/document
  • 22. 3.What data will be deposited & where? • Are you expected to share your data? 5. 1. Preservation & Re-Use Create • Are you allowed to share your data? • Define the core data set 4. Publication 2. Active Use of the project & Deposit • Which data will be 3. Documentation included in your publication / thesis?
  • 23. Data Sharing – Why share your data? • Share with your future self – avoid repeating research! • Promote your research – get cited! • Enable new discoveries • Replication • Store your data in a reliable archive • Comply with funding requirements
  • 24. Requirements to share your data • Some journal publishers have a policy on data availability. • Most UK funders now expect research data to be made publically available – UON policy • Are you making any of your data available as supplementary information? • Is there sufficient information with the data so that it can be understood and reused? declaration data are a public good and Code of good research conduct should be openly available data should be preserved and accessible for 10 years + Funders’ data policies Common principles on data policy www.dcc.ac.uk/resources/policy-and- www.rcuk.ac.uk/research/Pages/ legal/funders-data-policies DataPolicy.aspx
  • 25. Restrictions on sharing your data Are there privacy requirements from the funders or commercial partners? e.g. personal data, high security data • You might not have the right to share data collected from other sources • It depends upon whether those data were licensed and have terms of use • Most databases are licensed and prohibit redistribution of data without permission • If you are uncertain as to your rights to disseminate data, check with the ethics department
  • 26. How to share your data • Deposit in a data repository eg. GenBank, UKDA • Data can be licensed • Culture of data sharing: can make available your data under a CC- BY or CC0 declaration to make this explicit • CC-BY license permits reuse but requires attribution • CC0 declaration is a waiver of copyright. • Note that laws about data vary in different countries. • You may have rights to first use or to commercial exploit data How to license research data: http://www.dcc.ac.uk/resources/how-guides/license-research-
  • 27. 5. Preservation and reuse • How long will your data 5. be reusable for? 1. Preservation Create & Re-Use • Do you need to prepare your data for long term archive? 4. 2. Publication Active Use & Deposit • Which data do you need to keep? 3. Documentation
  • 28. Data retention and archiving How permanent are the data? • Short term (e.g. 3-5 years) • Long term (e.g. 10 years) • Indefinite Should discarded data be destroyed? • Keep all versions? Just final version? First and last? What are the re-processing costs? • Keep only software and protocol/methodology information Are there tools/software needed to create, process or visualise the data? Archive these with your data
  • 29. Selection and appraisal  Make a start on selection and appraisal from as early a point as possible.  Plan for what you think you’ll need to keep to support your research findings. What is the minimum you’ll need to support your findings over time?  Know who you are keeping it the data for and what you want them to be able do with it (is for yourself only, or for other people too?). This may affect the way you keep it and what you keep.  Conversely, know what you need to dispose of. Destruction is often vital to ensure compliance with legal requirements.  Appraise for the here and now but with an eye to the future.  Think about resources required / available. These will affect you selection and appraisal decisions.  Look for relationships with other data sets in your archive/repository as part of the appraisal process (i.e. does the dataset augment another collection significantly?).  Some funders stipulate that you must identify whether the data exists already. This process might highlight additional datasets that your new research might augment significantly.
  • 30. File formats for long term access • Unencrypted • Uncompressed • Non-proprietary/patent-encumbered • Open, documented standard • Standard representation (ASCII, Unicode) Type Recommended Avoid for data sharing Tabular data CSV, TSV, SPSS portable Excel Text Plain text, HTML, RTF Word PDF/A only if layout matters Media Container: MP4, Ogg Quicktime Codec: Theora, Dirac, FLAC H264 Images TIFF, JPEG2000, PNG GIF, JPG Structured data XML, RDF RDBMS Further examples: http://www.data-archive.ac.uk/create-manage/format/formats-table
  • 31. Summary • Data management is important at all stages of a project • There are tools available to help you • Keep your data safe: Back up your data and test your back- ups • Keep your data organised • Find it – good formats and file names • Understand it - check documentation and metadata • Consider publishing your data so that you can get recognition for your work • Help is available at: http://researchsupporthub.northampton.ac.uk/contact-us/
  • 32. Thanks - any questions? Acknowledgements: Thanks to Research360, DCC staff, UK Data Archive, Mantra (University of Edinburgh) for slides