Research Data Management from a Software Engineering Perspective

A presentation on Research Data (and Software) Management given at the Research Software Engineering Community Event, Imperial College London.

  1. Research Data Management – From a Software Engineering Perspective
     Research Software Engineering Community Event, Imperial College London, 17th February 2016
     Sarah A. Stewart, Research Data Support Assistant, Imperial College London
     sarah.stewart@imperial.ac.uk / @Biostew
     http://orcid.org/0000-0002-9465-4042
  2. Noughts and Crosses: Software Preservation for Research
  3. Software’s Role in Research?
  4. Software = Data?
     • ‘Software is used to create, interpret, present, manipulate and manage data.’ (Software Sustainability Institute)
     • Data: ‘recorded factual material commonly retained by and accepted...as necessary to validate research findings’ (EPSRC)
     • Is the software necessary to validate research findings?
     • It is good practice to make software available with adequate documentation, so that others can more easily validate your research findings and access and re-use your research data.
     • Software = Data!
  5. Missing Research Data
     “In their parents' attic, in boxes in the garage, or stored on now-defunct floppy disks — these are just some of the inaccessible places in which scientists have admitted to keeping their old research data.”
     http://www.nature.com/news/scientists-losing-data-at-a-rapid-rate-1.14416
  6. Why is RDM in Software Engineering Important?
     • Software preservation! Digital preservation is a pressing concern!
     • Storage, retrieval, reconstruction and replay are all complicated by code libraries, dependencies and software engineering practice overall.
     • Preserving data is talked about more often, but the software that created it is also important, particularly if the data needs to be re-created or the software is required to interrogate or visualise the data.
     • Planning is essential for subsequent retrieval, reconstruction and replay.
     • Software is a digital object which is frequently the result of research and is often a vital prerequisite for the preservation of other digital objects.
     • Software preservation should be part of a broader preservation strategy: RDM.
  7. Imperial College RDM Policy
     The Provost’s Board has approved a research data management policy in order to assist principal investigators in meeting requirements from research funding bodies:
     “Imperial College London is committed to promoting the highest standards of academic research, including excellence in research data management. This includes a robust digital curation infrastructure that supports open data access and protects confidential data. The College acknowledges legal, ethical and commercial constraints on data sharing and the need to preserve the academic entitlement to publication.”
     “Principal Investigators have overall responsibility for the effective management of research data generated within or obtained for their research, including by their research groups. The Library and ICT will provide training, guidance and services to support PIs.”
     https://workspace.imperial.ac.uk/researchservices/Public/Imperial%20College%20RDM%20Policy.pdf
  8. Research Data Life Cycle
  9. Guidance on RDM website
     http://www.imperial.ac.uk/research-data-management
  10. Useful Tools for RDM in Software Engineering
  11. The RDM Team: Here to Help!
      We provide:
      • guidance to assist you to comply with Imperial College and funders’ policies regarding the management of research data
      • advice about tools and resources to help you manage your data throughout the lifecycle of your research
      • workshops and information sessions to keep you updated with the latest RDM practices
      You can contact us with any queries at: rdm-enquiries@imperial.ac.uk
  12. Any Questions? Thank you!
      For more information:
      Webpage: www.imperial.ac.uk/research-data-management
      E-mail: rdm-enquiries@imperial.ac.uk
      RDM Team – Ash Barnes, Sarah Stewart
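
Slide 6 notes that retrieval, reconstruction and replay depend on code libraries and dependencies. One possible starting point is to record the software environment next to the data it produced. The following is a minimal Python sketch and is not part of the original presentation: the function names (sha256_of, build_manifest, write_manifest) and the manifest.json file name are illustrative assumptions, and it captures only installed Python packages, not system libraries or compilers.

```python
import hashlib
import json
import platform
from importlib import metadata
from pathlib import Path


def sha256_of(path):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(data_files):
    """Describe data files and the Python environment that produced them."""
    packages = {}
    for dist in metadata.distributions():
        name = dist.metadata["Name"]
        if name:  # skip distributions with missing metadata
            packages[name] = dist.version
    return {
        "python_version": platform.python_version(),
        "platform": platform.platform(),
        "packages": dict(sorted(packages.items())),
        "data_files": {str(p): sha256_of(p) for p in data_files},
    }


def write_manifest(data_files, manifest_path="manifest.json"):
    """Write the manifest as JSON (hypothetical file name) and return it."""
    manifest = build_manifest(data_files)
    Path(manifest_path).write_text(json.dumps(manifest, indent=2))
    return manifest
```

Depositing a manifest like this with the data gives a later reader the package versions needed to attempt a reconstruction, and the checksums let them confirm the data files are unchanged.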