SlideShare une entreprise Scribd logo
1  sur  40
Télécharger pour lire hors ligne
Stuart Macdonald
RDM Service Coordinator
University of Edinburgh
stuart.macdonald@ed.ac.uk
RDMF 12 - Research Data and Repositories (and other systems), RDMF, University of Leicester, 19
November 2014
RDM PROGRAMME @ EDINBURGH
- Service Interoperation
Background
• EDINA and University Data Library (EDL) together are a
division within Information Services (IS) of the University of
Edinburgh.
• EDINA is a Jisc-designated centre for digital expertise &
online service delivery - http://edina.ac.uk/
• The Data Library assists Edinburgh University users in the
discovery, access, use and management of research datasets:
http://www.ed.ac.uk/is/data-library
• Research and Learning Services offer specific services to the
University with a focus on enabling research (OA publications,
research data, bibliometrics) and resource discovery for
learners (resource search systems).
University of Edinburgh RDM Policy
 University of Edinburgh is one
of the first Universities in UK
to adopt a policy for managing
research data:
http://www.ed.ac.uk/is/resear
ch-data-policy
 The policy was approved by
the University Court on 16 May
2011.
 It’s acknowledged that this is
an aspirational policy and that
implementation will take some
years.
An RDM Policy Implementation Committee was set up by the
VP of Knowledge Management charged with delivering
services that will meet RDM policy objectives:
• Membership from across Information Services
• Iterate with researchers to ensure services meet the needs of
researchers
The VP also established a Steering Committee led by
Prof. Peter Clarke with members of the Research Committee
from the 3 colleges, IS, and the Research Office (ERI).
Their role is to:
• Provide oversight to the activity of the Implementation
Committee
• Ensure services meet researcher requirements without harming
research competitiveness
Governance
Policy implementation - Research Data
Management Roadmap (2012-2015)
 Cross-divisional collaboration
 Services already in place:
o Data management planning
o Active working file space =
DataStore
o Data publication repository =
DataShare
 Services in development:
o Long term data archive =
DataVault
o Data Asset Register (DAR)
 RDM support: Awareness raising,
training & consultancy
http://edin.ac/1u3sKqy
Before research During research After research
Research Data Management Planning
Performed at the conceptual stage before research
data are created (what, where, who, how)
Customised instance of DCC’s DMPonline toolkit for
University of Edinburgh use:
• Funders and local (non-funder) DMP templates
• Institutional guidance (storage, services, support)
• Piloting customised school-level guidance - end of Jan. 2015
Tailored DMP assistance for researchers submitting
research proposals (F-2-F)
DataStore
 NAS facility to store data that are actively used in current research activities
 1.6PB storage initially. 0.5 TB (500GB) per researchers, PGR upwards
 Up to 0.25TB of each allocation can be used for “shared” group storage
 Cost of extra storage: £200 per TB per year = 1TB primary storage, 10 days
online file history, 60 days backup, DR copy
 Infrastructure in place. Allocation of space devolved to IT departments of
respective Schools overseen by Heads of IT from each College.
 De-allocation policy detailing responsibilities and storage costs for
‘orphaned data’ - pending approval by Steering Committee
DataShare
 Edinburgh DataShare is the University’s open access multi-disciplinary data
repository: http://datashare.is.ed.ac.uk
 Assists researchers disseminate their research, get credit for data
publication, and preserve their data for the long-term (DOI, licence, citation)
 Help researchers comply with funder requirements to preserve and share
your data and complies with Edinburgh’s RDM Policy
Data Vault
 Safe, private, store of data that is only accessible by the data creator or
their representative
 Secure storage: File security; Storage security; Additional security:
encryption
 Long term assurance
 Automatic versioning
 Front-end application requirements (authorisation, retention & deletion,
file structure, file transfter, integration
Data Asset Register (DAR)
 A catalogue of data assets produced by University of Edinburgh
researchers
 A key component of the University of Edinburgh RDM systems
 Will give researchers a single place to record the existence of the data
assets they produce for discovery, access, and re-use as appropriate.
 Paper proposing the adoption of PURE as the University’s DAR
provisionally accepted the RDM Steering Committee (Oct. 2014)
Systems do not live in isolation,
and become more powerful and
more likely to be used if they
are integrated with each other.
However, the last thing that we
want is to introduce further
systems that need to be fed
with duplicate information.
This means interoperation for
some or all of the components
Interoperation
RDM Support
Making the most of local support!
• RDM team will work with the Research Administrators in
each School.
• Academic Support Librarians (who represent each of the 22
Schools).
• IT staff in each School.
• ERI staff. They will be receiving RDM training.
• Each School’s Ethics Committee
• Queries can be sent to the IS Helpline who will direct them
as appropriate.
Awareness raising
• Introductory sessions on RDM
services and support for
research active and research
admin staff in Schools /
Institutes
• RDM website:
http://www.ed.ac.uk/is/data-
management
• RDM blog:
http://datablog.is.ed.ac.uk
• RDM wiki:
https://www.wiki.ed.ac.uk/displa
y/RDM/Research+Data+Manage
ment+Wiki
MANTRA
 MANTRA is an
internationally recognized
self-paced online training
course developed here for
PGR’s and early career
researchers in data
management issues.
 Data handling exercises
with open datasets in 4
analytical packages: R,
SPSS, NVivo, ArcGIS
 CC License & embed units
in VLE’s e.g. Moodle
http://datalib.edina.ac.uk/mantra
Training: Tailored Courses
 A range of training
programmes on research
data management (RDM) in
the form of workshops,
power sessions, seminars
and drop in sessions to
help researchers with
research data management
issues
 http://www.ed.ac.uk/schools-
departments/information-
services/research-support/data-
management/rdm-training
 Creating a data management
plan for your grant application
 Research Data Management
Programme at the University of
Edinburgh
 Good practice in Research
Data Management
 Handling data using SPSS
 Handling data with ArcGIS
http://edin.ac/1kRMPv3

Service Integration
• DataShare is a customised DSpace instance with a
selection of OAI-PMH compliant DCMI metadata fields for data
discovery through Google and other search engines
• Records are harvested by Data Citation Index
• SWORD API utilised for batch deposit of large and/or many
files from remote computers (‘Push using http’)
• Internal batch ingest of many/large files to circumvent 2.1GB
limit via the web interface (‘Pull via command line interface’)
• Use of checksums to determine that delivered object mirrors deposited
object
• Working with F1000Research to define a workflow for
depositors to get credit for data as research output by
publishing data articles - http://f1000research.com/
• Published new list of data journals for our depositors
DSpace GITHUB plugin* - allows software to be archived from
GitHub (or similar) source code repository into DataShare, which can
then be assigned a DOI to facilitate citation - using the SWORD deposit
protocol
DataSync - to allow sharing of data on DataStore:
• drop-box type functionality
• uses open source ‘ownCloud’ technology
• desktop and mobile machines synchronize files with the ownCloud
server
• file updates are pushed between all devices connected to a user's
account.
Research data deposit from RSpace Electronic Lab Notebook (ELN)
interface into DataShare (and Datastore & Data Vault) using SWORD
* http://blog.stuartlewis.com/2014/09/09/github-to-repository-deposit/
Integrating an electronic lab notebook
with a university research infrastructure:
Case Study with RSpace at the
University of Edinburgh
Rory Macneil
Research Space
rmacneil@researchspace.com
RDMF 12 - Research Data and
Repositories (and other systems), RDMF,
University of Leicester, 19 November 2014
Overview
● ELNs – where the demand coming from
● RSpace – origins and overview
● RSpace at Edinburgh
– Linking to files in Edinburgh DataStore
– Depositing content in Edinburgh DataShare
– Archiving in Edinburgh DataVault
● Platform for integration with other RDM
infrastructures
Who and what is driving demand for
ELNs?
● Researchers
– Utility and convenience of paper lab book + online capabilities
– On multiple devices
– File management/integration
● Groups/PIs
– Controlled sharing
– Collaboration
– Group management
– File management/integration
● Institutions: data librarians, research admins, IT, commercialisation offices
– Enterprise features: Scalable deployment, Single Sign On
– IP protection: audit trail, signing
– Publishing
– Archiving
– Repository integration
RSpace
● Conceived in response to Wisconsin RFP
and trial 2011 - 2012
● Developed with Wisconsin by Research
Space 2012 - 2013
Researcher experience
Sketching √
Image annotation √
Chemical structures √
Notebook √
Forms √
Templating √
Snippets √
PDF export √
Export to html √
File gallery √
Journal view √
Tablet friendly √
Clean design √
Performance √
Round trip editing √
Offline access √
PI/Lab support
Sharing √
Messaging √
Lab set up enabled √
Group management √
Inter-group collaboration √
Institutional requirements
(IT, data librarians, research admins,
commercialisation offices)
Single sign on √
Tiered admin √
Group set up √
IP support √
Export to XML √
Archiving √
Repository integration √
RSpace design advantages
● Easy data entry
● Easy and flexible data structuring
● Multiple ways of getting data out (and back
in)
– Export PDF
– Export to html
– Export Zip (XML)
– Re-import, preserving structure
– Archive (with metadata)
Business Model
● Free public cloud for labs and individuals
● Institutional deployments @$100/user/year
● Seamless movement of groups and data
between different RSpaces
Edinburgh
Public
Cloud
Stanford
La
b
La
b
La
b
RSpace at Edinburgh
– Linking to files in Edinburgh DataStore
– Depositing content in Edinburgh DataShare
– Archiving in Edinburgh DataVault
Linking to DataStore
“My plan for workflow would be generally to
deposit my data in DataStore either from the wet
lab instruments (gel photos, elisa data, etc, and
also possibly directly from an iPad) or from in silico
data analysis I’ve been doing, and then link to it
from within RSpace.”
Linking to DataStore
Experiment
Procedure
~~~~~~~~~~
~~~~~~~~~~
Results
~~~~~~~~~~
Results.xls
ELN UoE DataStore
Exposing DataStore File Roots
Linking to a DataStore File
Linking to a DataStore File
Linking to a DataStore File
DataStore integration: Designing for
the User
● Single Sign On via EASE
● Seamless file access
– Common Internet File Standard with user
credentials
● Multiple file roots per user
– Idiosyncratic organisation
– Sharing between users/groups
– Accessing external files (DataStore, Box,
Dropbox)
Exporting to DataShare
RSpace
UoE DataShare
Adding metadata
RSpace – DataShare integration:
Backend platform
–Edinburgh DataShare has three interfaces/APIs
●Web-UI
●Python
●SWORD (simple Java based web-service which supports repository
deposits)
–RSpace uses the SWORD Interface
–The SWORD server accepts a file for deposition if a METS
description file is provided
Four part METS implementation in
RSpace – DataShare integration
•RSpace uses the standard METS header
•DMD -- field definitions are based on Dublin Core
–Four required fields in Edinburgh DataShare -- contributor,
publisher, title, and data creator -- must be completed as part
of the deposit through RSpace
–Additional optional fields can be filled in later by DataShare
administrator:
●FUNDER, SPATIAL_COVERAGE, TIME_PERIOD, DATA_CREATOR, AVAILABLE_DATE,
DESCRIPTION_ABSTRACT, DESCRIPTION_TOC, LANGUAGE, RELATION_VERSION_OF,
RELATION_REFERENCED_BY, SUPERCEDES, RIGHT, SOURCE, SUBJECT_KEYWORDS,
SUBJECT_CLASSIFICATION, ALTERNATIVE_TITLE
•All zipped files and their mime-types (e.g. application/pdf,
text/html) are included
•A structure map describes the full structure and relationships
between the above three elements
RSpace – DataShare integration:
Workflow
•Front end trigger
–An RSpace user selects files/folders/notebooks to be
deposited from RSpace, and starts the deposit process
•Backend to support the user workflow
–RSpace extracts the associated data and resources from
its database and file-store
–These are turned into xml files
–METS is used to describe the zip file and each selected
file
–The xml, resource, and METS files are zipped into a zip file
for archiving
–The DSpace SWORD client deposits the zip file to
DataShare after an authentication and validation
Archiving in Edinburgh DataVault
● DataVault functionality/API not yet
specified
● Anticipate use of XML zip archive
● Many requirements to be determined
– e.g., searching, restoration
RSpace and Edinburgh RDM
RSpace
server
DataShareDataStore
DataVault User / Browser
RSpace and RDM: Other institutions
RSpace
server
DSpace/
other
repositories
File store
Archive
User / Browser
Inte
rfac
e
Inte
rfac
e
XML

Contenu connexe

Tendances

Edin casestudy-ou-rr-2011
Edin casestudy-ou-rr-2011Edin casestudy-ou-rr-2011
Edin casestudy-ou-rr-2011
Robin Rice
 

Tendances (20)

Data Curation Lifecycle Management at the University of Edinburgh
Data Curation Lifecycle Management at the University of EdinburghData Curation Lifecycle Management at the University of Edinburgh
Data Curation Lifecycle Management at the University of Edinburgh
 
A national repository (library?) service for learning materials
A national repository (library?) service for learning materialsA national repository (library?) service for learning materials
A national repository (library?) service for learning materials
 
IASSIST40: Data management & curation workshop
IASSIST40: Data management & curation workshopIASSIST40: Data management & curation workshop
IASSIST40: Data management & curation workshop
 
EDINA / Data Library Overview
EDINA / Data Library OverviewEDINA / Data Library Overview
EDINA / Data Library Overview
 
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShareScottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
 
Investigation into Private LOCKSS Networks
Investigation into Private LOCKSS NetworksInvestigation into Private LOCKSS Networks
Investigation into Private LOCKSS Networks
 
Edin casestudy-ou-rr-2011
Edin casestudy-ou-rr-2011Edin casestudy-ou-rr-2011
Edin casestudy-ou-rr-2011
 
Guiding users through data deposit
Guiding users through data depositGuiding users through data deposit
Guiding users through data deposit
 
Six Use Cases for Edinburgh DataShare
Six Use Cases for Edinburgh DataShareSix Use Cases for Edinburgh DataShare
Six Use Cases for Edinburgh DataShare
 
RDM Programme@Edinburgh
RDM Programme@EdinburghRDM Programme@Edinburgh
RDM Programme@Edinburgh
 
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
 
Introduction to RDM for trainee physicians
Introduction to RDM for trainee physiciansIntroduction to RDM for trainee physicians
Introduction to RDM for trainee physicians
 
National Activities and the UK LOCKSS Alliance
National Activities and the UK LOCKSS AllianceNational Activities and the UK LOCKSS Alliance
National Activities and the UK LOCKSS Alliance
 
Research Data Management (RDM) Initiatives at the University of Edinburgh
Research Data Management (RDM) Initiatives at the University of EdinburghResearch Data Management (RDM) Initiatives at the University of Edinburgh
Research Data Management (RDM) Initiatives at the University of Edinburgh
 
PEPRS: Recording The Extent Preserved
PEPRS: Recording The Extent PreservedPEPRS: Recording The Extent Preserved
PEPRS: Recording The Extent Preserved
 
Ukla uksg 2013_final
Ukla uksg 2013_finalUkla uksg 2013_final
Ukla uksg 2013_final
 
Access Control in ESDIN: Shibboleth
Access Control in ESDIN: ShibbolethAccess Control in ESDIN: Shibboleth
Access Control in ESDIN: Shibboleth
 
DSpace for Data Revisited
DSpace for Data RevisitedDSpace for Data Revisited
DSpace for Data Revisited
 
Data Library Services at the University of Edinburgh
Data Library Services at the University of EdinburghData Library Services at the University of Edinburgh
Data Library Services at the University of Edinburgh
 
The Keepers Registry: Enabling Trust in E-Journal Preservation
The Keepers Registry: Enabling Trust in E-Journal PreservationThe Keepers Registry: Enabling Trust in E-Journal Preservation
The Keepers Registry: Enabling Trust in E-Journal Preservation
 

En vedette

PECAN Phase 2: Pilot for Ensuring Continuity of Access via Nesli2
PECAN Phase 2: Pilot for Ensuring Continuity of Access via Nesli2 PECAN Phase 2: Pilot for Ensuring Continuity of Access via Nesli2
PECAN Phase 2: Pilot for Ensuring Continuity of Access via Nesli2
EDINA, University of Edinburgh
 
Credo reference promoting resources workshop edina slides
Credo reference promoting resources workshop   edina slidesCredo reference promoting resources workshop   edina slides
Credo reference promoting resources workshop edina slides
EDINA, University of Edinburgh
 

En vedette (20)

Research Data Management: Why is it important?
Research Data Management: Why is it  important?Research Data Management: Why is it  important?
Research Data Management: Why is it important?
 
Shibboleth Access Management Federations as an Organisational Model for SDI
Shibboleth Access Management Federations as an Organisational Model for SDIShibboleth Access Management Federations as an Organisational Model for SDI
Shibboleth Access Management Federations as an Organisational Model for SDI
 
How does it feel to participate in public?
How does it feel to participate in public?How does it feel to participate in public?
How does it feel to participate in public?
 
PECAN Phase 2: Pilot for Ensuring Continuity of Access via Nesli2
PECAN Phase 2: Pilot for Ensuring Continuity of Access via Nesli2 PECAN Phase 2: Pilot for Ensuring Continuity of Access via Nesli2
PECAN Phase 2: Pilot for Ensuring Continuity of Access via Nesli2
 
Collaboration to Curation: The High Rise Project meets Edinburgh DataShare
Collaboration to Curation: The High Rise Project meets Edinburgh DataShareCollaboration to Curation: The High Rise Project meets Edinburgh DataShare
Collaboration to Curation: The High Rise Project meets Edinburgh DataShare
 
Tweeting and Blogging for Academics
Tweeting and Blogging for AcademicsTweeting and Blogging for Academics
Tweeting and Blogging for Academics
 
Library roles in research data management
Library roles in research data management Library roles in research data management
Library roles in research data management
 
Credo reference promoting resources workshop edina slides
Credo reference promoting resources workshop   edina slidesCredo reference promoting resources workshop   edina slides
Credo reference promoting resources workshop edina slides
 
UK RepositoryNet+
UK RepositoryNet+UK RepositoryNet+
UK RepositoryNet+
 
JISC Managing Research Data: Liaison Librarian Training
JISC Managing Research Data: Liaison Librarian Training JISC Managing Research Data: Liaison Librarian Training
JISC Managing Research Data: Liaison Librarian Training
 
Preserving Our Digital Heritage: Community Action via UK LOCKSS
Preserving Our Digital Heritage: Community Action via UK LOCKSSPreserving Our Digital Heritage: Community Action via UK LOCKSS
Preserving Our Digital Heritage: Community Action via UK LOCKSS
 
CLOCKSS: Time and Places for Community-Based Archiving
CLOCKSS: Time and Places for Community-Based ArchivingCLOCKSS: Time and Places for Community-Based Archiving
CLOCKSS: Time and Places for Community-Based Archiving
 
The WSTIERIA Project – A Web of Services
The  WSTIERIA Project – A Web of ServicesThe  WSTIERIA Project – A Web of Services
The WSTIERIA Project – A Web of Services
 
Doing data in the social sciences and humanities: links to and from published...
Doing data in the social sciences and humanities: links to and from published...Doing data in the social sciences and humanities: links to and from published...
Doing data in the social sciences and humanities: links to and from published...
 
RJ Broker: Automating Delivery of Research Output to Repositories
RJ Broker: Automating Delivery of Research Output to RepositoriesRJ Broker: Automating Delivery of Research Output to Repositories
RJ Broker: Automating Delivery of Research Output to Repositories
 
COBWEB Project: Citizens Observatories Side Event
COBWEB Project: Citizens Observatories Side EventCOBWEB Project: Citizens Observatories Side Event
COBWEB Project: Citizens Observatories Side Event
 
Reference Rot: Threat and Remedy
Reference Rot: Threat and RemedyReference Rot: Threat and Remedy
Reference Rot: Threat and Remedy
 
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareResearch Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
 
UK RepositoryNet+ Mimas Workshop
UK RepositoryNet+ Mimas WorkshopUK RepositoryNet+ Mimas Workshop
UK RepositoryNet+ Mimas Workshop
 
Developing a Crowd Sourcing App
Developing a Crowd Sourcing AppDeveloping a Crowd Sourcing App
Developing a Crowd Sourcing App
 

Similaire à RDM Programme @ Edinburgh - Service Interoperation

RDM programme @ Edinburgh an institutional approach
RDM programme @ Edinburgh an institutional approachRDM programme @ Edinburgh an institutional approach
RDM programme @ Edinburgh an institutional approach
Jisc
 

Similaire à RDM Programme @ Edinburgh - Service Interoperation (20)

Service integration to Enhance RDM: RSpace electronic lab notebook at the Uni...
Service integration to Enhance RDM: RSpace electronic lab notebook at the Uni...Service integration to Enhance RDM: RSpace electronic lab notebook at the Uni...
Service integration to Enhance RDM: RSpace electronic lab notebook at the Uni...
 
RDM@Edinburgh_interoperation_IDCC2015
RDM@Edinburgh_interoperation_IDCC2015RDM@Edinburgh_interoperation_IDCC2015
RDM@Edinburgh_interoperation_IDCC2015
 
RDM programme @ Edinburgh an institutional approach
RDM programme @ Edinburgh an institutional approachRDM programme @ Edinburgh an institutional approach
RDM programme @ Edinburgh an institutional approach
 
Edinburgh DataShare - DSpace for Data
Edinburgh DataShare - DSpace for DataEdinburgh DataShare - DSpace for Data
Edinburgh DataShare - DSpace for Data
 
RDM & ELNs @ Edinburgh
RDM & ELNs @ EdinburghRDM & ELNs @ Edinburgh
RDM & ELNs @ Edinburgh
 
Making research data more resourceful - Jisc digital festival 2015
Making research data more resourceful - Jisc digital festival 2015Making research data more resourceful - Jisc digital festival 2015
Making research data more resourceful - Jisc digital festival 2015
 
Research Data Management Programme in Edinburgh
Research Data Management Programme in EdinburghResearch Data Management Programme in Edinburgh
Research Data Management Programme in Edinburgh
 
RDM @ UoE
RDM @ UoERDM @ UoE
RDM @ UoE
 
Ppls mvm2
Ppls mvm2Ppls mvm2
Ppls mvm2
 
RDM Programme at University of Edinburgh
RDM Programme at University of EdinburghRDM Programme at University of Edinburgh
RDM Programme at University of Edinburgh
 
RDM Priorities, Stakeholders, Practice
RDM Priorities, Stakeholders, PracticeRDM Priorities, Stakeholders, Practice
RDM Priorities, Stakeholders, Practice
 
The University of Edinburgh Research Data Management Service Suite
The University of Edinburgh Research Data Management Service SuiteThe University of Edinburgh Research Data Management Service Suite
The University of Edinburgh Research Data Management Service Suite
 
Looking After Your Data: RDM @ Edinburgh
Looking After Your Data: RDM @ EdinburghLooking After Your Data: RDM @ Edinburgh
Looking After Your Data: RDM @ Edinburgh
 
Integrating an electronic lab notebook with a data repository; American Chemi...
Integrating an electronic lab notebook with a data repository; American Chemi...Integrating an electronic lab notebook with a data repository; American Chemi...
Integrating an electronic lab notebook with a data repository; American Chemi...
 
Elns and repositories, American Chemical Society, Dallas, March 2014
Elns and repositories, American Chemical Society, Dallas, March 2014Elns and repositories, American Chemical Society, Dallas, March 2014
Elns and repositories, American Chemical Society, Dallas, March 2014
 
Introduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD StudentsIntroduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD Students
 
RDM @ Edinburgh - Arkivum Workshop
RDM @ Edinburgh - Arkivum WorkshopRDM @ Edinburgh - Arkivum Workshop
RDM @ Edinburgh - Arkivum Workshop
 
RDM Programme @ Edinburgh: Data Librarian Experience
RDM Programme @ Edinburgh: Data Librarian ExperienceRDM Programme @ Edinburgh: Data Librarian Experience
RDM Programme @ Edinburgh: Data Librarian Experience
 
RDM@Edinburgh
RDM@EdinburghRDM@Edinburgh
RDM@Edinburgh
 
RDM@Edinburgh
RDM@EdinburghRDM@Edinburgh
RDM@Edinburgh
 

Plus de EDINA, University of Edinburgh

Plus de EDINA, University of Edinburgh (20)

The Making of the English Landscape:
The Making of the English Landscape: The Making of the English Landscape:
The Making of the English Landscape:
 
Spatial Data, Spatial Humanities
Spatial Data, Spatial HumanitiesSpatial Data, Spatial Humanities
Spatial Data, Spatial Humanities
 
Land Cover Map 2015
Land Cover Map 2015Land Cover Map 2015
Land Cover Map 2015
 
We have the technology... We have the data... What next?
We have the technology... We have the data... What next?We have the technology... We have the data... What next?
We have the technology... We have the data... What next?
 
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
 
GeoForum EDINA report 2017
GeoForum EDINA report 2017GeoForum EDINA report 2017
GeoForum EDINA report 2017
 
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
 
Moray housemarch2017
Moray housemarch2017Moray housemarch2017
Moray housemarch2017
 
Uniof stirlingmarch2017secondary
Uniof stirlingmarch2017secondaryUniof stirlingmarch2017secondary
Uniof stirlingmarch2017secondary
 
Uniof glasgow jan2017_secondary
Uniof glasgow jan2017_secondaryUniof glasgow jan2017_secondary
Uniof glasgow jan2017_secondary
 
Managing your Digital Footprint : Taking control of the metadata and tracks a...
Managing your Digital Footprint : Taking control of the metadata and tracks a...Managing your Digital Footprint : Taking control of the metadata and tracks a...
Managing your Digital Footprint : Taking control of the metadata and tracks a...
 
Social media and blogging to develop and communicate research in the arts and...
Social media and blogging to develop and communicate research in the arts and...Social media and blogging to develop and communicate research in the arts and...
Social media and blogging to develop and communicate research in the arts and...
 
Enhancing your research impact through social media - Nicola Osborne
Enhancing your research impact through social media - Nicola OsborneEnhancing your research impact through social media - Nicola Osborne
Enhancing your research impact through social media - Nicola Osborne
 
Social Media in Marketing in Support of Your Personal Brand - Nicola Osborne
Social Media in Marketing in Support of Your Personal Brand - Nicola OsborneSocial Media in Marketing in Support of Your Personal Brand - Nicola Osborne
Social Media in Marketing in Support of Your Personal Brand - Nicola Osborne
 
Best Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
Best Practice for Social Media in Teaching & Learning Contexts - Nicola OsborneBest Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
Best Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
 
SCURL and SUNCAT serials holdings comparison service
SCURL and SUNCAT serials holdings comparison serviceSCURL and SUNCAT serials holdings comparison service
SCURL and SUNCAT serials holdings comparison service
 
Big data in Digimap
Big data in DigimapBig data in Digimap
Big data in Digimap
 
Introduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data servicesIntroduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data services
 
Digimap for Schools: Introduction to an ICT based cross curricular resource f...
Digimap for Schools: Introduction to an ICT based cross curricular resource f...Digimap for Schools: Introduction to an ICT based cross curricular resource f...
Digimap for Schools: Introduction to an ICT based cross curricular resource f...
 
Digimap Update - Geoforum 2016 - Guy McGarva
Digimap Update - Geoforum 2016 - Guy McGarvaDigimap Update - Geoforum 2016 - Guy McGarva
Digimap Update - Geoforum 2016 - Guy McGarva
 

Dernier

Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
MateoGardella
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 

Dernier (20)

Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 

RDM Programme @ Edinburgh - Service Interoperation

  • 1. Stuart Macdonald RDM Service Coordinator University of Edinburgh stuart.macdonald@ed.ac.uk RDMF 12 - Research Data and Repositories (and other systems), RDMF, University of Leicester, 19 November 2014 RDM PROGRAMME @ EDINBURGH - Service Interoperation
  • 2. Background • EDINA and University Data Library (EDL) together are a division within Information Services (IS) of the University of Edinburgh. • EDINA is a Jisc-designated centre for digital expertise & online service delivery - http://edina.ac.uk/ • The Data Library assists Edinburgh University users in the discovery, access, use and management of research datasets: http://www.ed.ac.uk/is/data-library • Research and Learning Services offer specific services to the University with a focus on enabling research (OA publications, research data, bibliometrics) and resource discovery for learners (resource search systems).
  • 3. University of Edinburgh RDM Policy  University of Edinburgh is one of the first Universities in UK to adopt a policy for managing research data: http://www.ed.ac.uk/is/resear ch-data-policy  The policy was approved by the University Court on 16 May 2011.  It’s acknowledged that this is an aspirational policy and that implementation will take some years.
  • 4. An RDM Policy Implementation Committee was set up by the VP of Knowledge Management charged with delivering services that will meet RDM policy objectives: • Membership from across Information Services • Iterate with researchers to ensure services meet the needs of researchers The VP also established a Steering Committee led by Prof. Peter Clarke with members of the Research Committee from the 3 colleges, IS, and the Research Office (ERI). Their role is to: • Provide oversight to the activity of the Implementation Committee • Ensure services meet researcher requirements without harming research competitiveness Governance
  • 5. Policy implementation - Research Data Management Roadmap (2012-2015)  Cross-divisional collaboration  Services already in place: o Data management planning o Active working file space = DataStore o Data publication repository = DataShare  Services in development: o Long term data archive = DataVault o Data Asset Register (DAR)  RDM support: Awareness raising, training & consultancy http://edin.ac/1u3sKqy Before research During research After research
  • 6. Research Data Management Planning Performed at the conceptual stage before research data are created (what, where, who, how) Customised instance of DCC’s DMPonline toolkit for University of Edinburgh use: • Funders and local (non-funder) DMP templates • Institutional guidance (storage, services, support) • Piloting customised school-level guidance - end of Jan. 2015 Tailored DMP assistance for researchers submitting research proposals (F-2-F)
  • 7. DataStore  NAS facility to store data that are actively used in current research activities  1.6PB storage initially. 0.5 TB (500GB) per researchers, PGR upwards  Up to 0.25TB of each allocation can be used for “shared” group storage  Cost of extra storage: £200 per TB per year = 1TB primary storage, 10 days online file history, 60 days backup, DR copy  Infrastructure in place. Allocation of space devolved to IT departments of respective Schools overseen by Heads of IT from each College.  De-allocation policy detailing responsibilities and storage costs for ‘orphaned data’ - pending approval by Steering Committee DataShare  Edinburgh DataShare is the University’s open access multi-disciplinary data repository: http://datashare.is.ed.ac.uk  Assists researchers disseminate their research, get credit for data publication, and preserve their data for the long-term (DOI, licence, citation)  Help researchers comply with funder requirements to preserve and share your data and complies with Edinburgh’s RDM Policy
  • 8. Data Vault  Safe, private, store of data that is only accessible by the data creator or their representative  Secure storage: File security; Storage security; Additional security: encryption  Long term assurance  Automatic versioning  Front-end application requirements (authorisation, retention & deletion, file structure, file transfter, integration Data Asset Register (DAR)  A catalogue of data assets produced by University of Edinburgh researchers  A key component of the University of Edinburgh RDM systems  Will give researchers a single place to record the existence of the data assets they produce for discovery, access, and re-use as appropriate.  Paper proposing the adoption of PURE as the University’s DAR provisionally accepted the RDM Steering Committee (Oct. 2014)
  • 9. Systems do not live in isolation, and become more powerful and more likely to be used if they are integrated with each other. However, the last thing that we want is to introduce further systems that need to be fed with duplicate information. This means interoperation for some or all of the components Interoperation
  • 10. RDM Support Making the most of local support! • RDM team will work with the Research Administrators in each School. • Academic Support Librarians (who represent each of the 22 Schools). • IT staff in each School. • ERI staff. They will be receiving RDM training. • Each School’s Ethics Committee • Queries can be sent to the IS Helpline who will direct them as appropriate.
  • 11. Awareness raising • Introductory sessions on RDM services and support for research active and research admin staff in Schools / Institutes • RDM website: http://www.ed.ac.uk/is/data- management • RDM blog: http://datablog.is.ed.ac.uk • RDM wiki: https://www.wiki.ed.ac.uk/displa y/RDM/Research+Data+Manage ment+Wiki
  • 12. MANTRA  MANTRA is an internationally recognized self-paced online training course developed here for PGR’s and early career researchers in data management issues.  Data handling exercises with open datasets in 4 analytical packages: R, SPSS, NVivo, ArcGIS  CC License & embed units in VLE’s e.g. Moodle http://datalib.edina.ac.uk/mantra
  • 13. Training: Tailored Courses  A range of training programmes on research data management (RDM) in the form of workshops, power sessions, seminars and drop in sessions to help researchers with research data management issues  http://www.ed.ac.uk/schools- departments/information- services/research-support/data- management/rdm-training  Creating a data management plan for your grant application  Research Data Management Programme at the University of Edinburgh  Good practice in Research Data Management  Handling data using SPSS  Handling data with ArcGIS http://edin.ac/1kRMPv3 
  • 14. Service Integration • DataShare is a customised DSpace instance with a selection of OAI-PMH compliant DCMI metadata fields for data discovery through Google and other search engines • Records are harvested by Data Citation Index • SWORD API utilised for batch deposit of large and/or many files from remote computers (‘Push using http’) • Internal batch ingest of many/large files to circumvent 2.1GB limit via the web interface (‘Pull via command line interface’) • Use of checksums to determine that delivered object mirrors deposited object • Working with F1000Research to define a workflow for depositors to get credit for data as research output by publishing data articles - http://f1000research.com/ • Published new list of data journals for our depositors
  • 15. DSpace GITHUB plugin* - allows software to be archived from GitHub (or similar) source code repository into DataShare, which can then be assigned a DOI to facilitate citation - using the SWORD deposit protocol DataSync - to allow sharing of data on DataStore: • drop-box type functionality • uses open source ‘ownCloud’ technology • desktop and mobile machines synchronize files with the ownCloud server • file updates are pushed between all devices connected to a user's account. Research data deposit from RSpace Electronic Lab Notebook (ELN) interface into DataShare (and Datastore & Data Vault) using SWORD * http://blog.stuartlewis.com/2014/09/09/github-to-repository-deposit/
  • 16. Integrating an electronic lab notebook with a university research infrastructure: Case Study with RSpace at the University of Edinburgh Rory Macneil Research Space rmacneil@researchspace.com RDMF 12 - Research Data and Repositories (and other systems), RDMF, University of Leicester, 19 November 2014
  • 17. Overview ● ELNs – where the demand coming from ● RSpace – origins and overview ● RSpace at Edinburgh – Linking to files in Edinburgh DataStore – Depositing content in Edinburgh DataShare – Archiving in Edinburgh DataVault ● Platform for integration with other RDM infrastructures
  • 18. Who and what is driving demand for ELNs? ● Researchers – Utility and convenience of paper lab book + online capabilities – On multiple devices – File management/integration ● Groups/PIs – Controlled sharing – Collaboration – Group management – File management/integration ● Institutions: data librarians, research admins, IT, commercialisation offices – Enterprise features: Scalable deployment, Single Sign On – IP protection: audit trail, signing – Publishing – Archiving – Repository integration
  • 19. RSpace ● Conceived in response to Wisconsin RFP and trial 2011 - 2012 ● Developed with Wisconsin by Research Space 2012 - 2013
  • 20. Researcher experience Sketching √ Image annotation √ Chemical structures √ Notebook √ Forms √ Templating √ Snippets √ PDF export √ Export to html √ File gallery √ Journal view √ Tablet friendly √ Clean design √ Performance √ Round trip editing √ Offline access √
  • 21. PI/Lab support Sharing √ Messaging √ Lab set up enabled √ Group management √ Inter-group collaboration √
  • 22. Institutional requirements (IT, data librarians, research admins, commercialisation offices) Single sign on √ Tiered admin √ Group set up √ IP support √ Export to XML √ Archiving √ Repository integration √
  • 23. RSpace design advantages ● Easy data entry ● Easy and flexible data structuring ● Multiple ways of getting data out (and back in) – Export PDF – Export to html – Export Zip (XML) – Re-import, preserving structure – Archive (with metadata)
  • 24. Business Model ● Free public cloud for labs and individuals ● Institutional deployments @$100/user/year ● Seamless movement of groups and data between different RSpaces Edinburgh Public Cloud Stanford La b La b La b
  • 25. RSpace at Edinburgh – Linking to files in Edinburgh DataStore – Depositing content in Edinburgh DataShare – Archiving in Edinburgh DataVault
  • 26. Linking to DataStore “My plan for workflow would be generally to deposit my data in DataStore either from the wet lab instruments (gel photos, elisa data, etc, and also possibly directly from an iPad) or from in silico data analysis I’ve been doing, and then link to it from within RSpace.”
  • 29. Linking to a DataStore File
  • 30. Linking to a DataStore File
  • 31. Linking to a DataStore File
  • 32. DataStore integration: Designing for the User ● Single Sign On via EASE ● Seamless file access – Common Internet File Standard with user credentials ● Multiple file roots per user – Idiosyncratic organisation – Sharing between users/groups – Accessing external files (DataStore, Box, Dropbox)
  • 35. RSpace – DataShare integration: Backend platform –Edinburgh DataShare has three interfaces/APIs ●Web-UI ●Python ●SWORD (simple Java based web-service which supports repository deposits) –RSpace uses the SWORD Interface –The SWORD server accepts a file for deposition if a METS description file is provided
  • 36. Four part METS implementation in RSpace – DataShare integration •RSpace uses the standard METS header •DMD -- field definitions are based on Dublin Core –Four required fields in Edinburgh DataShare -- contributor, publisher, title, and data creator -- must be completed as part of the deposit through RSpace –Additional optional fields can be filled in later by DataShare administrator: ●FUNDER, SPATIAL_COVERAGE, TIME_PERIOD, DATA_CREATOR, AVAILABLE_DATE, DESCRIPTION_ABSTRACT, DESCRIPTION_TOC, LANGUAGE, RELATION_VERSION_OF, RELATION_REFERENCED_BY, SUPERCEDES, RIGHT, SOURCE, SUBJECT_KEYWORDS, SUBJECT_CLASSIFICATION, ALTERNATIVE_TITLE •All zipped files and their mime-types (e.g. application/pdf, text/html) are included •A structure map describes the full structure and relationships between the above three elements
  • 37. RSpace – DataShare integration: Workflow •Front end trigger –An RSpace user selects files/folders/notebooks to be deposited from RSpace, and starts the deposit process •Backend to support the user workflow –RSpace extracts the associated data and resources from its database and file-store –These are turned into xml files –METS is used to describe the zip file and each selected file –The xml, resource, and METS files are zipped into a zip file for archiving –The DSpace SWORD client deposits the zip file to DataShare after an authentication and validation
  • 38. Archiving in Edinburgh DataVault ● DataVault functionality/API not yet specified ● Anticipate use of XML zip archive ● Many requirements to be determined – e.g., searching, restoration
  • 39. RSpace and Edinburgh RDM RSpace server DataShareDataStore DataVault User / Browser
  • 40. RSpace and RDM: Other institutions RSpace server DSpace/ other repositories File store Archive User / Browser Inte rfac e Inte rfac e XML