SlideShare une entreprise Scribd logo
1  sur  17
Télécharger pour lire hors ligne
Thursday 10 May 2012
                                                 Eduserv Symposium: Big Data




JISC and the Big (Research) Data Challenge

Simon Hodson
JISC Programme Manager, Managing Research Data
Why is managing research data important?



JISC considers it a priority to support universities in improving the way
   research data is managed and, where appropriate, made available for
                                   reuse.
Research funder policies, legislative frameworks, good practice, open data
agenda
 – The outputs of publicly funded research should be publicly available.
 – The evidence underpinning research findings should be available for
   validation
Good data management is good for research
 – More efficient research process, avoidance of data loss, benefits of data reuse

Alignment with university missions.
 – Universities want to provide excellent research infrastructure.
 – Universities want to have better oversight of research outputs.
Estimated Research Data Requirements


Two Russell Group Universities
  Estimated current data holdings of c.2PB (managed and unmanaged)
  Currently provide 800TB/300TB in a central storage facility, not all of which is
  used (but will be full in 12-18 months)…
  Significant amount of data in temporary storage, external drives etc…
  ‘the more groups we go to talk to, the more we're hearing of significant
  data holdings on external hard drives and small RAID systems’
1994 Group University
  No central research data provision.
  Faculties (medicine, business, humanities) have 20-30TB each.
  Engineering currently has 170TB faculty system, urgent need to expand.
  But… one group, recently interviewed, currently has 250TB, only half in
  ‘managed storage’; will reach PB levels in the next few years.
DUDs
  The data centre
under the desk (or
 in a back pack) is
   not adequate.
Why manage research data?




Not just about storage or avoiding data loss…!
It’s about knowing what to keep and what to throw away…
Important to extract maximum return on investment from publicly
funded research.
Access to underlying data is essential for verification and therefore
research integrity.
Opportunities to extract more knowledge from existing data, new
analysis.
It’s about making the most out of data created!
Making Data Meaningful and Reusable
JISC and Research Data




1. Understanding the problem (pre-2007-2009)
2. Prototyping solutions (2009-11)
3. Hardening solutions and building institutional capacity (2011-13)
4. Developing elements of national infrastructure (2013+)
1: Understanding the Problem


Key JISC reports:
    Dealing with Data:
    http://www.ukoln.ac.uk/ukoln/staff/
    e.j.lyon/reports/dealing_with_data_
    report-final.pdf
    Keeping Research Data Safe:
    http://www.jisc.ac.uk/media/docum
    ents/publications/keepingresearch
    datasafe0408.pdf
    Skills, Role, Career Structure of
    Data Scientists and Curators:
    http://www.jisc.ac.uk/media/docum
    ents/programmes/digitalrepositorie
    s/dataskillscareersfinalreport.pdf
Other:
    UKRDS Scoping Study:
    http://www.ukrds.ac.uk/resources/
Prototyping Solutions:
                                         First MRD Programme, 2009-11



RDM Infrastructure (guidance/support, systems)



RDM Planning (DMPs, best practice, disciplinary challenges)



               RDM Training (targeted at disciplinary needs)



               Challenges of data citation and publication



First JISC MRD Programme, 2009-11: http://bit.ly/jiscmrd2009-11
JISC MRD Outputs Page: http://bit.ly/jiscmrd2009-11-outputs
Building Institutional Capacity:
                                              First MRD Programme, 2009-11


RDM Infrastructure (policy, guidance/support, systems)
17 large projects




RDM Planning (DMPs, best practice, disciplinary challenges)



                     RDM Training (disciplines and libraries/research
                     support)

                     Innovative data publication


Second JISC MRD Programme, 2009-11: http://bit.ly/jiscmrd2009-11
Projects shortly to be announced for research data publication and developing RDM
training materials: http://bit.ly/jiscmrd-2012-Call
A holistic approach…



                          Leadership and
                        Policy Development



Publication, Citation
                                             Guidance and
  and Discovery
                                               Training
   Mechanisms




                                        Support for Data
    RDM Systems and
                                         Management
      Infrastructure
                                           Planning
How to develop RDM services
                                         Why develop services?
                                         Roles and responsibilities
      In development!                    Process of service development
                                         The components / building blocks
                                         •      Policy
                                         •      Data Management
                                         Planning
                                         •      Storage
                                         •      Data registry..... Examples and
                                                                  case studies to
                                         Getting started           develop into
                                                                      toolkit
Slide Credit: Sarah Jones and Martin Donnelly, DCC
Next steps? Elements of a national infrastructure




Journals are increasingly implementing policies requiring availability
of underlying data.
   Registry of Journal Data Policies to help researchers and research
   administrators understand the implications and changing landscape.
Universities are developing catalogues of research data holdings.
   National registry of research data to facilitate discovery, reuse; better
   understanding of impact and research landscape.
Thank You!




First JISC MRD Programme, 2009-11: http://bit.ly/jiscmrd2009-11
JISC MRD Outputs Page: http://bit.ly/jiscmrd2009-11-outputs
Second JISC MRD Programme, 2011-13: http://bit.ly/jiscmrd2009-11
Programme Blog: http://researchdata.jiscinvolve.org/
MRD Project Blogs: http://tiny.cc/MRDblogs
Twitter: #jiscmrd
E-mail: s.hodson@jisc.ac.uk
Acknowledgements for slides, content: Carol Goble, Liz Lyon, Peter Murray-
Rust, David Shotton, Martin Donnelly, Sarah Jones.
From prototype to platform…




 DataFlow Project: http://www.dataflow.ox.ac.uk/




UMF Programme SaaS for RDM Projects: http://www.jisc.ac.uk/whatwedo/programmes/umf.aspx
The JISC UMF DataFlow Project



     Researchers                          DataStage is a file management system
                                          A DataStage data package consists of
                                          selected data files accompanied by an
                                          RDF metadata manifest, with a SWORD
                                          v2 wrapper


    DataStage file system

                                                         Researchers, other users


                                SWORD deposit

 DataBank is a generic repository, and
 can be used to store things other that
 research datasets, for example data
 management plans (DMPs)                                 DataBank repository

Contenu connexe

Tendances

DMP health sciences
DMP health sciencesDMP health sciences
DMP health sciencesSarah Jones
 
RDM for librarians
RDM for librariansRDM for librarians
RDM for librariansSarah Jones
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Historic Environment Scotland
 
RDM at Northampton EMALINK 130313 v3
RDM at Northampton EMALINK 130313 v3RDM at Northampton EMALINK 130313 v3
RDM at Northampton EMALINK 130313 v3mjpickt
 
Facing the data challenge: Developing data policy & services
Facing the data challenge: Developing data policy & servicesFacing the data challenge: Developing data policy & services
Facing the data challenge: Developing data policy & servicesMarieke Guy
 
RDM policy and recovering costs
RDM policy and recovering costsRDM policy and recovering costs
RDM policy and recovering costsSarah Jones
 
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013SALCTG
 
Data Management Planning at the DCC
Data Management Planning at the DCCData Management Planning at the DCC
Data Management Planning at the DCCMartin Donnelly
 
Data Management Planning at Edinburgh
Data Management Planning at EdinburghData Management Planning at Edinburgh
Data Management Planning at EdinburghSarah Jones
 
Basics of Research Data Management
Basics of Research Data ManagementBasics of Research Data Management
Basics of Research Data ManagementOpenAIRE
 
Data Curation Models JHU Barbara Pralle RDAP12
Data Curation Models JHU Barbara Pralle RDAP12Data Curation Models JHU Barbara Pralle RDAP12
Data Curation Models JHU Barbara Pralle RDAP12ASIS&T
 
Introduction to Research Data Management at Lancaster University
Introduction to Research Data Management at Lancaster UniversityIntroduction to Research Data Management at Lancaster University
Introduction to Research Data Management at Lancaster UniversityLancaster University Library
 
Iassist 2012 dms public version
Iassist 2012 dms public versionIassist 2012 dms public version
Iassist 2012 dms public versionjhudms
 

Tendances (20)

What is-rdm
What is-rdmWhat is-rdm
What is-rdm
 
DMP health sciences
DMP health sciencesDMP health sciences
DMP health sciences
 
RDM for librarians
RDM for librariansRDM for librarians
RDM for librarians
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...
 
RDM at Northampton EMALINK 130313 v3
RDM at Northampton EMALINK 130313 v3RDM at Northampton EMALINK 130313 v3
RDM at Northampton EMALINK 130313 v3
 
Facing the data challenge: Developing data policy & services
Facing the data challenge: Developing data policy & servicesFacing the data challenge: Developing data policy & services
Facing the data challenge: Developing data policy & services
 
RDM policy and recovering costs
RDM policy and recovering costsRDM policy and recovering costs
RDM policy and recovering costs
 
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
 
Supporting-DMPs
Supporting-DMPsSupporting-DMPs
Supporting-DMPs
 
Data Management Planning at the DCC
Data Management Planning at the DCCData Management Planning at the DCC
Data Management Planning at the DCC
 
Introduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD StudentsIntroduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD Students
 
Research data life cycle
Research data life cycleResearch data life cycle
Research data life cycle
 
Introduction to Research Data Management
Introduction to Research Data ManagementIntroduction to Research Data Management
Introduction to Research Data Management
 
Data Management Planning at Edinburgh
Data Management Planning at EdinburghData Management Planning at Edinburgh
Data Management Planning at Edinburgh
 
Basics of Research Data Management
Basics of Research Data ManagementBasics of Research Data Management
Basics of Research Data Management
 
RDM@Edinburgh
RDM@EdinburghRDM@Edinburgh
RDM@Edinburgh
 
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
 
Data Curation Models JHU Barbara Pralle RDAP12
Data Curation Models JHU Barbara Pralle RDAP12Data Curation Models JHU Barbara Pralle RDAP12
Data Curation Models JHU Barbara Pralle RDAP12
 
Introduction to Research Data Management at Lancaster University
Introduction to Research Data Management at Lancaster UniversityIntroduction to Research Data Management at Lancaster University
Introduction to Research Data Management at Lancaster University
 
Iassist 2012 dms public version
Iassist 2012 dms public versionIassist 2012 dms public version
Iassist 2012 dms public version
 

En vedette

"A Toolkit for Digital Research" - CNI 2013
"A Toolkit for Digital Research" - CNI 2013"A Toolkit for Digital Research" - CNI 2013
"A Toolkit for Digital Research" - CNI 2013Kaitlin Thaney
 
Escuela Hotelera Montemira - Charla Social Media
Escuela Hotelera Montemira - Charla Social MediaEscuela Hotelera Montemira - Charla Social Media
Escuela Hotelera Montemira - Charla Social MediaReto Leder
 
RDM in higher education
RDM in higher educationRDM in higher education
RDM in higher educationSarah Jones
 
KU Memorial Unions Plansbook
KU Memorial Unions PlansbookKU Memorial Unions Plansbook
KU Memorial Unions PlansbookKaraSchwerdt
 
Research data management at the DCC
Research data management at the DCCResearch data management at the DCC
Research data management at the DCCSarah Jones
 
Research data challenge presentation
Research data challenge presentationResearch data challenge presentation
Research data challenge presentationJisc
 

En vedette (7)

"A Toolkit for Digital Research" - CNI 2013
"A Toolkit for Digital Research" - CNI 2013"A Toolkit for Digital Research" - CNI 2013
"A Toolkit for Digital Research" - CNI 2013
 
Escuela Hotelera Montemira - Charla Social Media
Escuela Hotelera Montemira - Charla Social MediaEscuela Hotelera Montemira - Charla Social Media
Escuela Hotelera Montemira - Charla Social Media
 
RDM in higher education
RDM in higher educationRDM in higher education
RDM in higher education
 
KU Memorial Unions Plansbook
KU Memorial Unions PlansbookKU Memorial Unions Plansbook
KU Memorial Unions Plansbook
 
Research data management at the DCC
Research data management at the DCCResearch data management at the DCC
Research data management at the DCC
 
Cni2012
Cni2012Cni2012
Cni2012
 
Research data challenge presentation
Research data challenge presentationResearch data challenge presentation
Research data challenge presentation
 

Similaire à Simon Hodson

Managing and Sharing Research Data
Managing and Sharing Research DataManaging and Sharing Research Data
Managing and Sharing Research DataMartin Donnelly
 
Research data management: from policy to practice with DMP Online
Research data management: from policy to practice with DMP OnlineResearch data management: from policy to practice with DMP Online
Research data management: from policy to practice with DMP OnlineMartin Donnelly
 
Implementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataImplementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataMartin Hamilton
 
Supporting Research Data Management at the University of Stirling
Supporting Research Data Management at the University of StirlingSupporting Research Data Management at the University of Stirling
Supporting Research Data Management at the University of StirlingLisa Haddow
 
Research data management and the Digital Curation Centre
Research data management and the Digital Curation CentreResearch data management and the Digital Curation Centre
Research data management and the Digital Curation CentreMartin Donnelly
 
What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...heila1
 
Data Management Plans: a gentle introduction
Data Management Plans: a gentle introductionData Management Plans: a gentle introduction
Data Management Plans: a gentle introductionMartin Donnelly
 
Challenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceChallenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceGarethKnight
 
Meeting the NSF DMP Requirement: March 7, 2012
Meeting the NSF DMP Requirement: March 7, 2012Meeting the NSF DMP Requirement: March 7, 2012
Meeting the NSF DMP Requirement: March 7, 2012IUPUI
 
Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012IUPUI
 
RDM and DMP intro
RDM and DMP introRDM and DMP intro
RDM and DMP introSarah Jones
 
Dorothy Byatt JIBS-RLUK event July 2012
Dorothy Byatt JIBS-RLUK event July 2012Dorothy Byatt JIBS-RLUK event July 2012
Dorothy Byatt JIBS-RLUK event July 2012sherif user group
 
What the DCC Can do for you
What the DCC Can do for youWhat the DCC Can do for you
What the DCC Can do for youMarieke Guy
 
Pecha Kucha at Repofringe 2010
Pecha Kucha at Repofringe 2010Pecha Kucha at Repofringe 2010
Pecha Kucha at Repofringe 2010Robin Rice
 
Data Management Lab: Session 1 Slides
Data Management Lab: Session 1 SlidesData Management Lab: Session 1 Slides
Data Management Lab: Session 1 SlidesIUPUI
 

Similaire à Simon Hodson (20)

Managing and Sharing Research Data
Managing and Sharing Research DataManaging and Sharing Research Data
Managing and Sharing Research Data
 
Research data management: from policy to practice with DMP Online
Research data management: from policy to practice with DMP OnlineResearch data management: from policy to practice with DMP Online
Research data management: from policy to practice with DMP Online
 
Implementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataImplementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research Data
 
Supporting Research Data Management at the University of Stirling
Supporting Research Data Management at the University of StirlingSupporting Research Data Management at the University of Stirling
Supporting Research Data Management at the University of Stirling
 
Research data management and the Digital Curation Centre
Research data management and the Digital Curation CentreResearch data management and the Digital Curation Centre
Research data management and the Digital Curation Centre
 
DAF methodology
DAF methodologyDAF methodology
DAF methodology
 
What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...
 
Looking After Your Data: RDM @ Edinburgh
Looking After Your Data: RDM @ EdinburghLooking After Your Data: RDM @ Edinburgh
Looking After Your Data: RDM @ Edinburgh
 
Data Management Plans: a gentle introduction
Data Management Plans: a gentle introductionData Management Plans: a gentle introduction
Data Management Plans: a gentle introduction
 
Challenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceChallenges in setting up an RDM Support Service
Challenges in setting up an RDM Support Service
 
Meeting the NSF DMP Requirement: March 7, 2012
Meeting the NSF DMP Requirement: March 7, 2012Meeting the NSF DMP Requirement: March 7, 2012
Meeting the NSF DMP Requirement: March 7, 2012
 
Research Data Management Roadmap@Edinburgh
Research Data Management Roadmap@EdinburghResearch Data Management Roadmap@Edinburgh
Research Data Management Roadmap@Edinburgh
 
Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012
 
Introduction to Research Data Management
Introduction to Research Data ManagementIntroduction to Research Data Management
Introduction to Research Data Management
 
RDM and DMP intro
RDM and DMP introRDM and DMP intro
RDM and DMP intro
 
Dorothy Byatt JIBS-RLUK event July 2012
Dorothy Byatt JIBS-RLUK event July 2012Dorothy Byatt JIBS-RLUK event July 2012
Dorothy Byatt JIBS-RLUK event July 2012
 
RDM Priorities, Stakeholders, Practice
RDM Priorities, Stakeholders, PracticeRDM Priorities, Stakeholders, Practice
RDM Priorities, Stakeholders, Practice
 
What the DCC Can do for you
What the DCC Can do for youWhat the DCC Can do for you
What the DCC Can do for you
 
Pecha Kucha at Repofringe 2010
Pecha Kucha at Repofringe 2010Pecha Kucha at Repofringe 2010
Pecha Kucha at Repofringe 2010
 
Data Management Lab: Session 1 Slides
Data Management Lab: Session 1 SlidesData Management Lab: Session 1 Slides
Data Management Lab: Session 1 Slides
 

Plus de Eduserv

Phase two of OpenAthens SP evolution including OpenID connect option
Phase two of OpenAthens SP evolution including OpenID connect optionPhase two of OpenAthens SP evolution including OpenID connect option
Phase two of OpenAthens SP evolution including OpenID connect optionEduserv
 
Partnership Licensing - allowing access to licensed resources
Partnership Licensing - allowing access to licensed resources Partnership Licensing - allowing access to licensed resources
Partnership Licensing - allowing access to licensed resources Eduserv
 
Lightning talk - EBSCO
Lightning talk - EBSCOLightning talk - EBSCO
Lightning talk - EBSCOEduserv
 
Lightning talk - Boopsie
Lightning talk - BoopsieLightning talk - Boopsie
Lightning talk - BoopsieEduserv
 
Lightning talk - Softlink
Lightning talk - SoftlinkLightning talk - Softlink
Lightning talk - SoftlinkEduserv
 
Lightning talk - Third Iron BrowZine
Lightning talk - Third Iron BrowZineLightning talk - Third Iron BrowZine
Lightning talk - Third Iron BrowZineEduserv
 
Lightning talk - Eduserv Chest Agreements
Lightning talk - Eduserv Chest AgreementsLightning talk - Eduserv Chest Agreements
Lightning talk - Eduserv Chest AgreementsEduserv
 
Phase one of OpenAthens SP evolution
Phase one of OpenAthens SP evolutionPhase one of OpenAthens SP evolution
Phase one of OpenAthens SP evolutionEduserv
 
Key considerations when mapping your end user experience
Key considerations when mapping your end user experienceKey considerations when mapping your end user experience
Key considerations when mapping your end user experienceEduserv
 
Our product development methodology
Our product development methodologyOur product development methodology
Our product development methodologyEduserv
 
How Readers Discover Content
How Readers Discover ContentHow Readers Discover Content
How Readers Discover ContentEduserv
 
OpenAthens product update
OpenAthens product updateOpenAthens product update
OpenAthens product updateEduserv
 
OpenAthens Customer Conference - Welcome address
OpenAthens Customer Conference - Welcome addressOpenAthens Customer Conference - Welcome address
OpenAthens Customer Conference - Welcome addressEduserv
 
Generating leads with content marketing
Generating leads with content marketingGenerating leads with content marketing
Generating leads with content marketingEduserv
 
Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016
Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016
Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016Eduserv
 
Mobius from Maplesoft
Mobius from MaplesoftMobius from Maplesoft
Mobius from MaplesoftEduserv
 
QSR NVivo
QSR NVivo QSR NVivo
QSR NVivo Eduserv
 
How Eduserv are helping local government organisations
How Eduserv are helping local government organisationsHow Eduserv are helping local government organisations
How Eduserv are helping local government organisationsEduserv
 
Is cloud the right fit for your needs?
Is cloud the right fit for your needs?Is cloud the right fit for your needs?
Is cloud the right fit for your needs?Eduserv
 
Planning your cloud strategy: Adur and Worthing Councils
Planning your cloud strategy: Adur and Worthing CouncilsPlanning your cloud strategy: Adur and Worthing Councils
Planning your cloud strategy: Adur and Worthing CouncilsEduserv
 

Plus de Eduserv (20)

Phase two of OpenAthens SP evolution including OpenID connect option
Phase two of OpenAthens SP evolution including OpenID connect optionPhase two of OpenAthens SP evolution including OpenID connect option
Phase two of OpenAthens SP evolution including OpenID connect option
 
Partnership Licensing - allowing access to licensed resources
Partnership Licensing - allowing access to licensed resources Partnership Licensing - allowing access to licensed resources
Partnership Licensing - allowing access to licensed resources
 
Lightning talk - EBSCO
Lightning talk - EBSCOLightning talk - EBSCO
Lightning talk - EBSCO
 
Lightning talk - Boopsie
Lightning talk - BoopsieLightning talk - Boopsie
Lightning talk - Boopsie
 
Lightning talk - Softlink
Lightning talk - SoftlinkLightning talk - Softlink
Lightning talk - Softlink
 
Lightning talk - Third Iron BrowZine
Lightning talk - Third Iron BrowZineLightning talk - Third Iron BrowZine
Lightning talk - Third Iron BrowZine
 
Lightning talk - Eduserv Chest Agreements
Lightning talk - Eduserv Chest AgreementsLightning talk - Eduserv Chest Agreements
Lightning talk - Eduserv Chest Agreements
 
Phase one of OpenAthens SP evolution
Phase one of OpenAthens SP evolutionPhase one of OpenAthens SP evolution
Phase one of OpenAthens SP evolution
 
Key considerations when mapping your end user experience
Key considerations when mapping your end user experienceKey considerations when mapping your end user experience
Key considerations when mapping your end user experience
 
Our product development methodology
Our product development methodologyOur product development methodology
Our product development methodology
 
How Readers Discover Content
How Readers Discover ContentHow Readers Discover Content
How Readers Discover Content
 
OpenAthens product update
OpenAthens product updateOpenAthens product update
OpenAthens product update
 
OpenAthens Customer Conference - Welcome address
OpenAthens Customer Conference - Welcome addressOpenAthens Customer Conference - Welcome address
OpenAthens Customer Conference - Welcome address
 
Generating leads with content marketing
Generating leads with content marketingGenerating leads with content marketing
Generating leads with content marketing
 
Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016
Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016
Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016
 
Mobius from Maplesoft
Mobius from MaplesoftMobius from Maplesoft
Mobius from Maplesoft
 
QSR NVivo
QSR NVivo QSR NVivo
QSR NVivo
 
How Eduserv are helping local government organisations
How Eduserv are helping local government organisationsHow Eduserv are helping local government organisations
How Eduserv are helping local government organisations
 
Is cloud the right fit for your needs?
Is cloud the right fit for your needs?Is cloud the right fit for your needs?
Is cloud the right fit for your needs?
 
Planning your cloud strategy: Adur and Worthing Councils
Planning your cloud strategy: Adur and Worthing CouncilsPlanning your cloud strategy: Adur and Worthing Councils
Planning your cloud strategy: Adur and Worthing Councils
 

Dernier

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 

Dernier (20)

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 

Simon Hodson

  • 1. Thursday 10 May 2012 Eduserv Symposium: Big Data JISC and the Big (Research) Data Challenge Simon Hodson JISC Programme Manager, Managing Research Data
  • 2. Why is managing research data important? JISC considers it a priority to support universities in improving the way research data is managed and, where appropriate, made available for reuse. Research funder policies, legislative frameworks, good practice, open data agenda – The outputs of publicly funded research should be publicly available. – The evidence underpinning research findings should be available for validation Good data management is good for research – More efficient research process, avoidance of data loss, benefits of data reuse Alignment with university missions. – Universities want to provide excellent research infrastructure. – Universities want to have better oversight of research outputs.
  • 3. Estimated Research Data Requirements Two Russell Group Universities Estimated current data holdings of c.2PB (managed and unmanaged) Currently provide 800TB/300TB in a central storage facility, not all of which is used (but will be full in 12-18 months)… Significant amount of data in temporary storage, external drives etc… ‘the more groups we go to talk to, the more we're hearing of significant data holdings on external hard drives and small RAID systems’ 1994 Group University No central research data provision. Faculties (medicine, business, humanities) have 20-30TB each. Engineering currently has 170TB faculty system, urgent need to expand. But… one group, recently interviewed, currently has 250TB, only half in ‘managed storage’; will reach PB levels in the next few years.
  • 4. DUDs The data centre under the desk (or in a back pack) is not adequate.
  • 5. Why manage research data? Not just about storage or avoiding data loss…! It’s about knowing what to keep and what to throw away… Important to extract maximum return on investment from publicly funded research. Access to underlying data is essential for verification and therefore research integrity. Opportunities to extract more knowledge from existing data, new analysis. It’s about making the most out of data created!
  • 6. Making Data Meaningful and Reusable
  • 7. JISC and Research Data 1. Understanding the problem (pre-2007-2009) 2. Prototyping solutions (2009-11) 3. Hardening solutions and building institutional capacity (2011-13) 4. Developing elements of national infrastructure (2013+)
  • 8. 1: Understanding the Problem Key JISC reports: Dealing with Data: http://www.ukoln.ac.uk/ukoln/staff/ e.j.lyon/reports/dealing_with_data_ report-final.pdf Keeping Research Data Safe: http://www.jisc.ac.uk/media/docum ents/publications/keepingresearch datasafe0408.pdf Skills, Role, Career Structure of Data Scientists and Curators: http://www.jisc.ac.uk/media/docum ents/programmes/digitalrepositorie s/dataskillscareersfinalreport.pdf Other: UKRDS Scoping Study: http://www.ukrds.ac.uk/resources/
  • 9. Prototyping Solutions: First MRD Programme, 2009-11 RDM Infrastructure (guidance/support, systems) RDM Planning (DMPs, best practice, disciplinary challenges) RDM Training (targeted at disciplinary needs) Challenges of data citation and publication First JISC MRD Programme, 2009-11: http://bit.ly/jiscmrd2009-11 JISC MRD Outputs Page: http://bit.ly/jiscmrd2009-11-outputs
  • 10. Building Institutional Capacity: First MRD Programme, 2009-11 RDM Infrastructure (policy, guidance/support, systems) 17 large projects RDM Planning (DMPs, best practice, disciplinary challenges) RDM Training (disciplines and libraries/research support) Innovative data publication Second JISC MRD Programme, 2009-11: http://bit.ly/jiscmrd2009-11 Projects shortly to be announced for research data publication and developing RDM training materials: http://bit.ly/jiscmrd-2012-Call
  • 11. A holistic approach… Leadership and Policy Development Publication, Citation Guidance and and Discovery Training Mechanisms Support for Data RDM Systems and Management Infrastructure Planning
  • 12. How to develop RDM services Why develop services? Roles and responsibilities In development! Process of service development The components / building blocks • Policy • Data Management Planning • Storage • Data registry..... Examples and case studies to Getting started develop into toolkit Slide Credit: Sarah Jones and Martin Donnelly, DCC
  • 13. Next steps? Elements of a national infrastructure Journals are increasingly implementing policies requiring availability of underlying data. Registry of Journal Data Policies to help researchers and research administrators understand the implications and changing landscape. Universities are developing catalogues of research data holdings. National registry of research data to facilitate discovery, reuse; better understanding of impact and research landscape.
  • 14.
  • 15. Thank You! First JISC MRD Programme, 2009-11: http://bit.ly/jiscmrd2009-11 JISC MRD Outputs Page: http://bit.ly/jiscmrd2009-11-outputs Second JISC MRD Programme, 2011-13: http://bit.ly/jiscmrd2009-11 Programme Blog: http://researchdata.jiscinvolve.org/ MRD Project Blogs: http://tiny.cc/MRDblogs Twitter: #jiscmrd E-mail: s.hodson@jisc.ac.uk Acknowledgements for slides, content: Carol Goble, Liz Lyon, Peter Murray- Rust, David Shotton, Martin Donnelly, Sarah Jones.
  • 16. From prototype to platform… DataFlow Project: http://www.dataflow.ox.ac.uk/ UMF Programme SaaS for RDM Projects: http://www.jisc.ac.uk/whatwedo/programmes/umf.aspx
  • 17. The JISC UMF DataFlow Project Researchers DataStage is a file management system A DataStage data package consists of selected data files accompanied by an RDF metadata manifest, with a SWORD v2 wrapper DataStage file system Researchers, other users SWORD deposit DataBank is a generic repository, and can be used to store things other that research datasets, for example data management plans (DMPs) DataBank repository