SlideShare une entreprise Scribd logo
1  sur  22
Télécharger pour lire hors ligne
Long tail of Research Data
Making the link from the long tail to libraries
Charles (Chuck) Humphrey
University of Alberta Libraries
2014 May
Outline
● Context represented by the long tail of research data
● The long tail applied to project-level research
● Library roles in the research & data lifecycle
o Institutional roles
o Project-level service roles
o Inter-institutional network roles
Long Tail of Research Data
Long Tail of Research Data
RDMI
Research Data Management Infrastructure
● The combination of technology, services, and
expertise organised locally or globally to support
research data activities across the research lifecycle.
In Canada, we have gone from building a national institution
to support research data to building research data management
infrastructure. This infrastructure is being built from the
bottom-up with library involvement.
The Long Tail of
Research Data Applied
to Project-level
Research and Uses of
Technology
The number of projects that consist
of large volumes of data files under
1GB requires the most help with
research data management
infrastructure (RDMI).
As the scale in data file size
increases beyond 500GB, RDMI is
built into the operation of the
project. These projects are in need
of post-project preservation of their
data.
Management & stewardship
Managing research data entails the many activities dealing
with the operational support of data across the stages of the
research lifecycle. This involves the “what” and “how” of
research data.
Data Stewardship is about the identity of those responsible
for ensuring data management activities are performed to best
practice levels and standards across the complete lifecycle.
This addresses “who” is responsible for specific data activities.
Institution
Level
Project
Level
KEY
Research lifecycle
Library Roles
Project
Research
Institution
Network
Research data management
Data stewardship
Library and projects
● Tools, services, and expertise
o Data management planning
o Metadata choices: objects and workflow
o Project file sharing
o Data file version management
o DOI assignment and registration
o Data file citations
o Predicable data and metadata flows for submission
to a data repository, including file formats
Meeting researchers’ needs
● An easy way to share data with one, a few, or many other
researchers that does not involve the use of email or
Dropbox.
● A simplified approach to entering project-level metadata that
can be used repeatedly with other applications.
● A one-step method of minting DOIs for data files that can be
used in publications .
● A way to manage multiple versions of data files, including
keeping track of changes made to the data.
● A service that helps organize data to submit for preservation
processing.
Library and the institution
● Data stewardship
o Research data policy for the institution
o Data deposit and dissemination agreements
o Suite of preservation policies
● Tools, services, and expertise
o Data curation
o Data dissemination
o Data preservation
Institutional policy
Institutional policy
Data
Coordinator
Data
Curator
Digital
Preservation
Officer
Metadata
Librarian
DITL /
Storage
Team
Access /
Discovery
Librarian
Co-ordinate
Submission
Generate
AIP
AIP Quality
Assurance
Co-ordinate
Updates
Build
Metadata
SIP Quality
Assurance
Co-ordinate
Updates
Disaster
Recovery
Generate
Descriptive
Info
Data
Management
Manage DM
Co-ordinate
Access
DIP Quality
Assurance
Generate DIP
Data
Management
Archival
Storage
Co-ordinate
Submission
Preservation
Roles and
Responsibilities
of UAL
Positions
Library and networks
● Shared data management infrastructure
o Tools development
o Preservation processing and storage
o Discovery metadata exchange
Virtual
Research
Environment
Data in
Publications
Curated Data
Preservation
System
The ARC
Network of
Canadian
Libraries
Atlantic Canada
Quebec
Ontario
The Prairies
British Columbia
Library Roles
Project
Research
Institution
Network
Research data management
Data stewardship

Contenu connexe

Plus de OpenAIRE

Plus de OpenAIRE (20)

What will it cost to manage and share my data?
What will it cost to manage and share my data?What will it cost to manage and share my data?
What will it cost to manage and share my data?
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
 
6th Content Providers Community Call
6th Content Providers Community Call6th Content Providers Community Call
6th Content Providers Community Call
 
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
 
20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?
 
20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open Science20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open Science
 
20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)
 
20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open Science20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open Science
 
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
 
COVID-19: Activities, tools, best practice and contact points in Greece
 COVID-19: Activities, tools, best practice and contact points in Greece COVID-19: Activities, tools, best practice and contact points in Greece
COVID-19: Activities, tools, best practice and contact points in Greece
 
5th Content Providers Community Call
5th Content Providers Community Call5th Content Providers Community Call
5th Content Providers Community Call
 
4th Content Providers Community Call
4th Content Providers Community Call4th Content Providers Community Call
4th Content Providers Community Call
 
3rd Content Providers Community Call
3rd Content Providers Community Call3rd Content Providers Community Call
3rd Content Providers Community Call
 
2nd Content Providers Community Call
2nd Content Providers Community Call2nd Content Providers Community Call
2nd Content Providers Community Call
 
1st Content Providers Community Call
1st Content Providers Community Call1st Content Providers Community Call
1st Content Providers Community Call
 
20200130_Mannocci_OpenAIRE_ResearchGraph
20200130_Mannocci_OpenAIRE_ResearchGraph20200130_Mannocci_OpenAIRE_ResearchGraph
20200130_Mannocci_OpenAIRE_ResearchGraph
 
IPR and Exploitation
IPR and Exploitation IPR and Exploitation
IPR and Exploitation
 
Eosc_OpenAIRE_onboarding_v2
Eosc_OpenAIRE_onboarding_v2Eosc_OpenAIRE_onboarding_v2
Eosc_OpenAIRE_onboarding_v2
 

Dernier

Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
levieagacer
 
CYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptx
Silpa
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
MohamedFarag457087
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
seri bangash
 

Dernier (20)

Chemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdfChemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdf
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICEPATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRingsTransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
 
Genetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditionsGenetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditions
 
Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.
 
CYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptx
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspects
 
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
 

OpenAIRE-COAR conference 2014: Long tail of Science - Making the link from long tail to libraries, by Chuck Humphrey - University of Alberta

  • 1. Long tail of Research Data Making the link from the long tail to libraries Charles (Chuck) Humphrey University of Alberta Libraries 2014 May
  • 2. Outline ● Context represented by the long tail of research data ● The long tail applied to project-level research ● Library roles in the research & data lifecycle o Institutional roles o Project-level service roles o Inter-institutional network roles
  • 3. Long Tail of Research Data
  • 4. Long Tail of Research Data
  • 5. RDMI Research Data Management Infrastructure ● The combination of technology, services, and expertise organised locally or globally to support research data activities across the research lifecycle. In Canada, we have gone from building a national institution to support research data to building research data management infrastructure. This infrastructure is being built from the bottom-up with library involvement.
  • 6. The Long Tail of Research Data Applied to Project-level Research and Uses of Technology The number of projects that consist of large volumes of data files under 1GB requires the most help with research data management infrastructure (RDMI). As the scale in data file size increases beyond 500GB, RDMI is built into the operation of the project. These projects are in need of post-project preservation of their data.
  • 7. Management & stewardship Managing research data entails the many activities dealing with the operational support of data across the stages of the research lifecycle. This involves the “what” and “how” of research data. Data Stewardship is about the identity of those responsible for ensuring data management activities are performed to best practice levels and standards across the complete lifecycle. This addresses “who” is responsible for specific data activities.
  • 10. Library and projects ● Tools, services, and expertise o Data management planning o Metadata choices: objects and workflow o Project file sharing o Data file version management o DOI assignment and registration o Data file citations o Predicable data and metadata flows for submission to a data repository, including file formats
  • 11.
  • 12. Meeting researchers’ needs ● An easy way to share data with one, a few, or many other researchers that does not involve the use of email or Dropbox. ● A simplified approach to entering project-level metadata that can be used repeatedly with other applications. ● A one-step method of minting DOIs for data files that can be used in publications . ● A way to manage multiple versions of data files, including keeping track of changes made to the data. ● A service that helps organize data to submit for preservation processing.
  • 13.
  • 14. Library and the institution ● Data stewardship o Research data policy for the institution o Data deposit and dissemination agreements o Suite of preservation policies ● Tools, services, and expertise o Data curation o Data dissemination o Data preservation
  • 17.
  • 18. Data Coordinator Data Curator Digital Preservation Officer Metadata Librarian DITL / Storage Team Access / Discovery Librarian Co-ordinate Submission Generate AIP AIP Quality Assurance Co-ordinate Updates Build Metadata SIP Quality Assurance Co-ordinate Updates Disaster Recovery Generate Descriptive Info Data Management Manage DM Co-ordinate Access DIP Quality Assurance Generate DIP Data Management Archival Storage Co-ordinate Submission Preservation Roles and Responsibilities of UAL Positions
  • 19. Library and networks ● Shared data management infrastructure o Tools development o Preservation processing and storage o Discovery metadata exchange
  • 20.
  • 21. Virtual Research Environment Data in Publications Curated Data Preservation System The ARC Network of Canadian Libraries Atlantic Canada Quebec Ontario The Prairies British Columbia