Data sharing as part of the research workflow
Varsha Khodiyar, PhD
Data Curation Editor, Scientific Data
Nature Publishing Group
@varsha_khodiyar
@scientificdata
Perspective from Scientific Data
Data Perspective beyond Alliances, 3rd March 2016
Why the push to share data?
Research conduct
• Publication bias – what is submitted
• Experimental design
• Statistics
• Lab supervision and training
Research reporting and sharing
• Gels, microscopy images
• Statistical reporting
• Methods description
• Data deposition and availability
Generating research data is expensive
Just 18.1% of NIH grant applications were funded in 2014*
• Hours spent writing grants?
• Hours spent reviewing grants?
Resources are finite/expensive
• Modified animals
• Specialized reagents
Time and effort taken in the laboratory to generate good, valid data
* report.nih.gov/success_rates/Success_ByIC.cfm
Data needs to be…
• Discoverable: need to know it’s there
• Accessible: must be able to get to the data
• Usable: require sufficient information about how the data was generated
• Persistent: historical data access as part of the scientific record, as well as for new research
• Reliable: data provenance informs data reuse decisions
Joint Declaration of Data Citation Principles. www.force11.org/group/joint-declaration-data-citation-principles-final
Achieving human and machine accessibility of cited data in scholarly publications. Starr et al. PeerJ Computer Science (2015). doi:10.7717/peerj-cs.1
Making data count. Kratz & Strasser. Sci. Data (2015). doi:10.1038/sdata.2015.39
The FAIR guiding principles for scientific data management and stewardship. Wilkinson et al. Sci. Data (in press)
Researchers already share data
• Most researchers are sharing data, and using the data of others
• Direct contact between researchers (on request) is a common way of sharing data
• Repositories are the second most common method of sharing
Kratz and Strasser (2015) doi: 10.1371/journal.pone.0117619
But…
Sharing of data upon request from published articles
• relies heavily on trust
• when stored informally, disappears at a rate of ~17% per year (Vines et al. 2014; doi: 10.1016/j.cub.2013.11.014)
Data shared in a repository
• often not reusable due to insufficient context
• may not be possible to determine reliability (peer review?)
• may not be easily findable, if not referenced in a scholarly article
• no scholarly credit for data producers
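To make the ~17% per year figure concrete, the short sketch below treats it as a constant yearly loss and compounds it over time. That constant-rate reading is a simplifying assumption made here for illustration; the cited paper should be consulted for how the figure was actually estimated. The function name and the chosen time points are hypothetical.

```python
def fraction_remaining(years: int, annual_loss: float = 0.17) -> float:
    """Fraction of informally stored datasets still obtainable after `years`.

    Assumes the same proportion is lost every year (an illustrative
    simplification, not a result taken from the cited study).
    """
    return (1.0 - annual_loss) ** years


# Print the compounded effect at a few time points after publication.
for years in (1, 5, 10, 20):
    print(f"after {years:2d} years: {fraction_remaining(years):.1%} remaining")
```

Under this assumption only a small minority of informally stored datasets would still be obtainable a decade after publication, which is the motivation for formal storage in a repository.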
Data papers and journals
• Ensure formal storage in repository
• Allow space for authors to include sufficient context for reuse
• Peer reviewers often specifically requested to comment on data archive reusability
• Data papers are formal works, giving scholarly credit to data producers
• Formal data citations enable data discovery via the bibliographic indexes that researchers are used to using
Data journals and multidisciplinary research
Cross-domain data sharing is vital for solving the most pressing world issues:
• Public health (social science, epidemiology & molecular biology)
• Resource management & sustainability (energy research, policy, ecology & climate science)
Researchers in different domains differ in their vocabulary and in how they express reliability, so clear descriptions of data become even more essential for cross-domain data sharing.
Multidisciplinary data journals (e.g. Data Science Journal, Scientific Data):
• provide a data sharing outlet to researchers in all domains
• help datasets cross domain boundaries, making data more visible and searchable, i.e. less siloing
Data reuse by the research community
“The Data Descriptor made it easier to use the data, for me it was critical that everything was there…all the technical details like voxel size.”
Professor Daniele Marinazzo
Data reuse by the non-research community
http://www.nytimes.com/interactive/2014/12/30/science/history-of-ebola-in-24-outbreaks.html
Increasing the discoverability of data
• Is data truly discoverable by researchers outside the original authors’ domain?
• Too many papers to read in each person’s own field.
• Could increasing the machine accessibility of data result in increased data reuse?
Data Descriptors have human and machine readable components
• Human readable representation of study, i.e. article (HTML & PDF)
• Machine readable representation of study, i.e. metadata
• We capture metadata about the data being described in each Data Descriptor
• The manuscript captures human readable metadata needed for data reuse
• The curated metadata records capture machine readable metadata needed for machine based data discovery
Metadata at Scientific Data
ISA format for machine readable metadata
• Study workflow
• Key sample characteristics needed for data discovery
• Relates samples to data files
• Shows location of dataset
• Uses controlled vocabularies and ontologies (where possible)
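The bullets above can be illustrated with a minimal sketch of a machine readable study record. This is not the ISA specification or the API of any ISA tooling; the class names, field names, file name, and dataset URL are all hypothetical, chosen only to show how a workflow, samples, data files, a dataset location, and ontology-annotated values might be tied together.

```python
from dataclasses import dataclass, field


@dataclass
class OntologyTerm:
    """A value annotated with a controlled-vocabulary identifier."""
    label: str
    term_id: str  # ontology accession; the example value below is illustrative


@dataclass
class Sample:
    name: str
    # Key characteristics needed for discovery, e.g. organism.
    characteristics: dict[str, OntologyTerm] = field(default_factory=dict)


@dataclass
class DataFile:
    filename: str
    sample_name: str  # relates the file back to a sample
    location: str     # where the dataset lives (hypothetical URL below)


@dataclass
class StudyRecord:
    title: str
    workflow: list[str]  # ordered steps of the study
    samples: list[Sample]
    files: list[DataFile]

    def files_for(self, sample_name: str) -> list[str]:
        """Return the names of the data files derived from one sample."""
        return [f.filename for f in self.files if f.sample_name == sample_name]


# Hypothetical record for illustration only.
record = StudyRecord(
    title="Hypothetical imaging study",
    workflow=["sample collection", "imaging", "preprocessing"],
    samples=[
        Sample("sample-1", {"organism": OntologyTerm("Homo sapiens", "NCBITaxon:9606")}),
    ],
    files=[DataFile("scan_001.nii", "sample-1", "https://example.org/dataset/123")],
)
print(record.files_for("sample-1"))
```

Storing the organism as an ontology term rather than free text is what lets a machine match this record against records from other studies that use the same vocabulary.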
Metadata for data discovery
Search by:
• Data Repositories
• Experiment design
• Measurements made
• Technologies used
• Factor types
• Sample Characteristics
• Organism
• Environment types
• Geographic locations
scientificdata.isa-explorer.org
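Searching by the facets listed above amounts to filtering metadata records on exact field values. The sketch below shows the idea on three made-up records; it does not reflect how the ISA explorer is implemented, and the record contents, field names, and `search` function are hypothetical.

```python
# Made-up metadata records; titles, repositories, and values are illustrative.
RECORDS = [
    {"title": "Dataset A", "organism": "Homo sapiens",
     "technology": "MRI", "repository": "Repository X"},
    {"title": "Dataset B", "organism": "Mus musculus",
     "technology": "RNA-seq", "repository": "Repository Y"},
    {"title": "Dataset C", "organism": "Homo sapiens",
     "technology": "RNA-seq", "repository": "Repository X"},
]


def search(records, **facets):
    """Return the records whose fields match every requested facet value."""
    return [
        r for r in records
        if all(r.get(name) == value for name, value in facets.items())
    ]


# Combine two facets to narrow the result set.
hits = search(RECORDS, organism="Homo sapiens", technology="RNA-seq")
print([r["title"] for r in hits])
```

Exact matching like this only works when every record uses the same term for the same concept, which is why controlled vocabularies matter for discovery.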
Data as part of the publication workflow
Data Descriptors can be submitted and published at any point in the research workflow:
• Before the analysis has been published
• Publication alongside analysis article
• After data analysis has been published
• Authors not intending to analyse data
Data as part of the research workflow?
Papers are usually written after analyses, so key details can be forgotten
• Ideally metadata would be captured during the data generation process
• It takes time and effort to capture adequate metadata of sufficient quality for data reuse
Machine readable metadata
• Metadata format needs to be decided prospectively
• Researchers require professional expertise and guidance to use ontologies (essential for machine readability and discovery)
How to ensure data generators are able to capture metadata easily and in sufficient detail for reuse?
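One possible way to capture metadata at the moment data is generated is to write a small metadata file next to each data file and refuse to proceed when agreed fields are missing. The sketch below is only an illustration of that idea, not a practice described in the talk; the required field names, the `.meta.json` suffix, and the `write_sidecar` function are assumptions.

```python
import json
from datetime import datetime, timezone
from pathlib import Path

# Hypothetical minimal set of fields agreed before data collection starts.
REQUIRED_FIELDS = ("instrument", "operator", "sample_id", "protocol")


def write_sidecar(data_path: Path, metadata: dict) -> Path:
    """Write `<data file name>.meta.json` beside the data file.

    Raises ValueError if any required field is missing or empty, so that
    gaps are noticed at generation time rather than at writing-up time.
    """
    missing = [name for name in REQUIRED_FIELDS if not metadata.get(name)]
    if missing:
        raise ValueError(f"missing metadata fields: {', '.join(missing)}")

    record = dict(metadata)
    record["data_file"] = data_path.name
    record["captured_at"] = datetime.now(timezone.utc).isoformat()

    sidecar = data_path.with_name(data_path.name + ".meta.json")
    sidecar.write_text(json.dumps(record, indent=2))
    return sidecar
```

Calling `write_sidecar(Path("run1.csv"), {...})` right after an instrument run would leave `run1.csv.meta.json` alongside the data, ready to be mapped onto a formal metadata format later.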
Building infrastructure to promote data sharing as part of the research workflow
Discoverable
• Machine based data discovery
• Implement data citations
• Use community ontologies
Accessible & Persistent
• Encourage use of repositories
• Use persistent identifiers for data
Usable
• Metadata capture during data generation process
• Encourage use of minimal reporting standards
Reliable
• Encourage peer reviewers to evaluate data archive (structure, format) alongside the article
Researcher incentives
• Recognise data as a first class scholarly work
• Provide tools for data visualization and discovery
Scientific Data at RDA
Working groups
Publishing Data Workflows (co-chair)
BioSharing Registry (Susanna Sansone is co-chair)
Interest groups
Publishing Data
Data Fabric
Data in Context
Metadata
Certification of Digital Repositories
Visit nature.com/sdata
Email scientificdata@nature.com
Tweet @ScientificData
Honorary Academic Editor
Susanna-Assunta Sansone
Managing Editor
Andrew L. Hufton
Data Curation Editor
Varsha K. Khodiyar
Advisory Panel and Editorial
Board including senior researchers,
funders, librarians and curators
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 

Dernier (20)

All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 

Data Sharing as Part of Research Workflow

  • 1. Data sharing as part of the research workflow: perspective from Scientific Data
      Varsha Khodiyar, PhD, Data Curation Editor, Scientific Data, Nature Publishing Group (@varsha_khodiyar, @scientificdata)
      Data Perspective beyond Alliances, 3rd March 2016
  • 2. Why the push to share data?
      Research conduct:
      • Publication bias (what is submitted)
      • Experimental design
      • Statistics
      • Lab supervision and training
      Research reporting and sharing:
      • Gels, microscopy images
      • Statistical reporting
      • Methods description
      • Data deposition and availability
  • 3. Generating research data is expensive
      Just 18.1% of NIH grant applications were funded in 2014*
      • Hours spent writing grants?
      • Hours spent reviewing grants?
      Resources are finite and expensive
      • Modified animals
      • Specialized reagents
      Time and effort are needed in the laboratory to generate good, valid data
      * report.nih.gov/success_rates/Success_ByIC.cfm
  • 4. Data needs to be…
      • Discoverable: need to know it's there
      • Accessible: must be able to get to the data
      • Usable: requires sufficient information about how the data was generated
      • Persistent: historical data access as part of the scientific record, as well as for new research
      • Reliable: data provenance informs data reuse decisions
      References:
      • Joint Declaration of Data Citation Principles. www.force11.org/group/joint-declaration-data-citation-principles-final
      • Starr et al. Achieving human and machine accessibility of cited data in scholarly publications. PeerJ Computer Science (2015). doi:10.7717/peerj-cs.1
      • Kratz & Strasser. Making data count. Sci. Data (2015). doi:10.1038/sdata.2015.39
      • Wilkinson et al. The FAIR guiding principles for scientific data management and stewardship. Sci. Data (in press)
  • 5. Researchers already share data
      • Most researchers are sharing data, and using the data of others
      • Direct contact between researchers (on request) is a common way of sharing data
      • Repositories are the second most common method of sharing
      Kratz and Strasser (2015) doi:10.1371/journal.pone.0117619
  • 6. But…
      Sharing of data upon request from published articles:
      • relies heavily on trust
      • when stored informally, disappears at a rate of ~17% per year (Vines et al. 2014; doi:10.1016/j.cub.2013.11.014)
      Data shared in a repository:
      • often not reusable due to insufficient context
      • may not be possible to determine reliability (peer review?)
      • may not be easily findable, if not referenced in a scholarly article
      • no scholarly credit for data producers
  • 7. Data papers and journals
      • Ensure formal storage in a repository
      • Allow space for authors to include sufficient context for reuse
      • Peer reviewers are often specifically requested to comment on data archive reusability
      • Data papers are formal works, giving scholarly credit to data producers
      • Formal data citations enable data discovery via the bibliographic indexes that researchers are used to using
  • 8. Data journals and multidisciplinary research
      Cross-domain data sharing is vital for solving the most pressing world issues:
      • Public health (social science, epidemiology & molecular biology)
      • Resource management & sustainability (energy research, policy, ecology & climate science)
      Researchers in different domains use different vocabularies and express reliability differently, so clear descriptions of data become even more essential for cross-domain data sharing.
      Multidisciplinary data journals (e.g. Data Science Journal, Scientific Data):
      • provide a data sharing outlet to researchers in all domains
      • help datasets cross domain boundaries, making data more visible and searchable, i.e. less siloed
  • 9. Data reuse by the research community
      "The Data Descriptor made it easier to use the data, for me it was critical that everything was there…all the technical details like voxel size." Professor Daniele Marinazzo
  • 10. Data reuse by the non-research community
      http://www.nytimes.com/interactive/2014/12/30/science/history-of-ebola-in-24-outbreaks.html
  • 11. Increasing the discoverability of data
      • Is data truly discoverable by researchers outside the original authors' domain?
      • There are too many papers to read in each person's own field.
      • Could increasing the machine accessibility of data result in increased data reuse?
  • 12. Data Descriptors have human and machine readable components
      • Human readable representation of study, i.e. article (HTML & PDF)
      • Machine readable representation of study, i.e. metadata
  • 13. Metadata at Scientific Data
      • We capture metadata about the data being described in each Data Descriptor
      • The manuscript captures human readable metadata needed for data reuse
      • The curated metadata records capture machine readable metadata needed for machine based data discovery
  • 14. ISA format for machine readable metadata
      • Study workflow
      • Key sample characteristics needed for data discovery
      • Relates samples to data files
      • Shows location of dataset
      • Uses controlled vocabularies and ontologies (where possible)
  • 15. Metadata for data discovery
      Search by:
      • Data repositories
      • Experiment design
      • Measurements made
      • Technologies used
      • Factor types
      • Sample characteristics
      • Organism
      • Environment types
      • Geographic locations
      scientificdata.isa-explorer.org
  • 16. Data as part of the publication workflow
      Data Descriptors can be submitted and published at any point in the research workflow:
      • Before the analysis has been published
      • Publication alongside analysis article
      • After data analysis has been published
      • Authors not intending to analyse data
  • 17. Data as part of the research workflow?
      Papers are usually written after analyses, so key details can be forgotten
      • Ideally metadata would be captured during the data generation process
      • It takes time and effort to capture adequate metadata of sufficient quality for data reuse
      Machine readable metadata
      • Metadata format needs to be decided prospectively
      • Researchers require professional expertise and guidance to use ontologies (essential for machine readability and discovery)
      How can we ensure that data generators are able to capture metadata easily and in sufficient detail for reuse?
  • 18. Building infrastructure to promote data sharing as part of the research workflow
      Discoverable:
      • Machine based data discovery
      • Implement data citations
      • Use community ontologies
      Accessible & Persistent:
      • Encourage use of repositories
      • Use persistent identifiers for data
      Usable:
      • Metadata capture during data generation process
      • Encourage use of minimal reporting standards
      Reliable:
      • Encourage peer reviewers to evaluate data archive (structure, format) alongside the article
      Researcher incentives:
      • Recognise data as a first class scholarly work
      • Provide tools for data visualization and discovery
  • 19. Scientific Data at RDA
      Working groups:
      • Publishing Data Workflows (co-chair)
      • BioSharing Registry (Susanna Sansone is co-chair)
      Interest groups:
      • Publishing Data
      • Data Fabric
      • Data in Context
      • Metadata
      • Certification of Digital Repositories
  • 20. Contact
      • Visit nature.com/sdata
      • Email scientificdata@nature.com
      • Tweet @ScientificData
      Honorary Academic Editor: Susanna-Assunta Sansone
      Managing Editor: Andrew L. Hufton
      Data Curation Editor: Varsha K. Khodiyar
      Advisory Panel and Editorial Board including senior researchers, funders, librarians and curators
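
Slides 14 and 15 describe machine readable metadata that relates samples to data files, records the dataset location, and supports search by fields such as organism or technology. The sketch below illustrates that idea only. It is not the ISA format or the ISA explorer: the `DatasetRecord` class, its field names, the `search` helper, and all example values (including the example.org locations) are hypothetical placeholders chosen for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class DatasetRecord:
    """Hypothetical, simplified metadata record (not the real ISA schema)."""
    title: str
    repository: str                  # where the dataset is deposited
    data_location: str               # placeholder URL standing in for a persistent identifier
    organism: str
    technologies: List[str] = field(default_factory=list)
    measurements: List[str] = field(default_factory=list)
    # Relates each sample name to the data files derived from it
    sample_to_files: Dict[str, List[str]] = field(default_factory=dict)


def search(records, **criteria):
    """Return the records that satisfy every criterion.

    For list-valued fields a criterion matches when the list contains
    the wanted value; for other fields it must be equal.
    """
    hits = []
    for rec in records:
        matched = True
        for name, wanted in criteria.items():
            value = getattr(rec, name)
            if isinstance(value, list):
                matched = wanted in value
            else:
                matched = value == wanted
            if not matched:
                break
        if matched:
            hits.append(rec)
    return hits


# Made-up example records; none of these describe a real dataset.
records = [
    DatasetRecord(
        title="Example imaging dataset",
        repository="ExampleRepo",
        data_location="https://example.org/dataset/1",
        organism="Homo sapiens",
        technologies=["MRI"],
        measurements=["brain activity"],
        sample_to_files={"subject-01": ["sub01_scan.nii"]},
    ),
    DatasetRecord(
        title="Example sequencing dataset",
        repository="ExampleRepo",
        data_location="https://example.org/dataset/2",
        organism="Mus musculus",
        technologies=["RNA-seq"],
        measurements=["transcription profiling"],
        sample_to_files={"mouse-A": ["mouseA_reads.fastq"]},
    ),
]

human_mri = search(records, organism="Homo sapiens", technologies="MRI")
print([rec.title for rec in human_mri])
```

Because every field is structured rather than buried in free text, a query such as `search(records, technologies="RNA-seq")` can be answered by a program without reading the article, which is the kind of machine based discovery the slides argue for. A real system would draw field values from controlled vocabularies and ontologies rather than free-form strings.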