dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sharing Mandates" on September 22, 2023

dkNET
An NIDDK Resource dknet.org
dkNET Office Hours: New Data Mandates
Jeffrey S. Grethe
PI, NIDDK Information Network
Co-Director, FAIR Data Informatics Laboratory, UCSD
An NIDDK Resource dknet.org
New NIH Policy Effective January 25, 2023
• US National Institutes of Health new
data sharing policy goes into effect
• All data must be managed; most
data should be shared
• “As open as possible; as closed as
necessary”
• Mandates the inclusion, approval
and execution of a Data
Management and Sharing Plan
• (DMP + S = DMS)
https://grants.nih.gov/grants/guide/notice-files/NOT-OD-21-013.html
An NIDDK Resource dknet.org
The Details...
The effective date of the DMS Policy is January 25, 2023, including for:
• Competing grant applications that are submitted to NIH for the January
25, 2023 and subsequent receipt dates;
• Proposals for contracts that are submitted to NIH on or after January 25,
2023;
• NIH Intramural Research Projects conducted on or after January 25, 2023;
and
• Other funding agreements (e.g., Other Transactions) that are executed on
or after January 25, 2023, unless otherwise stipulated by NIH.
An NIDDK Resource dknet.org
More Details...
What
• Defines Scientific Data as: “The recorded factual material commonly accepted in the scientific
community as of sufficient quality to validate and replicate research findings, regardless of whether
the data are used to support scholarly publications. Scientific data do not include laboratory
notebooks, preliminary analyses, completed case report forms, drafts of scientific papers, plans for
future research, peer reviews, communications with colleagues, or physical objects, such as
laboratory specimens.”
• Even those scientific data not used to support a publication are considered scientific data and within
the final DMS Policy’s scope
When
• “[s]hared scientific data should be made accessible as soon as possible, and no later than the time of
an associated publication, or the end of the award/support period, whichever comes first.”
• Researchers may share data underlying publication during the period of award but may share other
data that have not yet led to a publication by the end of the award period.
Where
• Encourages the use of established repositories to the extent possible.
An NIDDK Resource dknet.org
More Details...
How
• NIH encourages data management and data sharing practices consistent with the FAIR data
principles
Funding
• Fees for long-term data preservation and sharing are allowable, but funds for these activities must be
spent during the performance period, even for scientific data and metadata preserved and shared
beyond the award period.
Repercussions
• After the end of the funding period, non-compliance with the NIH ICO-approved Plan may be taken
into account by NIH for future funding decisions for the recipient institution
The DMS Policy applies to all research, funded or conducted in whole or in part by NIH, that results in the
generation of scientific data. This includes research funded or conducted by extramural grants, contracts,
Intramural Research Projects, or other funding agreements regardless of NIH funding level or funding
mechanism.
An NIDDK Resource dknet.org
Data as a Research Product
Sound, reproducible scholarship rests upon a foundation of robust, accessible data. For this to be
so in practice as well as theory, data must be accorded due importance in the practice of
scholarship and in the enduring scholarly record…”
https://www.force11.org/group/joint-declaration-dat
a-citation-principles-final
Joint Declaration of Data Citation Principles
1. Data should be considered legitimate, citable products of research. Data citations should be
accorded the same importance in the scholarly record as citations of other research objects,
such as publications.
2. Data citations should facilitate giving scholarly credit and normative and legal attribution to all
contributors to the data, recognizing that a single style or mechanism of attribution may not be
applicable to all data.
3. In scholarly literature, whenever and wherever a claim relies upon data, the corresponding data
should be cited.
A data citation looks like a regular citation
DOI
Full citation
DOI:10.34945/F5XW2P
Proper data citation = data citation metrics
https://datasetsearch.research.google.com/
An NIDDK Resource dknet.org
Good data management is the gateway to data sharing
Borghi J, Abrams S, Lowenberg D, Simms S, Chodacki J (2018) Support Your Data: A Research Data
Management Guide for Researchers. Research Ideas and Outcomes 4: e26439.
https://doi.org/10.3897/rio.4.e26439
An NIDDK Resource dknet.org
Changing the culture around data management and sharing
• Me
• Answer to the underpowered study
• Data sharing and good data
management are closely aligned
• Compliance with mandates
• Credit for the totality of my work
• Science and Society
• Transparency
• Reproducibility
• Reduced waste
• Driving discovery
• Future me
• One most likely to benefit from good
data management and sharing
through stable archives
• No one ever regretted annotating too
much
• My colleagues (and PI)
• Easy to engage with colleagues over
well annotated data and associated
code
• What happens when the post doc
leaves?
April 2021; National Academies of Science Workshop
An NIDDK Resource dknet.org
But how do I do that?
● “NIH encourages data management and data sharing practices
consistent with the FAIR data principles.”
● “NIH strongly encourages the use of established repositories to
the extent possible for preserving and sharing scientific data”
An NIDDK Resource dknet.org
The FAIR Guiding Principles for scientific data management and
stewardship
High level principles to make data:
• Findable
• Accessible
• Interoperable
• Re-usable
Mark D. Wilkinson et al. The FAIR Guiding Principles for scientific data management and stewardship, Scientific Data (2016). DOI: 10.1038/sdata.2016.18
…for humans and machines
An NIDDK Resource dknet.org
Findable
● F1. (meta)data are assigned a globally unique and
persistent identifier
● F2. data are described with rich metadata
● F3. metadata clearly and explicitly include the identifier of
the data it describes
● F4. (meta)data are registered or indexed in a searchable
resource
Accessible
● A1. (meta)data are retrievable by their identifier using a
standardized communications protocol
● A1.1 the protocol is open, free, and universally
implementable
● A1.2 the protocol allows for an authentication and
authorization procedure, where necessary
● A2. metadata are accessible, even when the data are no
longer available
● I1. (meta)data use a formal, accessible, shared, and
broadly applicable language for knowledge
representation.
● I2. (meta)data use vocabularies that follow FAIR
principles
● I3. (meta)data include qualified references to other
(meta)data
Re-usable
● R1. meta(data) are richly described with a plurality of
accurate and relevant attributes
● R1.1. (meta)data are released with a clear and accessible
data usage license
● R1.2. (meta)data are associated with detailed provenance
● R1.3. (meta)data meet domain-relevant community
standards
Interoperable
G
o
o
d
M
e
t
a
d
a
t
a
DOI for your data
U
s
e
o
f
A
p
p
r
o
p
r
i
a
t
e
R
e
p
o
s
i
t
o
r
y
An NIDDK Resource dknet.org
Resource sharing plan →
Data Management and Sharing Plan
An NIDDK Resource dknet.org
Required Elements of the NIH Plan
Data Type
Related Tools, Software and/or Code
Standards
Data Preservation, Access, and Associated Timelines
Access, Distribution, or Reuse Considerations
Oversight of Data Management and Sharing
How compliance with the Plan will be monitored and managed,
frequency of oversight, and by whom (e.g., titles, roles).
Adapted from Ghosh et al. 2022
Not a part of review
An NIDDK Resource dknet.org
Data Type
• Type and amount/size
• Modality
• Level of aggregation
• Degree of data processing
• Which data and why
• A brief listing of the metadata
• Other relevant data, and
• Associated documentation to
facilitate interpretation of the
scientific data.
Example of some metadata needed
• Acquisition: Instrument, Protocol
• Biosample: Tissue, Cell, Organoid
• Participant/Donor
• Assay
• Analytics
Voltage traces Spike times Fluorescence traces
Optophysiology MRI Microscopy
Adapted from Ghosh et al. 2022
An NIDDK Resource dknet.org
Standards
• Data formats
• CSV/TSV, PNG, Tiff
• NWB, NIfTI, OME.Zarr, SWC
• Data dictionaries
• Data identifiers
• Sub-01, Ses-02,
• Definitions
• Ontologies
• Unique identifiers
• UUID, RRID
• Other data documentation
• Quality assurance, control
xkcd: 927
Adapted from Ghosh et al. 2022
An NIDDK Resource dknet.org
What standards should I use?
● Repositories often enforce specific
standards for metadata and data
● Thinking about where your data will
end up before you start your
experiments will help you determine
how to collect, annotate and
organize your (meta)data
● Fairsharing.org maintains a database
of standards and policies across
biomedicine
An NIDDK Resource dknet.org
Related Tools, Software and/or Code
• Any specialized tools needed
to access or manipulate
shared scientific data
• How needed tools can be
accessed
• Whether such tools are
likely to remain available for
as long as the scientific data
remain available.
Adapted from Ghosh et al. 2022
This should include the following information, if applicable:
● Which statistical package or program was used to
manipulate the data, along with the version of the
software that was used and any packages, scripts, or
settings that were used or developed during the course of
the study, as well as how users can access the software
● Whether there were any custom workflows or pipelines
developed as part of the study necessary to analyze or
process the data, and how
● Whether there were any executable programs or macros
written as part of the study necessary to analyze or
process the data, as well as how users can access the code
An NIDDK Resource dknet.org
Data Preservation, Access, and Associated Timelines
• Name of the repository(ies)
• How data will be findable and
identifiable
• Timeline
Adapted from Ghosh et al. 2022
An NIDDK Resource dknet.org
Lesson: Think about where your
data will end up in the beginning
Best practice: Submit your data to repository specialized for your type of data or your
domain
..if there isn’t one, then there are also general purpose repositories available
dknet.org
An NIDDK Resource
Where Can I Deposit My Data?
• List of DK relevant
repositories,
recommended by NLM
and various journals
• Created in conjunction
with NIDDK
• Coming soon: FAIR data
wizard
● FAIR Standards
● Clinical Repositories
Information
● Data maintenance
● Data size limit and cost
● Dynamic database
https://dknet.org/about/Suggested-data-repositories-niddk
An NIDDK Resource dknet.org
Access, Distribution, or Reuse Considerations
• Informed consent
• Privacy and confidentiality
protections
• Whether access to scientific data
derived from humans will be
controlled
• Any restrictions imposed by
federal, Tribal, or state laws,
regulations, or policies, or existing
or anticipated agreements
• Any other considerations that may
limit the extent of data sharing.
Adapted from Ghosh et al. 2022
An NIDDK Resource dknet.org
Data Management and Sharing Plan
● Creating a good data management
and sharing plan allows you to:
○ Comply with NIH mandates
○ Ensure that you allocate
enough resources for
preparing and sharing your
data
○ Ensure that you collect your
data in a FAIR manner
○ Easily share data with yourself,
future you, your colleagues
and the scientific community
● dkNET provides links to resources
that can help https://dknet.org/rin/rigor-reproducibility-about
An NIDDK Resource dknet.org
• Helps plan ahead
• Start alongside proposal, well
before data collection
• Human and technological
resources
• Cost: $$, Effort
• Cost
• Training/re-training
• Time
• Resources
It benefits your scientific work!
Data Management and Sharing Plan
• Benefit
• Reduction in $$, effort, less
surprises
• Organizing data and metadata
reduces transition effort
• Open data opens new avenues and
may reduce collection cost
• Using standards lowers
development cost
Adapted from Ghosh et al. 2022
An NIDDK Resource dknet.org
An NIDDK Resource dknet.org
FAIR Partnership
Researchers Repositories and
Registries
Indexers
Aggregators
• Good data
management
• Rich metadata
• Prepare to share
• Open formats
• Adopt/align to
standards
• Submit to repository
• Persistent identifier
• Machine based access
• Clear license
• Support for open, domain
specific standards
• Machine readable metadata
• Future friendly formats
• Persistent metadata
• Bidirectional links
• Data citation
• Index
• Effective Search
• Persistent metadata
dkNET
An NIDDK Resource dknet.org
Provide Easy Access to the DMSP Resources
An NIDDK Resource dknet.org
dkNET Collection of Information
An NIDDK Resource dknet.org
Repository Wizard
• Rate 80
repositories against
FAIR, Open,
Trustworthy,
Citable principles
and other criteria
• Wizard guides a
user through
criteria that may be
important for
selection
Coming Soon
dknet.org
An NIDDK Resource
Get involved in the dkNET Community
dkNET Homepage: dkNET.org
Check out dkNET Resour
Join Webinar
Follow us
@dknet_info
Check Out or Post
News and Funding
Opportunities
Blog, Calendar
Sign up email list
An NIDDK Resource dknet.org
1 sur 32

Recommandé

dkNET Office Hours - "Are You Ready for 2023? New NIH Data Management and Sha... par
dkNET Office Hours - "Are You Ready for 2023? New NIH Data Management and Sha...dkNET Office Hours - "Are You Ready for 2023? New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023? New NIH Data Management and Sha...dkNET
65 vues29 diapositives
Praetzellis "Data Management Planning and Tools" par
Praetzellis "Data Management Planning and Tools"Praetzellis "Data Management Planning and Tools"
Praetzellis "Data Management Planning and Tools"National Information Standards Organization (NISO)
410 vues36 diapositives
NSF Data Requirements and Changing Federal Requirements for Research par
NSF Data Requirements and Changing Federal Requirements for ResearchNSF Data Requirements and Changing Federal Requirements for Research
NSF Data Requirements and Changing Federal Requirements for ResearchMargaret Henderson
648 vues30 diapositives
Data Management Planning for Engineers par
Data Management Planning for EngineersData Management Planning for Engineers
Data Management Planning for EngineersSherry Lake
821 vues25 diapositives
OpenAIRE webinar on Open Research Data in H2020 (OAW2016) par
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE
1.4K vues32 diapositives
DMP health sciences par
DMP health sciencesDMP health sciences
DMP health sciencesSarah Jones
870 vues18 diapositives

Contenu connexe

Similaire à dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sharing Mandates" on September 22, 2023

dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha... par
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...dkNET
106 vues22 diapositives
You down with dmp yeah you know me! par
You down with dmp  yeah you know me!You down with dmp  yeah you know me!
You down with dmp yeah you know me!Renaine Julian
312 vues27 diapositives
dkNET Introduction for Librarians par
dkNET Introduction for LibrariansdkNET Introduction for Librarians
dkNET Introduction for LibrariansdkNET
158 vues35 diapositives
Open Science Globally: Some Developments/Dr Simon Hodson par
Open Science Globally: Some Developments/Dr Simon HodsonOpen Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonAfrican Open Science Platform
186 vues23 diapositives
Shifting the goal post – from high impact journals to high impact data par
 Shifting the goal post – from high impact journals to high impact data Shifting the goal post – from high impact journals to high impact data
Shifting the goal post – from high impact journals to high impact data CGIAR Research Program on Dryland Systems
508 vues21 diapositives
EPSRC research data expectations and research software management par
EPSRC research data expectations and research software managementEPSRC research data expectations and research software management
EPSRC research data expectations and research software managementHistoric Environment Scotland
928 vues22 diapositives

Similaire à dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sharing Mandates" on September 22, 2023(20)

dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha... par dkNET
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET106 vues
You down with dmp yeah you know me! par Renaine Julian
You down with dmp  yeah you know me!You down with dmp  yeah you know me!
You down with dmp yeah you know me!
Renaine Julian312 vues
dkNET Introduction for Librarians par dkNET
dkNET Introduction for LibrariansdkNET Introduction for Librarians
dkNET Introduction for Librarians
dkNET158 vues
Data accessibilityandchallenges par jyotikhadake
Data accessibilityandchallengesData accessibilityandchallenges
Data accessibilityandchallenges
jyotikhadake159 vues
Data Governance in two different data archives: When is a federal data reposi... par Carolyn Ten Holter
Data Governance in two different data archives: When is a federal data reposi...Data Governance in two different data archives: When is a federal data reposi...
Data Governance in two different data archives: When is a federal data reposi...
DataONE Education Module 02: Data Sharing par DataONE
DataONE Education Module 02: Data SharingDataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data Sharing
DataONE1.2K vues
Managing Your Research Data for Maximum Impact -Rob Daley 300616_Shared par Rob Daley
Managing Your Research Data for Maximum Impact -Rob Daley 300616_SharedManaging Your Research Data for Maximum Impact -Rob Daley 300616_Shared
Managing Your Research Data for Maximum Impact -Rob Daley 300616_Shared
Rob Daley176 vues
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09... par dkNET
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
dkNET325 vues
LIBER Webinar: Are the FAIR Data Principles really fair? par LIBER Europe
LIBER Webinar: Are the FAIR Data Principles really fair?LIBER Webinar: Are the FAIR Data Principles really fair?
LIBER Webinar: Are the FAIR Data Principles really fair?
LIBER Europe1.3K vues

Plus de dkNET

dkNET Webinar "The National Sleep Research Resource (NSRR) - Opportunities fo... par
dkNET Webinar "The National Sleep Research Resource (NSRR) - Opportunities fo...dkNET Webinar "The National Sleep Research Resource (NSRR) - Opportunities fo...
dkNET Webinar "The National Sleep Research Resource (NSRR) - Opportunities fo...dkNET
13 vues52 diapositives
dkNET Webinar: Discover the Latest from dkNET - Biomed Resource Watch 06/02/2023 par
dkNET Webinar: Discover the Latest from dkNET - Biomed Resource Watch 06/02/2023dkNET Webinar: Discover the Latest from dkNET - Biomed Resource Watch 06/02/2023
dkNET Webinar: Discover the Latest from dkNET - Biomed Resource Watch 06/02/2023dkNET
12 vues40 diapositives
dkNET Webinar: Leveraging Computational Strategies to Identify Type 1 Diabete... par
dkNET Webinar: Leveraging Computational Strategies to Identify Type 1 Diabete...dkNET Webinar: Leveraging Computational Strategies to Identify Type 1 Diabete...
dkNET Webinar: Leveraging Computational Strategies to Identify Type 1 Diabete...dkNET
43 vues48 diapositives
dkNET Webinar: Choosing Sample Sizes for Multilevel and Longitudinal Studies ... par
dkNET Webinar: Choosing Sample Sizes for Multilevel and Longitudinal Studies ...dkNET Webinar: Choosing Sample Sizes for Multilevel and Longitudinal Studies ...
dkNET Webinar: Choosing Sample Sizes for Multilevel and Longitudinal Studies ...dkNET
26 vues51 diapositives
dkNET Webinar: : FAIR Data Curation of Antibody/B-cell and T-cell Receptor Se... par
dkNET Webinar: : FAIR Data Curation of Antibody/B-cell and T-cell Receptor Se...dkNET Webinar: : FAIR Data Curation of Antibody/B-cell and T-cell Receptor Se...
dkNET Webinar: : FAIR Data Curation of Antibody/B-cell and T-cell Receptor Se...dkNET
10 vues49 diapositives
dkNET Webinar "Visualizing Organelle and Cell Longevity In Situ" 05/20/22 par
dkNET Webinar "Visualizing Organelle and Cell Longevity In Situ" 05/20/22dkNET Webinar "Visualizing Organelle and Cell Longevity In Situ" 05/20/22
dkNET Webinar "Visualizing Organelle and Cell Longevity In Situ" 05/20/22dkNET
142 vues21 diapositives

Plus de dkNET(20)

dkNET Webinar "The National Sleep Research Resource (NSRR) - Opportunities fo... par dkNET
dkNET Webinar "The National Sleep Research Resource (NSRR) - Opportunities fo...dkNET Webinar "The National Sleep Research Resource (NSRR) - Opportunities fo...
dkNET Webinar "The National Sleep Research Resource (NSRR) - Opportunities fo...
dkNET13 vues
dkNET Webinar: Discover the Latest from dkNET - Biomed Resource Watch 06/02/2023 par dkNET
dkNET Webinar: Discover the Latest from dkNET - Biomed Resource Watch 06/02/2023dkNET Webinar: Discover the Latest from dkNET - Biomed Resource Watch 06/02/2023
dkNET Webinar: Discover the Latest from dkNET - Biomed Resource Watch 06/02/2023
dkNET12 vues
dkNET Webinar: Leveraging Computational Strategies to Identify Type 1 Diabete... par dkNET
dkNET Webinar: Leveraging Computational Strategies to Identify Type 1 Diabete...dkNET Webinar: Leveraging Computational Strategies to Identify Type 1 Diabete...
dkNET Webinar: Leveraging Computational Strategies to Identify Type 1 Diabete...
dkNET43 vues
dkNET Webinar: Choosing Sample Sizes for Multilevel and Longitudinal Studies ... par dkNET
dkNET Webinar: Choosing Sample Sizes for Multilevel and Longitudinal Studies ...dkNET Webinar: Choosing Sample Sizes for Multilevel and Longitudinal Studies ...
dkNET Webinar: Choosing Sample Sizes for Multilevel and Longitudinal Studies ...
dkNET26 vues
dkNET Webinar: : FAIR Data Curation of Antibody/B-cell and T-cell Receptor Se... par dkNET
dkNET Webinar: : FAIR Data Curation of Antibody/B-cell and T-cell Receptor Se...dkNET Webinar: : FAIR Data Curation of Antibody/B-cell and T-cell Receptor Se...
dkNET Webinar: : FAIR Data Curation of Antibody/B-cell and T-cell Receptor Se...
dkNET10 vues
dkNET Webinar "Visualizing Organelle and Cell Longevity In Situ" 05/20/22 par dkNET
dkNET Webinar "Visualizing Organelle and Cell Longevity In Situ" 05/20/22dkNET Webinar "Visualizing Organelle and Cell Longevity In Situ" 05/20/22
dkNET Webinar "Visualizing Organelle and Cell Longevity In Situ" 05/20/22
dkNET142 vues
dkNET Webinar "Integrative Artificial Intelligence Approach to Predict T1D" 0... par dkNET
dkNET Webinar "Integrative Artificial Intelligence Approach to Predict T1D" 0...dkNET Webinar "Integrative Artificial Intelligence Approach to Predict T1D" 0...
dkNET Webinar "Integrative Artificial Intelligence Approach to Predict T1D" 0...
dkNET202 vues
dkNET-HIRN Webinar "T Cell Antigen Discovery: Experimental and Computational ... par dkNET
dkNET-HIRN Webinar "T Cell Antigen Discovery: Experimental and Computational ...dkNET-HIRN Webinar "T Cell Antigen Discovery: Experimental and Computational ...
dkNET-HIRN Webinar "T Cell Antigen Discovery: Experimental and Computational ...
dkNET147 vues
dkNET Webinar "Uncovering Novel Mediators and Mechanisms of Leptin Action" 02... par dkNET
dkNET Webinar "Uncovering Novel Mediators and Mechanisms of Leptin Action" 02...dkNET Webinar "Uncovering Novel Mediators and Mechanisms of Leptin Action" 02...
dkNET Webinar "Uncovering Novel Mediators and Mechanisms of Leptin Action" 02...
dkNET159 vues
dkNET Webinar "The Stimulating Peripheral Activity To Relieve Conditions (SPA... par dkNET
dkNET Webinar "The Stimulating Peripheral Activity To Relieve Conditions (SPA...dkNET Webinar "The Stimulating Peripheral Activity To Relieve Conditions (SPA...
dkNET Webinar "The Stimulating Peripheral Activity To Relieve Conditions (SPA...
dkNET295 vues
dkNET Webinar "Pancreatlas™: Mapping the Human Pancreas in Health and Disease... par dkNET
dkNET Webinar "Pancreatlas™: Mapping the Human Pancreas in Health and Disease...dkNET Webinar "Pancreatlas™: Mapping the Human Pancreas in Health and Disease...
dkNET Webinar "Pancreatlas™: Mapping the Human Pancreas in Health and Disease...
dkNET95 vues
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For... par dkNET
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...
dkNET277 vues
dkNET Webinar "YCharOS: Antibody Characterization Through Open Science" 10/22... par dkNET
dkNET Webinar "YCharOS: Antibody Characterization Through Open Science" 10/22...dkNET Webinar "YCharOS: Antibody Characterization Through Open Science" 10/22...
dkNET Webinar "YCharOS: Antibody Characterization Through Open Science" 10/22...
dkNET286 vues
dkNET Webinar: dkNET Hypothesis Center Live Demo 09/24/2021 par dkNET
dkNET Webinar: dkNET Hypothesis Center Live Demo 09/24/2021dkNET Webinar: dkNET Hypothesis Center Live Demo 09/24/2021
dkNET Webinar: dkNET Hypothesis Center Live Demo 09/24/2021
dkNET258 vues
dkNET Webinar: Multi-Omics Data Integration for Phenotype Prediction of Type-... par dkNET
dkNET Webinar: Multi-Omics Data Integration for Phenotype Prediction of Type-...dkNET Webinar: Multi-Omics Data Integration for Phenotype Prediction of Type-...
dkNET Webinar: Multi-Omics Data Integration for Phenotype Prediction of Type-...
dkNET261 vues
dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica... par dkNET
dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...
dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...
dkNET289 vues
dkNET Webinar - nPOD Nanotomy: Large-Scale Electron Microscopy Database For H... par dkNET
dkNET Webinar - nPOD Nanotomy: Large-Scale Electron Microscopy Database For H...dkNET Webinar - nPOD Nanotomy: Large-Scale Electron Microscopy Database For H...
dkNET Webinar - nPOD Nanotomy: Large-Scale Electron Microscopy Database For H...
dkNET291 vues
dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021 par dkNET
dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021
dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021
dkNET313 vues
dkNET Webinar - Vivli: A Global Clinical Trials Data Sharing Platform 12/11/2020 par dkNET
dkNET Webinar - Vivli: A Global Clinical Trials Data Sharing Platform 12/11/2020dkNET Webinar - Vivli: A Global Clinical Trials Data Sharing Platform 12/11/2020
dkNET Webinar - Vivli: A Global Clinical Trials Data Sharing Platform 12/11/2020
dkNET338 vues
dkNET Webinar - FAIR Data Require Better Metadata: The Case for CEDAR 11/13/2020 par dkNET
dkNET Webinar - FAIR Data Require Better Metadata: The Case for CEDAR 11/13/2020dkNET Webinar - FAIR Data Require Better Metadata: The Case for CEDAR 11/13/2020
dkNET Webinar - FAIR Data Require Better Metadata: The Case for CEDAR 11/13/2020
dkNET263 vues

Dernier

Open Access Publishing in Astrophysics par
Open Access Publishing in AstrophysicsOpen Access Publishing in Astrophysics
Open Access Publishing in AstrophysicsPeter Coles
1.2K vues26 diapositives
Experimental animal Guinea pigs.pptx par
Experimental animal Guinea pigs.pptxExperimental animal Guinea pigs.pptx
Experimental animal Guinea pigs.pptxMansee Arya
35 vues16 diapositives
plasmids par
plasmidsplasmids
plasmidsscribddarkened352
13 vues2 diapositives
Evaluation and Standardization of the Marketed Polyherbal drug Patanjali Divy... par
Evaluation and Standardization of the Marketed Polyherbal drug Patanjali Divy...Evaluation and Standardization of the Marketed Polyherbal drug Patanjali Divy...
Evaluation and Standardization of the Marketed Polyherbal drug Patanjali Divy...Anmol Vishnu Gupta
7 vues10 diapositives
Discovery of therapeutic agents targeting PKLR for NAFLD using drug repositio... par
Discovery of therapeutic agents targeting PKLR for NAFLD using drug repositio...Discovery of therapeutic agents targeting PKLR for NAFLD using drug repositio...
Discovery of therapeutic agents targeting PKLR for NAFLD using drug repositio...Trustlife
127 vues17 diapositives
How to be(come) a successful PhD student par
How to be(come) a successful PhD studentHow to be(come) a successful PhD student
How to be(come) a successful PhD studentTom Mens
524 vues62 diapositives

Dernier(20)

Open Access Publishing in Astrophysics par Peter Coles
Open Access Publishing in AstrophysicsOpen Access Publishing in Astrophysics
Open Access Publishing in Astrophysics
Peter Coles1.2K vues
Experimental animal Guinea pigs.pptx par Mansee Arya
Experimental animal Guinea pigs.pptxExperimental animal Guinea pigs.pptx
Experimental animal Guinea pigs.pptx
Mansee Arya35 vues
Evaluation and Standardization of the Marketed Polyherbal drug Patanjali Divy... par Anmol Vishnu Gupta
Evaluation and Standardization of the Marketed Polyherbal drug Patanjali Divy...Evaluation and Standardization of the Marketed Polyherbal drug Patanjali Divy...
Evaluation and Standardization of the Marketed Polyherbal drug Patanjali Divy...
Discovery of therapeutic agents targeting PKLR for NAFLD using drug repositio... par Trustlife
Discovery of therapeutic agents targeting PKLR for NAFLD using drug repositio...Discovery of therapeutic agents targeting PKLR for NAFLD using drug repositio...
Discovery of therapeutic agents targeting PKLR for NAFLD using drug repositio...
Trustlife127 vues
How to be(come) a successful PhD student par Tom Mens
How to be(come) a successful PhD studentHow to be(come) a successful PhD student
How to be(come) a successful PhD student
Tom Mens524 vues
Light Pollution for LVIS students par CWBarthlmew
Light Pollution for LVIS studentsLight Pollution for LVIS students
Light Pollution for LVIS students
CWBarthlmew9 vues
별헤는 사람들 2023년 12월호 전명원 교수 자료 par sciencepeople
별헤는 사람들 2023년 12월호 전명원 교수 자료별헤는 사람들 2023년 12월호 전명원 교수 자료
별헤는 사람들 2023년 12월호 전명원 교수 자료
sciencepeople58 vues
Ellagic Acid and Its Metabolites as Potent and Selective Allosteric Inhibitor... par Trustlife
Ellagic Acid and Its Metabolites as Potent and Selective Allosteric Inhibitor...Ellagic Acid and Its Metabolites as Potent and Selective Allosteric Inhibitor...
Ellagic Acid and Its Metabolites as Potent and Selective Allosteric Inhibitor...
Trustlife100 vues
A giant thin stellar stream in the Coma Galaxy Cluster par Sérgio Sacani
A giant thin stellar stream in the Coma Galaxy ClusterA giant thin stellar stream in the Coma Galaxy Cluster
A giant thin stellar stream in the Coma Galaxy Cluster
Sérgio Sacani17 vues
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ... par ILRI
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...
ILRI7 vues
Exploring the nature and synchronicity of early cluster formation in the Larg... par Sérgio Sacani
Exploring the nature and synchronicity of early cluster formation in the Larg...Exploring the nature and synchronicity of early cluster formation in the Larg...
Exploring the nature and synchronicity of early cluster formation in the Larg...
Sérgio Sacani910 vues
Applications of Large Language Models in Materials Discovery and Design par Anubhav Jain
Applications of Large Language Models in Materials Discovery and DesignApplications of Large Language Models in Materials Discovery and Design
Applications of Large Language Models in Materials Discovery and Design
Anubhav Jain13 vues
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance... par InsideScientific
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...

dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sharing Mandates" on September 22, 2023

  • 1. An NIDDK Resource dknet.org dkNET Office Hours: New Data Mandates Jeffrey S. Grethe PI, NIDDK Information Network Co-Director, FAIR Data Informatics Laboratory, UCSD
  • 2. An NIDDK Resource dknet.org New NIH Policy Effective January 25, 2023 • US National Institutes of Health new data sharing policy goes into effect • All data must be managed; most data should be shared • “As open as possible; as closed as necessary” • Mandates the inclusion, approval and execution of a Data Management and Sharing Plan • (DMP + S = DMS) https://grants.nih.gov/grants/guide/notice-files/NOT-OD-21-013.html
  • 3. An NIDDK Resource dknet.org The Details... The effective date of the DMS Policy is January 25, 2023, including for: • Competing grant applications that are submitted to NIH for the January 25, 2023 and subsequent receipt dates; • Proposals for contracts that are submitted to NIH on or after January 25, 2023; • NIH Intramural Research Projects conducted on or after January 25, 2023; and • Other funding agreements (e.g., Other Transactions) that are executed on or after January 25, 2023, unless otherwise stipulated by NIH.
  • 4. An NIDDK Resource dknet.org More Details... What • Defines Scientific Data as: “The recorded factual material commonly accepted in the scientific community as of sufficient quality to validate and replicate research findings, regardless of whether the data are used to support scholarly publications. Scientific data do not include laboratory notebooks, preliminary analyses, completed case report forms, drafts of scientific papers, plans for future research, peer reviews, communications with colleagues, or physical objects, such as laboratory specimens.” • Even those scientific data not used to support a publication are considered scientific data and within the final DMS Policy’s scope When • “[s]hared scientific data should be made accessible as soon as possible, and no later than the time of an associated publication, or the end of the award/support period, whichever comes first.” • Researchers may share data underlying publication during the period of award but may share other data that have not yet led to a publication by the end of the award period. Where • Encourages the use of established repositories to the extent possible.
  • 5. An NIDDK Resource dknet.org More Details... How • NIH encourages data management and data sharing practices consistent with the FAIR data principles Funding • Fees for long-term data preservation and sharing are allowable, but funds for these activities must be spent during the performance period, even for scientific data and metadata preserved and shared beyond the award period. Repercussions • After the end of the funding period, non-compliance with the NIH ICO-approved Plan may be taken into account by NIH for future funding decisions for the recipient institution The DMS Policy applies to all research, funded or conducted in whole or in part by NIH, that results in the generation of scientific data. This includes research funded or conducted by extramural grants, contracts, Intramural Research Projects, or other funding agreements regardless of NIH funding level or funding mechanism.
  • 6. An NIDDK Resource dknet.org Data as a Research Product Sound, reproducible scholarship rests upon a foundation of robust, accessible data. For this to be so in practice as well as theory, data must be accorded due importance in the practice of scholarship and in the enduring scholarly record…” https://www.force11.org/group/joint-declaration-dat a-citation-principles-final Joint Declaration of Data Citation Principles 1. Data should be considered legitimate, citable products of research. Data citations should be accorded the same importance in the scholarly record as citations of other research objects, such as publications. 2. Data citations should facilitate giving scholarly credit and normative and legal attribution to all contributors to the data, recognizing that a single style or mechanism of attribution may not be applicable to all data. 3. In scholarly literature, whenever and wherever a claim relies upon data, the corresponding data should be cited.
  • 7. A data citation looks like a regular citation DOI Full citation DOI:10.34945/F5XW2P
  • 8. Proper data citation = data citation metrics https://datasetsearch.research.google.com/
  • 9. An NIDDK Resource dknet.org Good data management is the gateway to data sharing Borghi J, Abrams S, Lowenberg D, Simms S, Chodacki J (2018) Support Your Data: A Research Data Management Guide for Researchers. Research Ideas and Outcomes 4: e26439. https://doi.org/10.3897/rio.4.e26439
  • 10. An NIDDK Resource dknet.org Changing the culture around data management and sharing • Me • Answer to the underpowered study • Data sharing and good data management are closely aligned • Compliance with mandates • Credit for the totality of my work • Science and Society • Transparency • Reproducibility • Reduced waste • Driving discovery • Future me • One most likely to benefit from good data management and sharing through stable archives • No one ever regretted annotating too much • My colleagues (and PI) • Easy to engage with colleagues over well annotated data and associated code • What happens when the post doc leaves? April 2021; National Academies of Science Workshop
  • 11. An NIDDK Resource dknet.org But how do I do that? ● “NIH encourages data management and data sharing practices consistent with the FAIR data principles.” ● “NIH strongly encourages the use of established repositories to the extent possible for preserving and sharing scientific data”
  • 12. An NIDDK Resource dknet.org The FAIR Guiding Principles for scientific data management and stewardship High level principles to make data: • Findable • Accessible • Interoperable • Re-usable Mark D. Wilkinson et al. The FAIR Guiding Principles for scientific data management and stewardship, Scientific Data (2016). DOI: 10.1038/sdata.2016.18 …for humans and machines
  • 13. An NIDDK Resource dknet.org Findable ● F1. (meta)data are assigned a globally unique and persistent identifier ● F2. data are described with rich metadata ● F3. metadata clearly and explicitly include the identifier of the data it describes ● F4. (meta)data are registered or indexed in a searchable resource Accessible ● A1. (meta)data are retrievable by their identifier using a standardized communications protocol ● A1.1 the protocol is open, free, and universally implementable ● A1.2 the protocol allows for an authentication and authorization procedure, where necessary ● A2. metadata are accessible, even when the data are no longer available ● I1. (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation. ● I2. (meta)data use vocabularies that follow FAIR principles ● I3. (meta)data include qualified references to other (meta)data Re-usable ● R1. meta(data) are richly described with a plurality of accurate and relevant attributes ● R1.1. (meta)data are released with a clear and accessible data usage license ● R1.2. (meta)data are associated with detailed provenance ● R1.3. (meta)data meet domain-relevant community standards Interoperable G o o d M e t a d a t a DOI for your data U s e o f A p p r o p r i a t e R e p o s i t o r y
  • 14. An NIDDK Resource dknet.org Resource sharing plan → Data Management and Sharing Plan
  • 15. An NIDDK Resource dknet.org Required Elements of the NIH Plan Data Type Related Tools, Software and/or Code Standards Data Preservation, Access, and Associated Timelines Access, Distribution, or Reuse Considerations Oversight of Data Management and Sharing How compliance with the Plan will be monitored and managed, frequency of oversight, and by whom (e.g., titles, roles). Adapted from Ghosh et al. 2022 Not a part of review
  • 16. An NIDDK Resource dknet.org Data Type • Type and amount/size • Modality • Level of aggregation • Degree of data processing • Which data and why • A brief listing of the metadata • Other relevant data, and • Associated documentation to facilitate interpretation of the scientific data. Example of some metadata needed • Acquisition: Instrument, Protocol • Biosample: Tissue, Cell, Organoid • Participant/Donor • Assay • Analytics Voltage traces Spike times Fluorescence traces Optophysiology MRI Microscopy Adapted from Ghosh et al. 2022
  • 17. An NIDDK Resource dknet.org Standards • Data formats • CSV/TSV, PNG, Tiff • NWB, NIfTI, OME.Zarr, SWC • Data dictionaries • Data identifiers • Sub-01, Ses-02, • Definitions • Ontologies • Unique identifiers • UUID, RRID • Other data documentation • Quality assurance, control xkcd: 927 Adapted from Ghosh et al. 2022
  • 18. An NIDDK Resource dknet.org What standards should I use? ● Repositories often enforce specific standards for metadata and data ● Thinking about where your data will end up before you start your experiments will help you determine how to collect, annotate and organize your (meta)data ● Fairsharing.org maintains a database of standards and policies across biomedicine
  • 19. An NIDDK Resource dknet.org Related Tools, Software and/or Code • Any specialized tools needed to access or manipulate shared scientific data • How needed tools can be accessed • Whether such tools are likely to remain available for as long as the scientific data remain available. Adapted from Ghosh et al. 2022 This should include the following information, if applicable: ● Which statistical package or program was used to manipulate the data, along with the version of the software that was used and any packages, scripts, or settings that were used or developed during the course of the study, as well as how users can access the software ● Whether there were any custom workflows or pipelines developed as part of the study necessary to analyze or process the data, and how ● Whether there were any executable programs or macros written as part of the study necessary to analyze or process the data, as well as how users can access the code
  • 20. An NIDDK Resource dknet.org Data Preservation, Access, and Associated Timelines • Name of the repository(ies) • How data will be findable and identifiable • Timeline Adapted from Ghosh et al. 2022
  • 21. An NIDDK Resource dknet.org Lesson: Think about where your data will end up in the beginning Best practice: Submit your data to repository specialized for your type of data or your domain ..if there isn’t one, then there are also general purpose repositories available
  • 22. dknet.org An NIDDK Resource Where Can I Deposit My Data? • List of DK relevant repositories, recommended by NLM and various journals • Created in conjunction with NIDDK • Coming soon: FAIR data wizard ● FAIR Standards ● Clinical Repositories Information ● Data maintenance ● Data size limit and cost ● Dynamic database https://dknet.org/about/Suggested-data-repositories-niddk
  • 23. An NIDDK Resource dknet.org Access, Distribution, or Reuse Considerations • Informed consent • Privacy and confidentiality protections • Whether access to scientific data derived from humans will be controlled • Any restrictions imposed by federal, Tribal, or state laws, regulations, or policies, or existing or anticipated agreements • Any other considerations that may limit the extent of data sharing. Adapted from Ghosh et al. 2022
  • 24. An NIDDK Resource dknet.org Data Management and Sharing Plan ● Creating a good data management and sharing plan allows you to: ○ Comply with NIH mandates ○ Ensure that you allocate enough resources for preparing and sharing your data ○ Ensure that you collect your data in a FAIR manner ○ Easily share data with yourself, future you, your colleagues and the scientific community ● dkNET provides links to resources that can help https://dknet.org/rin/rigor-reproducibility-about
  • 25. An NIDDK Resource dknet.org • Helps plan ahead • Start alongside proposal, well before data collection • Human and technological resources • Cost: $$, Effort • Cost • Training/re-training • Time • Resources It benefits your scientific work! Data Management and Sharing Plan • Benefit • Reduction in $$, effort, less surprises • Organizing data and metadata reduces transition effort • Open data opens new avenues and may reduce collection cost • Using standards lowers development cost Adapted from Ghosh et al. 2022
  • 26. An NIDDK Resource dknet.org
  • 27. An NIDDK Resource dknet.org FAIR Partnership Researchers Repositories and Registries Indexers Aggregators • Good data management • Rich metadata • Prepare to share • Open formats • Adopt/align to standards • Submit to repository • Persistent identifier • Machine based access • Clear license • Support for open, domain specific standards • Machine readable metadata • Future friendly formats • Persistent metadata • Bidirectional links • Data citation • Index • Effective Search • Persistent metadata dkNET
  • 28. An NIDDK Resource dknet.org Provide Easy Access to the DMSP Resources
  • 29. An NIDDK Resource dknet.org dkNET Collection of Information
  • 30. An NIDDK Resource dknet.org Repository Wizard • Rate 80 repositories against FAIR, Open, Trustworthy, Citable principles and other criteria • Wizard guides a user through criteria that may be important for selection Coming Soon
  • 31. dknet.org An NIDDK Resource Get involved in the dkNET Community dkNET Homepage: dkNET.org Check out dkNET Resour Join Webinar Follow us @dknet_info Check Out or Post News and Funding Opportunities Blog, Calendar Sign up email list
  • 32. An NIDDK Resource dknet.org