SlideShare a Scribd company logo
1 of 51
Avoiding the Tower
of Babel
The Role of Data Description Standards in
Biomedical Imaging
Chris Gorgolewski
Stanford University
@ChrisFiloG
The Big Data to Knowledge (BD2K)
Guide to the Fundamentals of Data Science
Chris Gorgolewski
• Obtained Ph.D. degree from University of Edinburgh, 2013
• Co-director of the Stanford Center for Reproducible Neuroscience
• Research involves building tools to enable researchers to efficiently share
their data, run reproducible analyses & link the results with previously reported
findings.
• Promotes data sharing through initiatives such as data papers & NeuroVault.
• Core developer of neuroimaging data processing framework Nipype, fMRI
preprocessing tool FMRIPREP, & quality control tool MRIQC
• Coordinator of the Brain Imaging Data Structure (BIDS) standard.
• http://chrisgorgolewski.org
What are standards useful for?
Standardy to forma jezyka
Standardy to sposob komunikacji.
Jezeli nie zanalibysmy wspolnego jezyka
nie bylibysmy sie wstanie komunikowac
i wspolnie budowac wielkich rzeczy.
Standards are a language
Standards are a way of communicating.
Without knowing the language we speak
we would not be able to communicate
and build great things.
Esperanto!
• Artificial language
• Created in 1870
• Optimized for ease of learning
Computer ports
Computer ports - ambiguity
USB C vs
Thunderbolt 3 vs
DisplayPort
Electricity
Digital communication
The Internet is possible
because of standards:
• DNS
• IP
• HTTP
• SMTP
Application Programmatic Interfaces (APIs)
• Building blocks of
Web 2.0
• Enable rapid
prototyping and
flexible scaling
How standards are developed in
the industry?
Institute of Electrical and Electronics
Engineers (IEEE)
802.11 aka Wi-Fi
Institute of Electrical and Electronics
Engineers (IEEE)
World Wide Web Consortium (W3C)
World Wide Web Consortium (W3C)
National Electrical Manufacturers Association
(NEMA)
Digital Imaging and
Communications in
Medicine (DICOM)
How standards are developed in
academia?
How to look for standards? FAIRSharing.org
Standards in academia
• Bottom up
• Often developed for a specific project (CIFTI, XCEDE, OpenfMRI)
• Technically simple
• Competitive (NIFTI vs. MINC)
• When widely adopted can be of great value (NIFTI)
Use case:
Brain Imaging Data Structure
Goals
• Enable reuse of research neuroimaging data
• Shared within or between labs
• Enable automatic analysis of datasets
• No need to manually input scanning parameters
• Make automatic consistency validation of datasets
possible
• In context of public data sharing
Consumers
• Lab PIs
• To reduce errors in data handling
• To enable reuse of data within your own lab
• Pipeline developers
• To enable automatic data processing
• Databases and repositories
• To enable automatic data submission
Meet Prof. Smith
POSTER NUMBER:
1854
BIDS.NEUROIMAGING.I
O
Meet Mike
POSTER NUMBER:
1854
BIDS.NEUROIMAGING.I
O
Getting lost in your data
POSTER NUMBER:
1854
BIDS.NEUROIMAGING.I
O
Keys to success
• Involve broad scientific community
• Step outside of the bubble of your own lab
• Share the credit
• Let it be a truly collaborative – not just a product of your lab
• Focus on the use cases
• Make public call for comments
• Follow 80/20 rule
• Provide tools (validator)
Community! Community! Community!
Ways to get more people involved
• Ease the barrier to provide feedback
• Fully open mailing list
• Online Google Doc open for anyone to comment (even anonymously)
• Give credit
• List contributors by name
• Induce the feeling of shared ownership
• Acknowledge all types of contributions
• Organize in person meetings
• Dedicated workshops or along conferences
• Be persistent!
• Most people will be too busy to help you out
Folder organization
POSTER NUMBER:
1854
BIDS.NEUROIMAGING.I
O
Folder organization
POSTER NUMBER:
1854
BIDS.NEUROIMAGING.I
O
Folder organization
participant_id age sex
sub-001 34 M
sub-002 12 F
sub-003 33 F
POSTER NUMBER:
1854
BIDS.NEUROIMAGING.I
O
Folder organization
NIfT
I
POSTER NUMBER:
1854
BIDS.NEUROIMAGING.I
O
Folder organization
{
"RepetitionTime": 3.0,
"EchoTime": 0.03,
"FlipAngle": 78,
"SliceTiming": [0.0, 0.2, 0.4, …],
"MultibandAccellerationFactor": 4,
"PhaseEncodingDirection": "j-"
}
POSTER NUMBER:
1854
BIDS.NEUROIMAGING.I
O
Tools
APIs:
• Querying BIDS datasets programmatically
Converters:
• From scanner to data
Validators:
• Verifying dataset consistency
The Validator
incf.github.io/bids-validator/
POSTER NUMBER:
1854
BIDS.NEUROIMAGING.I
O
Decision making process for extensions
Proposal
• Initial draft written by experts
• Sent out for public comments
Refinement
• Creation of example datasets
• Implementing new functionality in the validator
Merge
• Striving for consensus
• Backwards compatibility
Extension examples
• Electroencephalography
• Positron Emission Tomography
• Intracranial Electroencephalography
• Multi spectral structural imaging
• Models
• Derived data
• Spetroscopy
Cyril Pernet
Melanie Ganz
Dora Hermes
Tal Yarkoni
Guiomar Niso
Use cases enabled by BIDS
BIDS Apps
Gorgolewski et al. 2017
Simple parallelization scheme – map/reduce
a free online platform for sharing and
analysis of neuroimaging data
Science as a Service architecture
OpenNeuro.org - Poster #1677 45
OpenNeuro.org - Poster #1677 46
Demo
OpenNeuro.org - Poster #1677 47
BIDS Contributors
• Tibor Auer 💬📖💡🔧📢
• Sylvain Baillet 📖🔍
• Elizabeth Bock 📖💡
• Eric Bridgeford 📖🔧
• Teon L. Brooks 📖💻
• Suyash Bhogawar 📖💡⚠️🔧💬
• Vince D. Calhoun 📖
• Alexander L. Cohen 🐛💻📖💬
• R. Cameron Craddock 📖📢
• Samir Das 📖
• Alejandro de la Vega 🐛💻⚠️
• Eugene P. Duff 📖
• Elizabeth DuPre 📖💡
• Eric A. Earl 🤔
• Anders Eklund 📖📢💻
• Guillaume Flandin 📖💻
• Satrajit S. Ghosh 📖💻
• Tristan Glatard 📖💻
• Mathias Goncalves 💻🔧📢
• Alexandre Gramfort 📖💡
• Yaroslav O. Halchenko 📖📢
• Thomas E. Nichols 📖
• Guiomar Niso 📖💡
• Robert Oostenveld 📖
• Dianne Patterson 📖
• John Pellman 📖
• Cyril Pernet 💬 📖 💡📋
• Dmitry Petrov 📖💻
• Russell A. Poldrack 📖🔍📢
• Jean-Baptiste Poline 📖📢🤔🎨
• Ariel Rokem 📖
• Gunnar Schaefer 📖
• Jan-Mathijs Schoffelen 📖
• Vanessa Sochat 📖
• Francois Tadel 📖🔌💡
• William Triplett 📖
• Jessica A. Turner 📖
• Joseph Wexler 📖💡
• Gaël Varoquaux 📖
• Daniel A. Handwerker 📖
• Michael Hanke 📖🤔🔧🐛📢
• Michael P. Harms 📖⚠️🔧
• Richard N. Henson 📖
• International Neuroinformatics Coordinating Facility 💵📋
• Mainak Jas 📖💻
• David Keator 📖
• Gregory Kiar 📖💻🎨🔧
• Laura and John Arnold Foundation 💵
• Xiangrui Li 📖💻
• Vladimir Litvak 📖
• Dan Lurie 🤔📖🔧🔌💻💬
• Camille Maumet 📖
• Christopher J. Markiewicz 💬📖💻
• Jeremy Moreau 📖💡
• Zachary Michael 📖
• Michael P. Milham 💡🔍
• National Institute of Mental Health 💵
• B. Nolan Nichols 📖
Resources
• IEEE Process: https://standards.ieee.org/develop/process.html
• W3C Process: https://www.w3.org/2017/Process-20170301/
• History of DICOM: https://link.springer.com/chapter/10.1007/978-3-
540-74571-6_4
• BIDS: https://www.nature.com/articles/sdata201644
• BIDS Apps:
http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcb
i.1005209
• Search for standards: https://fairsharing.org/
bids.neuroimaging.io
(specification, examples, discussion forum)
POSTER NUMBER:
1854
BIDS.NEUROIMAGING.I
O

More Related Content

What's hot

Report of the second FAIRDOM foundry
Report of the second FAIRDOM foundryReport of the second FAIRDOM foundry
Report of the second FAIRDOM foundryFAIRDOM
 
Lawrence-f1000-publishing with data-nfdp13
Lawrence-f1000-publishing with data-nfdp13Lawrence-f1000-publishing with data-nfdp13
Lawrence-f1000-publishing with data-nfdp13DataDryad
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 
Brain Imaging Data Structure and Center for Reproducible Neuroscince
Brain Imaging Data Structure and Center for Reproducible NeuroscinceBrain Imaging Data Structure and Center for Reproducible Neuroscince
Brain Imaging Data Structure and Center for Reproducible NeuroscinceKrzysztof Gorgolewski
 
2016 07 12_purdue_bigdatainomics_seandavis
2016 07 12_purdue_bigdatainomics_seandavis2016 07 12_purdue_bigdatainomics_seandavis
2016 07 12_purdue_bigdatainomics_seandavisSean Davis
 
Aug2014 giab intro slides
Aug2014 giab intro slidesAug2014 giab intro slides
Aug2014 giab intro slidesGenomeInABottle
 
Reflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic careerReflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic careerCarole Goble
 
Data citation metrics : best practice to enable new metrics for research data
Data citation metrics : best practice to enable new metrics for research dataData citation metrics : best practice to enable new metrics for research data
Data citation metrics : best practice to enable new metrics for research dataLe_GFII
 
Why should researchers care about data curation?
Why should researchers care about data curation?Why should researchers care about data curation?
Why should researchers care about data curation?Varsha Khodiyar
 
Being FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data ScienceBeing FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data ScienceCarole Goble
 
Making your data good enough for sharing.
Making your data good enough for sharing.Making your data good enough for sharing.
Making your data good enough for sharing.FAIRDOM
 
From Queries to Algorithms to Advanced ML: 3 Pharmaceutical Graph Use Cases
From Queries to Algorithms to Advanced ML: 3 Pharmaceutical Graph Use CasesFrom Queries to Algorithms to Advanced ML: 3 Pharmaceutical Graph Use Cases
From Queries to Algorithms to Advanced ML: 3 Pharmaceutical Graph Use CasesNeo4j
 
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)Carole Goble
 
Reproducibility: 10 Simple Rules
Reproducibility: 10 Simple RulesReproducibility: 10 Simple Rules
Reproducibility: 10 Simple RulesAnnika Eriksson
 
Data101 pmcb retreat_09-20-13_final
Data101 pmcb retreat_09-20-13_finalData101 pmcb retreat_09-20-13_final
Data101 pmcb retreat_09-20-13_finalJackie Wirz, PhD
 
infrastructure for communicating data-intensive science
infrastructure for communicating data-intensive scienceinfrastructure for communicating data-intensive science
infrastructure for communicating data-intensive scienceBrian Bot
 
Reproducible research: practice
Reproducible research: practiceReproducible research: practice
Reproducible research: practiceC. Tobin Magle
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better ResearchCarole Goble
 

What's hot (20)

Report of the second FAIRDOM foundry
Report of the second FAIRDOM foundryReport of the second FAIRDOM foundry
Report of the second FAIRDOM foundry
 
Lawrence-f1000-publishing with data-nfdp13
Lawrence-f1000-publishing with data-nfdp13Lawrence-f1000-publishing with data-nfdp13
Lawrence-f1000-publishing with data-nfdp13
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
Brain Imaging Data Structure and Center for Reproducible Neuroscince
Brain Imaging Data Structure and Center for Reproducible NeuroscinceBrain Imaging Data Structure and Center for Reproducible Neuroscince
Brain Imaging Data Structure and Center for Reproducible Neuroscince
 
2016 07 12_purdue_bigdatainomics_seandavis
2016 07 12_purdue_bigdatainomics_seandavis2016 07 12_purdue_bigdatainomics_seandavis
2016 07 12_purdue_bigdatainomics_seandavis
 
Aug2014 giab intro slides
Aug2014 giab intro slidesAug2014 giab intro slides
Aug2014 giab intro slides
 
Reflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic careerReflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic career
 
Data citation metrics : best practice to enable new metrics for research data
Data citation metrics : best practice to enable new metrics for research dataData citation metrics : best practice to enable new metrics for research data
Data citation metrics : best practice to enable new metrics for research data
 
Why should researchers care about data curation?
Why should researchers care about data curation?Why should researchers care about data curation?
Why should researchers care about data curation?
 
Being FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data ScienceBeing FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data Science
 
Making your data good enough for sharing.
Making your data good enough for sharing.Making your data good enough for sharing.
Making your data good enough for sharing.
 
From Queries to Algorithms to Advanced ML: 3 Pharmaceutical Graph Use Cases
From Queries to Algorithms to Advanced ML: 3 Pharmaceutical Graph Use CasesFrom Queries to Algorithms to Advanced ML: 3 Pharmaceutical Graph Use Cases
From Queries to Algorithms to Advanced ML: 3 Pharmaceutical Graph Use Cases
 
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
 
Reproducibility: 10 Simple Rules
Reproducibility: 10 Simple RulesReproducibility: 10 Simple Rules
Reproducibility: 10 Simple Rules
 
Clinical Anatomy 9566
Clinical Anatomy 9566Clinical Anatomy 9566
Clinical Anatomy 9566
 
Data101 pmcb retreat_09-20-13_final
Data101 pmcb retreat_09-20-13_finalData101 pmcb retreat_09-20-13_final
Data101 pmcb retreat_09-20-13_final
 
Ngsp
NgspNgsp
Ngsp
 
infrastructure for communicating data-intensive science
infrastructure for communicating data-intensive scienceinfrastructure for communicating data-intensive science
infrastructure for communicating data-intensive science
 
Reproducible research: practice
Reproducible research: practiceReproducible research: practice
Reproducible research: practice
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better Research
 

Similar to Avoiding the tower of babel - The Role of Data Description Standards in Biomedical Imaging

Guy avoiding-dat apocalypse
Guy avoiding-dat apocalypseGuy avoiding-dat apocalypse
Guy avoiding-dat apocalypseENUG
 
RDA Presentation to the International Federation of Library Associations
RDA Presentation to the International Federation of Library AssociationsRDA Presentation to the International Federation of Library Associations
RDA Presentation to the International Federation of Library AssociationsResearch Data Alliance
 
Getting Started in Data Science
Getting Started in Data ScienceGetting Started in Data Science
Getting Started in Data ScienceThinkful
 
2016 Ocean Sciences Meeting tutorial
2016 Ocean Sciences Meeting tutorial2016 Ocean Sciences Meeting tutorial
2016 Ocean Sciences Meeting tutorialJosh Young
 
Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Thinkful
 
Share and Reuse: how data sharing can take your research to the next level
Share and Reuse: how data sharing can take your research to the next levelShare and Reuse: how data sharing can take your research to the next level
Share and Reuse: how data sharing can take your research to the next levelKrzysztof Gorgolewski
 
Unpacking persistent identifiers for research
Unpacking persistent identifiers for researchUnpacking persistent identifiers for research
Unpacking persistent identifiers for researchARDC
 
The Well Connected Facility
The Well Connected FacilityThe Well Connected Facility
The Well Connected FacilityRyan Duggan
 
Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Paul Groth
 
Five selfish reasons to work reproducibly
Five selfish reasons to work reproduciblyFive selfish reasons to work reproducibly
Five selfish reasons to work reproduciblyFlorian Markowetz
 
Fsci 2018 wednesday1_august_am6
Fsci 2018 wednesday1_august_am6Fsci 2018 wednesday1_august_am6
Fsci 2018 wednesday1_august_am6ARDC
 
Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing dataSarah Jones
 
datamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptxdatamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptxHASHEMHASH
 
Digital Tools, Trends and Methodologies in the Humanities and Social Sciences
Digital Tools, Trends and Methodologies in the Humanities and Social SciencesDigital Tools, Trends and Methodologies in the Humanities and Social Sciences
Digital Tools, Trends and Methodologies in the Humanities and Social SciencesShawn Day
 
Crowdsourced biological science - edinburgh
Crowdsourced biological science - edinburghCrowdsourced biological science - edinburgh
Crowdsourced biological science - edinburghErinma Ochu
 
OpenAIRE: eInfrastructure for Open Science
OpenAIRE: eInfrastructure for Open ScienceOpenAIRE: eInfrastructure for Open Science
OpenAIRE: eInfrastructure for Open ScienceOpenAIRE
 
Reproducible Research with R, The Tidyverse, Notebooks, and Spark
Reproducible Research with R, The Tidyverse, Notebooks, and SparkReproducible Research with R, The Tidyverse, Notebooks, and Spark
Reproducible Research with R, The Tidyverse, Notebooks, and SparkAdaryl "Bob" Wakefield, MBA
 

Similar to Avoiding the tower of babel - The Role of Data Description Standards in Biomedical Imaging (20)

Better Data for a Better World
Better Data for a Better WorldBetter Data for a Better World
Better Data for a Better World
 
Guy avoiding-dat apocalypse
Guy avoiding-dat apocalypseGuy avoiding-dat apocalypse
Guy avoiding-dat apocalypse
 
RDA Presentation to the International Federation of Library Associations
RDA Presentation to the International Federation of Library AssociationsRDA Presentation to the International Federation of Library Associations
RDA Presentation to the International Federation of Library Associations
 
Getting Started in Data Science
Getting Started in Data ScienceGetting Started in Data Science
Getting Started in Data Science
 
2016 Ocean Sciences Meeting tutorial
2016 Ocean Sciences Meeting tutorial2016 Ocean Sciences Meeting tutorial
2016 Ocean Sciences Meeting tutorial
 
Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)
 
Share and Reuse: how data sharing can take your research to the next level
Share and Reuse: how data sharing can take your research to the next levelShare and Reuse: how data sharing can take your research to the next level
Share and Reuse: how data sharing can take your research to the next level
 
Unpacking persistent identifiers for research
Unpacking persistent identifiers for researchUnpacking persistent identifiers for research
Unpacking persistent identifiers for research
 
The Well Connected Facility
The Well Connected FacilityThe Well Connected Facility
The Well Connected Facility
 
Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.
 
Five selfish reasons to work reproducibly
Five selfish reasons to work reproduciblyFive selfish reasons to work reproducibly
Five selfish reasons to work reproducibly
 
Fsci 2018 wednesday1_august_am6
Fsci 2018 wednesday1_august_am6Fsci 2018 wednesday1_august_am6
Fsci 2018 wednesday1_august_am6
 
Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing data
 
NISO-Altmetrics-NE-ACRL-ScholComIG-Nov2013
NISO-Altmetrics-NE-ACRL-ScholComIG-Nov2013NISO-Altmetrics-NE-ACRL-ScholComIG-Nov2013
NISO-Altmetrics-NE-ACRL-ScholComIG-Nov2013
 
datamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptxdatamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptx
 
Digital Tools, Trends and Methodologies in the Humanities and Social Sciences
Digital Tools, Trends and Methodologies in the Humanities and Social SciencesDigital Tools, Trends and Methodologies in the Humanities and Social Sciences
Digital Tools, Trends and Methodologies in the Humanities and Social Sciences
 
Strasser "Effective data management and its role in open research"
Strasser "Effective data management and its role in open research"Strasser "Effective data management and its role in open research"
Strasser "Effective data management and its role in open research"
 
Crowdsourced biological science - edinburgh
Crowdsourced biological science - edinburghCrowdsourced biological science - edinburgh
Crowdsourced biological science - edinburgh
 
OpenAIRE: eInfrastructure for Open Science
OpenAIRE: eInfrastructure for Open ScienceOpenAIRE: eInfrastructure for Open Science
OpenAIRE: eInfrastructure for Open Science
 
Reproducible Research with R, The Tidyverse, Notebooks, and Spark
Reproducible Research with R, The Tidyverse, Notebooks, and SparkReproducible Research with R, The Tidyverse, Notebooks, and Spark
Reproducible Research with R, The Tidyverse, Notebooks, and Spark
 

More from Krzysztof Gorgolewski

Study pre-registration: Benefits and considerations
Study pre-registration: Benefits and considerationsStudy pre-registration: Benefits and considerations
Study pre-registration: Benefits and considerationsKrzysztof Gorgolewski
 
Towards open and reproducible neuroscience in the age of big data
Towards open and  reproducible neuroscience in the age of big dataTowards open and  reproducible neuroscience in the age of big data
Towards open and reproducible neuroscience in the age of big dataKrzysztof Gorgolewski
 
FMRIPREP - robust and easy to use fMRI preprocessing pipeline
FMRIPREP - robust and easy to use fMRI preprocessing pipelineFMRIPREP - robust and easy to use fMRI preprocessing pipeline
FMRIPREP - robust and easy to use fMRI preprocessing pipelineKrzysztof Gorgolewski
 
Evaluation of full brain parcellation schemes using the NeuroVault database o...
Evaluation of full brain parcellation schemes using the NeuroVault database o...Evaluation of full brain parcellation schemes using the NeuroVault database o...
Evaluation of full brain parcellation schemes using the NeuroVault database o...Krzysztof Gorgolewski
 
Quality control for structural and functional MRI
Quality control for structural and functional MRIQuality control for structural and functional MRI
Quality control for structural and functional MRIKrzysztof Gorgolewski
 
The Brain Imaging Data Structure (OHBM 2016)
The Brain Imaging Data Structure (OHBM 2016)The Brain Imaging Data Structure (OHBM 2016)
The Brain Imaging Data Structure (OHBM 2016)Krzysztof Gorgolewski
 
Data sharing in neuroimaging: incentives, tools, and challenges
Data sharing in neuroimaging: incentives, tools, and challengesData sharing in neuroimaging: incentives, tools, and challenges
Data sharing in neuroimaging: incentives, tools, and challengesKrzysztof Gorgolewski
 
If you liked it you should've put a p-value on it ...or not
If you liked it you should've put a p-value on it ...or notIf you liked it you should've put a p-value on it ...or not
If you liked it you should've put a p-value on it ...or notKrzysztof Gorgolewski
 
NeuroVault and the vision for data sharing in neuroimaging
NeuroVault and the vision for data sharing in neuroimagingNeuroVault and the vision for data sharing in neuroimaging
NeuroVault and the vision for data sharing in neuroimagingKrzysztof Gorgolewski
 
Reusable Science: How not to slip from the shoulders of giants
Reusable Science: How not to slip from the shoulders of giantsReusable Science: How not to slip from the shoulders of giants
Reusable Science: How not to slip from the shoulders of giantsKrzysztof Gorgolewski
 

More from Krzysztof Gorgolewski (15)

Study pre-registration: Benefits and considerations
Study pre-registration: Benefits and considerationsStudy pre-registration: Benefits and considerations
Study pre-registration: Benefits and considerations
 
Towards open and reproducible neuroscience in the age of big data
Towards open and  reproducible neuroscience in the age of big dataTowards open and  reproducible neuroscience in the age of big data
Towards open and reproducible neuroscience in the age of big data
 
FMRIPREP - robust and easy to use fMRI preprocessing pipeline
FMRIPREP - robust and easy to use fMRI preprocessing pipelineFMRIPREP - robust and easy to use fMRI preprocessing pipeline
FMRIPREP - robust and easy to use fMRI preprocessing pipeline
 
Evaluation of full brain parcellation schemes using the NeuroVault database o...
Evaluation of full brain parcellation schemes using the NeuroVault database o...Evaluation of full brain parcellation schemes using the NeuroVault database o...
Evaluation of full brain parcellation schemes using the NeuroVault database o...
 
Quality control for structural and functional MRI
Quality control for structural and functional MRIQuality control for structural and functional MRI
Quality control for structural and functional MRI
 
Software testing for scientists
Software testing for scientistsSoftware testing for scientists
Software testing for scientists
 
Docker for scientists
Docker for scientistsDocker for scientists
Docker for scientists
 
The Brain Imaging Data Structure (OHBM 2016)
The Brain Imaging Data Structure (OHBM 2016)The Brain Imaging Data Structure (OHBM 2016)
The Brain Imaging Data Structure (OHBM 2016)
 
Brain Imaging Data Structure
Brain Imaging Data StructureBrain Imaging Data Structure
Brain Imaging Data Structure
 
Meta analysis in neuroimaging 101
Meta analysis in neuroimaging 101Meta analysis in neuroimaging 101
Meta analysis in neuroimaging 101
 
Data sharing in neuroimaging: incentives, tools, and challenges
Data sharing in neuroimaging: incentives, tools, and challengesData sharing in neuroimaging: incentives, tools, and challenges
Data sharing in neuroimaging: incentives, tools, and challenges
 
Making data sharing count
Making data sharing countMaking data sharing count
Making data sharing count
 
If you liked it you should've put a p-value on it ...or not
If you liked it you should've put a p-value on it ...or notIf you liked it you should've put a p-value on it ...or not
If you liked it you should've put a p-value on it ...or not
 
NeuroVault and the vision for data sharing in neuroimaging
NeuroVault and the vision for data sharing in neuroimagingNeuroVault and the vision for data sharing in neuroimaging
NeuroVault and the vision for data sharing in neuroimaging
 
Reusable Science: How not to slip from the shoulders of giants
Reusable Science: How not to slip from the shoulders of giantsReusable Science: How not to slip from the shoulders of giants
Reusable Science: How not to slip from the shoulders of giants
 

Recently uploaded

4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptxmary850239
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxlancelewisportillo
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfErwinPantujan2
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptxmary850239
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
Integumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptIntegumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptshraddhaparab530
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...Postal Advocate Inc.
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management SystemChristalin Nelson
 
Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)cama23
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationRosabel UA
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfVanessa Camilleri
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSJoshuaGantuangco2
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
 

Recently uploaded (20)

4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
Integumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptIntegumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.ppt
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 
Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...
 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translation
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdf
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
 

Avoiding the tower of babel - The Role of Data Description Standards in Biomedical Imaging

  • 1. Avoiding the Tower of Babel The Role of Data Description Standards in Biomedical Imaging Chris Gorgolewski Stanford University @ChrisFiloG The Big Data to Knowledge (BD2K) Guide to the Fundamentals of Data Science
  • 2. Chris Gorgolewski • Obtained Ph.D. degree from University of Edinburgh, 2013 • Co-director of the Stanford Center for Reproducible Neuroscience • Research involves building tools to enable researchers to efficiently share their data, run reproducible analyses & link the results with previously reported findings. • Promotes data sharing through initiatives such as data papers & NeuroVault. • Core developer of neuroimaging data processing framework Nipype, fMRI preprocessing tool FMRIPREP, & quality control tool MRIQC • Coordinator of the Brain Imaging Data Structure (BIDS) standard. • http://chrisgorgolewski.org
  • 3. What are standards useful for?
  • 4. Standardy to forma jezyka Standardy to sposob komunikacji. Jezeli nie zanalibysmy wspolnego jezyka nie bylibysmy sie wstanie komunikowac i wspolnie budowac wielkich rzeczy.
  • 5. Standards are a language Standards are a way of communicating. Without knowing the language we speak we would not be able to communicate and build great things.
  • 6. Esperanto! • Artificial language • Created in 1870 • Optimized for ease of learning
  • 8. Computer ports - ambiguity USB C vs Thunderbolt 3 vs DisplayPort
  • 10. Digital communication The Internet is possible because of standards: • DNS • IP • HTTP • SMTP
  • 11. Application Programmatic Interfaces (APIs) • Building blocks of Web 2.0 • Enable rapid prototyping and flexible scaling
  • 12. How standards are developed in the industry?
  • 13. Institute of Electrical and Electronics Engineers (IEEE) 802.11 aka Wi-Fi
  • 14. Institute of Electrical and Electronics Engineers (IEEE)
  • 15. World Wide Web Consortium (W3C)
  • 16. World Wide Web Consortium (W3C)
  • 17. National Electrical Manufacturers Association (NEMA) Digital Imaging and Communications in Medicine (DICOM)
  • 18. How standards are developed in academia?
  • 19. How to look for standards? FAIRSharing.org
  • 20. Standards in academia • Bottom up • Often developed for a specific project (CIFTI, XCEDE, OpenfMRI) • Technically simple • Competitive (NIFTI vs. MINC) • When widely adopted can be of great value (NIFTI)
  • 21. Use case: Brain Imaging Data Structure
  • 22. Goals • Enable reuse of research neuroimaging data • Shared within or between labs • Enable automatic analysis of datasets • No need to manually input scanning parameters • Make automatic consistency validation of datasets possible • In context of public data sharing
  • 23. Consumers • Lab PIs • To reduce errors in data handling • To enable reuse of data within your own lab • Pipeline developers • To enable automatic data processing • Databases and repositories • To enable automatic data submission
  • 24. Meet Prof. Smith POSTER NUMBER: 1854 BIDS.NEUROIMAGING.I O
  • 26. Getting lost in your data POSTER NUMBER: 1854 BIDS.NEUROIMAGING.I O
  • 27. Keys to success • Involve broad scientific community • Step outside of the bubble of your own lab • Share the credit • Let it be a truly collaborative – not just a product of your lab • Focus on the use cases • Make public call for comments • Follow 80/20 rule • Provide tools (validator)
  • 29. Ways to get more people involved • Ease the barrier to provide feedback • Fully open mailing list • Online Google Doc open for anyone to comment (even anonymously) • Give credit • List contributors by name • Induce the feeling of shared ownership • Acknowledge all types of contributions • Organize in person meetings • Dedicated workshops or along conferences • Be persistent! • Most people will be too busy to help you out
  • 32. Folder organization participant_id age sex sub-001 34 M sub-002 12 F sub-003 33 F POSTER NUMBER: 1854 BIDS.NEUROIMAGING.I O
  • 34. Folder organization { "RepetitionTime": 3.0, "EchoTime": 0.03, "FlipAngle": 78, "SliceTiming": [0.0, 0.2, 0.4, …], "MultibandAccellerationFactor": 4, "PhaseEncodingDirection": "j-" } POSTER NUMBER: 1854 BIDS.NEUROIMAGING.I O
  • 35. Tools APIs: • Querying BIDS datasets programmatically Converters: • From scanner to data Validators: • Verifying dataset consistency
  • 37. Decision making process for extensions Proposal • Initial draft written by experts • Sent out for public comments Refinement • Creation of example datasets • Implementing new functionality in the validator Merge • Striving for consensus • Backwards compatibility
  • 38. Extension examples • Electroencephalography • Positron Emission Tomography • Intracranial Electroencephalography • Multi spectral structural imaging • Models • Derived data • Spetroscopy Cyril Pernet Melanie Ganz Dora Hermes Tal Yarkoni
  • 40. Use cases enabled by BIDS
  • 43.
  • 44. a free online platform for sharing and analysis of neuroimaging data
  • 45. Science as a Service architecture OpenNeuro.org - Poster #1677 45
  • 48.
  • 49. BIDS Contributors • Tibor Auer 💬📖💡🔧📢 • Sylvain Baillet 📖🔍 • Elizabeth Bock 📖💡 • Eric Bridgeford 📖🔧 • Teon L. Brooks 📖💻 • Suyash Bhogawar 📖💡⚠️🔧💬 • Vince D. Calhoun 📖 • Alexander L. Cohen 🐛💻📖💬 • R. Cameron Craddock 📖📢 • Samir Das 📖 • Alejandro de la Vega 🐛💻⚠️ • Eugene P. Duff 📖 • Elizabeth DuPre 📖💡 • Eric A. Earl 🤔 • Anders Eklund 📖📢💻 • Guillaume Flandin 📖💻 • Satrajit S. Ghosh 📖💻 • Tristan Glatard 📖💻 • Mathias Goncalves 💻🔧📢 • Alexandre Gramfort 📖💡 • Yaroslav O. Halchenko 📖📢 • Thomas E. Nichols 📖 • Guiomar Niso 📖💡 • Robert Oostenveld 📖 • Dianne Patterson 📖 • John Pellman 📖 • Cyril Pernet 💬 📖 💡📋 • Dmitry Petrov 📖💻 • Russell A. Poldrack 📖🔍📢 • Jean-Baptiste Poline 📖📢🤔🎨 • Ariel Rokem 📖 • Gunnar Schaefer 📖 • Jan-Mathijs Schoffelen 📖 • Vanessa Sochat 📖 • Francois Tadel 📖🔌💡 • William Triplett 📖 • Jessica A. Turner 📖 • Joseph Wexler 📖💡 • Gaël Varoquaux 📖 • Daniel A. Handwerker 📖 • Michael Hanke 📖🤔🔧🐛📢 • Michael P. Harms 📖⚠️🔧 • Richard N. Henson 📖 • International Neuroinformatics Coordinating Facility 💵📋 • Mainak Jas 📖💻 • David Keator 📖 • Gregory Kiar 📖💻🎨🔧 • Laura and John Arnold Foundation 💵 • Xiangrui Li 📖💻 • Vladimir Litvak 📖 • Dan Lurie 🤔📖🔧🔌💻💬 • Camille Maumet 📖 • Christopher J. Markiewicz 💬📖💻 • Jeremy Moreau 📖💡 • Zachary Michael 📖 • Michael P. Milham 💡🔍 • National Institute of Mental Health 💵 • B. Nolan Nichols 📖
  • 50. Resources • IEEE Process: https://standards.ieee.org/develop/process.html • W3C Process: https://www.w3.org/2017/Process-20170301/ • History of DICOM: https://link.springer.com/chapter/10.1007/978-3- 540-74571-6_4 • BIDS: https://www.nature.com/articles/sdata201644 • BIDS Apps: http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcb i.1005209 • Search for standards: https://fairsharing.org/
  • 51. bids.neuroimaging.io (specification, examples, discussion forum) POSTER NUMBER: 1854 BIDS.NEUROIMAGING.I O