Avoiding the Tower of Babel - The Role of Data Description Standards in Biomedical Imaging
1. Avoiding the Tower of Babel
The Role of Data Description Standards in Biomedical Imaging
Chris Gorgolewski
Stanford University
@ChrisFiloG
The Big Data to Knowledge (BD2K)
Guide to the Fundamentals of Data Science
2. Chris Gorgolewski
• Obtained a Ph.D. from the University of Edinburgh, 2013
• Co-director of the Stanford Center for Reproducible Neuroscience
• Research involves building tools that enable researchers to efficiently share their data, run reproducible analyses & link the results with previously reported findings.
• Promotes data sharing through initiatives such as data papers & NeuroVault.
• Core developer of the neuroimaging data processing framework Nipype, the fMRI preprocessing tool FMRIPREP & the quality control tool MRIQC
• Coordinator of the Brain Imaging Data Structure (BIDS) standard.
• http://chrisgorgolewski.org
4. Standards are a form of language
Standards are a way of communicating.
If we did not know a common language
we would not be able to communicate
and build great things together.
5. Standards are a language
Standards are a way of communicating.
Without knowing the language we speak
we would not be able to communicate
and build great things.
20. Standards in academia
• Bottom up
• Often developed for a specific project (CIFTI, XCEDE, OpenfMRI)
• Technically simple
• Competitive (NIFTI vs. MINC)
• When widely adopted can be of great value (NIFTI)
22. Goals
• Enable reuse of research neuroimaging data
• Shared within or between labs
• Enable automatic analysis of datasets
• No need to manually input scanning parameters
• Make automatic consistency validation of datasets possible
• In the context of public data sharing
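The "automatic consistency validation" goal above can be illustrated with a minimal sketch. It assumes a simplified BIDS-style naming rule (`sub-<label>`, optional `key-value` entities, a suffix, and an extension) and an illustrative rule that every image has a JSON sidecar next to it. The pattern, the `check_dataset` helper, and the sidecar rule are assumptions made for this example; the real BIDS validator is far more thorough and, for instance, lets metadata be inherited from files higher up in the folder tree.

```python
import re

# Simplified, assumed pattern: sub-<label>[_<key>-<value>...]_<suffix>.<ext>
NAME_RE = re.compile(
    r"^sub-(?P<sub>[a-zA-Z0-9]+)"
    r"(?:_[a-z]+-[a-zA-Z0-9]+)*"
    r"_(?P<suffix>[a-zA-Z0-9]+)"
    r"\.(?P<ext>nii\.gz|nii|json)$"
)


def check_dataset(paths):
    """Return a list of human-readable problems found in `paths`.

    `paths` are dataset-relative paths using "/" as the separator.
    """
    problems = []
    names = {p.rsplit("/", 1)[-1] for p in paths}
    for path in sorted(paths):
        name = path.rsplit("/", 1)[-1]
        match = NAME_RE.match(name)
        if match is None:
            problems.append(f"{path}: name does not follow the pattern")
            continue
        # The subject label in the file name should agree with the top folder.
        top = path.split("/", 1)[0]
        if "/" in path and top != f"sub-{match['sub']}":
            problems.append(f"{path}: subject label disagrees with folder {top}")
        # Illustrative rule: every image has a JSON sidecar with the same stem.
        if match["ext"] != "json":
            sidecar = name[: -len(match["ext"])] + "json"
            if sidecar not in names:
                problems.append(f"{path}: missing sidecar {sidecar}")
    return problems


if __name__ == "__main__":
    listing = [
        "sub-01/func/sub-01_task-rest_bold.nii.gz",
        "sub-01/func/sub-01_task-rest_bold.json",
        "sub-02/anat/sub-01_T1w.nii.gz",
        "scan3_final.nii",
    ]
    for problem in check_dataset(listing):
        print(problem)
```

Because the check works on a plain list of paths, the same function could run on a local folder listing or on a manifest received from a repository before any data are transferred.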
23. Consumers
• Lab PIs
• To reduce errors in data handling
• To enable reuse of data within your own lab
• Pipeline developers
• To enable automatic data processing
• Databases and repositories
• To enable automatic data submission
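For a pipeline developer, automatic processing largely means reading scanning parameters from the dataset instead of asking the user to type them in. The sketch below assumes the parameters live in a JSON sidecar file that uses the BIDS field names `RepetitionTime` (in seconds) and `TaskName`. The `load_scan_parameters` helper and the example values are hypothetical and only illustrate the idea.

```python
import json


def load_scan_parameters(sidecar_text, required=("RepetitionTime",)):
    """Parse a JSON sidecar and fail early if a required field is absent."""
    params = json.loads(sidecar_text)
    missing = [key for key in required if key not in params]
    if missing:
        raise KeyError(f"sidecar is missing: {', '.join(missing)}")
    return params


# Hypothetical sidecar content; a real pipeline would read it from disk.
example_sidecar = '{"RepetitionTime": 2.0, "TaskName": "rest"}'

params = load_scan_parameters(example_sidecar)
tr_seconds = params["RepetitionTime"]  # passed to the pipeline, not typed by hand
```

Failing early on a missing field is a deliberate choice here: a pipeline that silently falls back to a default repetition time would reintroduce the data-handling errors the standard is meant to prevent.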
26. Getting lost in your data
Poster number: 1854
bids.neuroimaging.io
27. Keys to success
• Involve broad scientific community
• Step outside of the bubble of your own lab
• Share the credit
• Let it be truly collaborative, not just a product of your lab
• Focus on the use cases
• Make a public call for comments
• Follow the 80/20 rule
• Provide tools (validator)
29. Ways to get more people involved
• Lower the barrier to providing feedback
• Fully open mailing list
• Online Google Doc open for anyone to comment (even anonymously)
• Give credit
• List contributors by name
• Induce the feeling of shared ownership
• Acknowledge all types of contributions
• Organize in-person meetings
• Dedicated workshops or alongside conferences
• Be persistent!
• Most people will be too busy to help you out
37. Decision making process for extensions
Proposal
• Initial draft written by experts
• Sent out for public comments
Refinement
• Creation of example datasets
• Implementing new functionality in the validator
Merge
• Striving for consensus
• Backwards compatibility
38. Extension examples
• Electroencephalography
• Positron Emission Tomography
• Intracranial Electroencephalography
• Multispectral structural imaging
• Models
• Derived data
• Spectroscopy
Cyril Pernet
Melanie Ganz
Dora Hermes
Tal Yarkoni
49. BIDS Contributors
• Tibor Auer 💬📖💡🔧📢
• Sylvain Baillet 📖🔍
• Elizabeth Bock 📖💡
• Eric Bridgeford 📖🔧
• Teon L. Brooks 📖💻
• Suyash Bhogawar 📖💡⚠️🔧💬
• Vince D. Calhoun 📖
• Alexander L. Cohen 🐛💻📖💬
• R. Cameron Craddock 📖📢
• Samir Das 📖
• Alejandro de la Vega 🐛💻⚠️
• Eugene P. Duff 📖
• Elizabeth DuPre 📖💡
• Eric A. Earl 🤔
• Anders Eklund 📖📢💻
• Guillaume Flandin 📖💻
• Satrajit S. Ghosh 📖💻
• Tristan Glatard 📖💻
• Mathias Goncalves 💻🔧📢
• Alexandre Gramfort 📖💡
• Yaroslav O. Halchenko 📖📢
• Thomas E. Nichols 📖
• Guiomar Niso 📖💡
• Robert Oostenveld 📖
• Dianne Patterson 📖
• John Pellman 📖
• Cyril Pernet 💬 📖 💡📋
• Dmitry Petrov 📖💻
• Russell A. Poldrack 📖🔍📢
• Jean-Baptiste Poline 📖📢🤔🎨
• Ariel Rokem 📖
• Gunnar Schaefer 📖
• Jan-Mathijs Schoffelen 📖
• Vanessa Sochat 📖
• Francois Tadel 📖🔌💡
• William Triplett 📖
• Jessica A. Turner 📖
• Joseph Wexler 📖💡
• Gaël Varoquaux 📖
• Daniel A. Handwerker 📖
• Michael Hanke 📖🤔🔧🐛📢
• Michael P. Harms 📖⚠️🔧
• Richard N. Henson 📖
• International Neuroinformatics Coordinating Facility 💵📋
• Mainak Jas 📖💻
• David Keator 📖
• Gregory Kiar 📖💻🎨🔧
• Laura and John Arnold Foundation 💵
• Xiangrui Li 📖💻
• Vladimir Litvak 📖
• Dan Lurie 🤔📖🔧🔌💻💬
• Camille Maumet 📖
• Christopher J. Markiewicz 💬📖💻
• Jeremy Moreau 📖💡
• Zachary Michael 📖
• Michael P. Milham 💡🔍
• National Institute of Mental Health 💵
• B. Nolan Nichols 📖
50. Resources
• IEEE Process: https://standards.ieee.org/develop/process.html
• W3C Process: https://www.w3.org/2017/Process-20170301/
• History of DICOM: https://link.springer.com/chapter/10.1007/978-3-540-74571-6_4
• BIDS: https://www.nature.com/articles/sdata201644
• BIDS Apps: http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1005209
• Search for standards: https://fairsharing.org/