June, 2022
Is Open Science Better Science?
Ewout W. Steyerberg, PhD
Professor of Clinical Biostatistics and
Medical Decision Making
Thanks to many for assistance and inspiration,
including the GAP3 consortium, CENTER-TBI Study
Yes, but …
Open vs closed science
Long ago
- Performed by a few elite scientists
- Doing private experiments
- Discussion in small, closed communities
Probabilities to quantify uncertainty
• Christiaan Huygens 1657:
'Van rekeningh in spelen van geluck'
• Thomas Bayes 1763:
“An Essay towards solving a Problem in the Doctrine of Chances”
(read to the Royal Society by Richard Price)
• Pierre Laplace 1812:
Théorie analytique des probabilités
6-Jun-22
3 Insert > Header & footer
Open vs closed science
Long ago
- Performed by a few elite scientists
- Doing private experiments
- Discussion in small, closed communities
Recent
- Science as a profession
- Protect data + code as intellectual property
- Aim for shocking findings in high IF journals
https://www.sciencemag.org/news/2020/06/whos-blame-these-three-scientists-are-heart-surgisphere-covid-19-scandal
Overall claim
“Open Science will make research better”
Vote pro / neutral / con
“More data is better”
Vote pro / neutral / con
Today
Aims:
- Highlight some strong points in Open Science
- Hint at some challenges in Open Science
Reflections based on personal 30-yr research experience,
specific focus on prediction research / decision making
Open Science to better address
Big Research questions
Open science research questions: case 1
Example 1: Red cards and dark-skin-toned soccer players
https://psyarxiv.com/qkwst/
Open science research questions: case 1
• 29 teams involving 61 analysts; same dataset; same research question:
whether soccer referees are more likely to give red cards to dark-skin-toned
players than to light-skin-toned players
• Estimated odds ratios 0.89–2.93 (median 1.3)
• 20 teams: statistically significant positive effect; 9 teams: non-significant relation
Estimated odds ratios by 29 research teams
“Logistic regression”
Open science research questions: case 1
• 29 teams involving 61 analysts; same dataset; same research question:
whether soccer referees are more likely to give red cards to dark-skin-toned
players than to light-skin-toned players
• Estimated odds ratios 0.89–2.93 (median 1.3).
• 20 teams: statistically significant positive effect; 9 teams: non-significant relation.
• 21 unique combinations of covariates
• “Variation in analysis of complex data may be difficult to
avoid, even by experts with honest intentions”
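
One way to see why different covariate combinations can produce such a spread of odds ratios: the same counts give a different odds ratio depending on whether a covariate is adjusted for. The sketch below contrasts a crude odds ratio with a Mantel-Haenszel odds ratio stratified on one made-up covariate. All counts and stratum names are hypothetical and are not taken from the red card dataset; the teams in the study used a range of regression models, not necessarily this estimator.

```python
# Hypothetical 2x2 counts per stratum of one invented covariate.
# Each tuple: (exposed events, exposed non-events,
#              unexposed events, unexposed non-events)
strata = {
    "stratum_low_risk": (4, 96, 16, 384),
    "stratum_high_risk": (80, 320, 20, 80),
}


def odds_ratio(a, b, c, d):
    """Odds ratio of a single 2x2 table."""
    return (a * d) / (b * c)


def crude_odds_ratio(tables):
    """Collapse all strata into one table, ignoring the covariate."""
    a, b, c, d = (sum(col) for col in zip(*tables))
    return odds_ratio(a, b, c, d)


def mantel_haenszel_odds_ratio(tables):
    """Mantel-Haenszel odds ratio, adjusted for the stratifying covariate."""
    num = sum(a * d / (a + b + c + d) for a, b, c, d in tables)
    den = sum(b * c / (a + b + c + d) for a, b, c, d in tables)
    return num / den


tables = list(strata.values())
crude = crude_odds_ratio(tables)
adjusted = mantel_haenszel_odds_ratio(tables)
print(f"crude OR = {crude:.2f}, adjusted OR = {adjusted:.2f}")
```

In this constructed example the odds ratio is 1 within each stratum, yet the crude analysis suggests a strong effect, because the exposed group is concentrated in the high-risk stratum. Real analyses differ in more subtle ways, but the mechanism is the same.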
Open science research questions: case 2
Example from Maarten van Smeden
@MaartenvSmeden
Predicting mortality – the media
Findings not convincing
Cox, #4, 30 vars, max c = 0.793
RF, #7, 600 vars, c = 0.797
Elastic net, #9, 600 vars, c = 0.801
Machine learning vs conventional modeling
1. Findings convincing?
“We found that random forests did not outperform Cox models despite their
inherent ability to accommodate nonlinearities and interactions. …
Elastic nets achieved the highest discrimination performance …, demonstrating
the ability of regularisation to select relevant variables and optimise model
coefficients in an EHR context.”
Machine learning vs conventional modeling
1. Findings convincing? Not in case-study
2. Systematic / “it depends”?
Open science research questions: case 2
• 243 real datasets from “the OpenML database”
• RF performed better than LR:
mean difference between RF and LR was 0.041 (95% CI [0.031, 0.053]) for
the Area Under the ROC Curve
• Results were dependent on the inclusion criteria used to select the example
datasets
• ES: Results rely on 10 x 10-fold cross-validation
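
The evaluation scheme named above (10 x 10-fold cross-validation of a difference in AUC) can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the benchmark's code: the data are simulated, and the two "models" (`fit_one_feature`, `fit_mean_difference`) are hypothetical stand-ins, not logistic regression or a random forest. Only the resampling and the AUC computation are the point here.

```python
import math
import random
from statistics import mean


def auc(scores, labels):
    """Area under the ROC curve: share of (positive, negative) pairs ranked correctly."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def repeated_kfold(n, k=10, repeats=10, seed=0):
    """Yield (train, test) index lists for `repeats` reshuffled k-fold partitions."""
    rng = random.Random(seed)
    for _ in range(repeats):
        idx = list(range(n))
        rng.shuffle(idx)
        for fold in range(k):
            test = idx[fold::k]
            held_out = set(test)
            yield [i for i in idx if i not in held_out], test


def fit_one_feature(X, y):
    """Hypothetical simple model: the score is the first feature."""
    return lambda row: row[0]


def fit_mean_difference(X, y):
    """Hypothetical richer model: project on the difference of class means."""
    pos = [row for row, t in zip(X, y) if t == 1]
    neg = [row for row, t in zip(X, y) if t == 0]
    w = [mean(p) - mean(q) for p, q in zip(zip(*pos), zip(*neg))]
    return lambda row: sum(wi * xi for wi, xi in zip(w, row))


def cv_auc_difference(X, y, fit_a, fit_b, k=10, repeats=10, seed=0):
    """Mean paired difference AUC(b) - AUC(a) over all held-out folds."""
    diffs = []
    for train, test in repeated_kfold(len(y), k, repeats, seed):
        y_test = [y[i] for i in test]
        if len(set(y_test)) < 2:
            continue  # AUC is undefined when a fold holds only one class
        X_train = [X[i] for i in train]
        y_train = [y[i] for i in train]
        model_a = fit_a(X_train, y_train)
        model_b = fit_b(X_train, y_train)
        auc_a = auc([model_a(X[i]) for i in test], y_test)
        auc_b = auc([model_b(X[i]) for i in test], y_test)
        diffs.append(auc_b - auc_a)
    return mean(diffs), len(diffs)


# Simulated data (assumption): two standard-normal predictors, logistic outcome.
rng = random.Random(1)
X = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(200)]
y = [int(rng.random() < 1 / (1 + math.exp(-(a + b)))) for a, b in X]

diff, n_folds = cv_auc_difference(X, y, fit_one_feature, fit_mean_difference)
print(f"mean AUC difference over {n_folds} folds: {diff:.3f}")
```

Because both models are scored on the same held-out folds, the differences are paired. The 100 fold-level differences are not independent, which is one reason to read a confidence interval across datasets differently from one within a single dataset.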
Open science research questions: case 2
• More clarification needed on when ML / RF works best; at the least, a large N is needed
Systematic review on ML vs classic modeling
Differences in discrimination
Thanks to Maarten van Smeden
Summary on examples of Open Science
to better address Big research questions
• 1 data set
• Multiple modelers
• Multiple modeling options
• 1 neutral comparison; 243 OpenML datasets
• Review of 282 comparative studies: meta-research
Open Science: data sharing
→ Collaboration vs giving
Heterogeneity in data … ignored
Data sharing
• Pro:
• Allowed for larger sample size in a rare disease
• Cons:
• Heterogeneity?
• Substantial politics / efforts
Open Science: analyses and interpretation
Analyses: OHDSI model
OHDSI: COVID and other research topics
The power of OHDSI
OMOP common data model enables sharing of
model development code
Performance for different outcomes in multiple cohorts
OHDSI: bridging data sharing - analyses
• Keep data local
• Run centrally available analyses, started locally
• Share results centrally
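
The three steps above can be sketched as a toy federated workflow. This is a minimal illustration of the pattern only, assuming hypothetical site names, records, and helper functions (`shared_analysis`, `pool`); it does not use or represent OHDSI's actual tooling or the OMOP common data model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SiteSummary:
    """Aggregate result that is allowed to leave a site."""
    site: str
    n: int
    events: int


def shared_analysis(site, records):
    """Centrally written analysis, executed locally; returns aggregates only."""
    return SiteSummary(site, len(records), sum(r["outcome"] for r in records))


def pool(summaries):
    """Central step: combine site-level aggregates into one event rate."""
    total_n = sum(s.n for s in summaries)
    total_events = sum(s.events for s in summaries)
    return total_events / total_n


# Hypothetical patient-level data; in the pattern above it never leaves the site.
local_data = {
    "site_A": [{"age": 61, "outcome": 1}, {"age": 48, "outcome": 0},
               {"age": 70, "outcome": 1}],
    "site_B": [{"age": 55, "outcome": 0}, {"age": 66, "outcome": 0}],
}

summaries = [shared_analysis(site, rows) for site, rows in local_data.items()]
pooled_rate = pool(summaries)
print(summaries)
print(f"pooled event rate = {pooled_rate:.2f}")
```

Only `SiteSummary` objects reach the coordinator. Naive pooling like this ignores between-site differences, which is the heterogeneity issue raised in the slides that follow.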
Open Science: analyses and interpretation
Open Science challenge:
dealing with heterogeneity for prediction research
Heterogeneity
• Study design
• Selection of subjects
• Measurement of covariates
• Measurement of outcomes
• Associations of covariates with outcome
• Overall outcome rates
• Performance of prediction models
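
One common way to quantify heterogeneity of this kind across cohorts is a random-effects meta-analysis. The sketch below uses the DerSimonian-Laird estimator of the between-cohort variance; the choice of estimator and the input numbers (cohort-specific log odds ratios and their variances) are assumptions made for illustration, not values or methods taken from the slides.

```python
import math


def dersimonian_laird(estimates, variances):
    """Random-effects pooling with the DerSimonian-Laird between-study variance."""
    k = len(estimates)
    w = [1 / v for v in variances]
    fixed = sum(wi * yi for wi, yi in zip(w, estimates)) / sum(w)
    # Cochran's Q: weighted squared deviations from the fixed-effect estimate.
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, estimates))
    c = sum(w) - sum(wi ** 2 for wi in w) / sum(w)
    tau2 = max(0.0, (q - (k - 1)) / c)
    # I^2: share of total variation attributed to between-cohort differences.
    i2 = max(0.0, (q - (k - 1)) / q) if q > 0 else 0.0
    w_re = [1 / (v + tau2) for v in variances]
    pooled = sum(wi * yi for wi, yi in zip(w_re, estimates)) / sum(w_re)
    se = math.sqrt(1 / sum(w_re))
    return {"pooled": pooled, "se": se, "tau2": tau2, "i2": i2}


# Hypothetical cohort-specific log odds ratios for one predictor.
log_odds_ratios = [0.10, 0.45, 0.80, 0.30]
variances = [0.02, 0.03, 0.02, 0.04]

result = dersimonian_laird(log_odds_ratios, variances)
print({name: round(value, 3) for name, value in result.items()})
```

A `tau2` above zero indicates that the cohorts differ by more than sampling error alone would explain, so a single pooled estimate understates the uncertainty for a new setting.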
Analyses: dealing with heterogeneity
15 cohorts: 11 RCTs, 4 observational studies
Heterogeneous case-mix
Heterogeneous predictor effects
Heterogeneous predictions
Heterogeneity → uncertainty in individual predictions
given that a prespecified logistic model is fitted
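
A small numerical sketch of this point: when the same logistic model form is fitted in different cohorts, one and the same patient can receive noticeably different predicted risks. The intercepts, slope, and predictor value below are hypothetical and chosen only to illustrate the effect; they are not estimates from the cohorts in the slides.

```python
import math


def predicted_risk(intercept, slope, x):
    """Predicted probability from a logistic model with one predictor."""
    return 1 / (1 + math.exp(-(intercept + slope * x)))


# Hypothetical cohort-specific intercepts; the slope is assumed common.
cohort_intercepts = {"cohort_1": -2.0, "cohort_2": -1.5, "cohort_3": -1.0}
common_slope = 0.8
patient_x = 1.0  # the same patient evaluated under each cohort's model

risks = {
    cohort: predicted_risk(a, common_slope, patient_x)
    for cohort, a in cohort_intercepts.items()
}
low, high = min(risks.values()), max(risks.values())
print({cohort: round(r, 2) for cohort, r in risks.items()})
print(f"predicted risk ranges from {low:.2f} to {high:.2f}")
```

Differences in slopes across cohorts would widen the range further; the sketch varies only the intercept to keep the mechanism visible.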
“Open Science is Better Science”
1. Research questions in competitions
• Red cards
• Neutral comparisons / meta-research
2. Data sharing
• Collaborative efforts most successful
3. Analyses
• OHDSI: modern, keep data local
• Heterogeneity
