SlideShare une entreprise Scribd logo
1  sur  25
A plea for good methodology:
the strengths and limitations of
approaches to developing prediction
models in obstetrics and gynecology
Ben Van Calster
Department of Development and Regeneration, KU Leuven (B)
Department of Biomedical Data Sciences, LUMC (NL)
Research Ethics Committee, University Hospitals Leuven (B)
Epi-Centre, KU Leuven (B)
Glasgow/Leuven, October 16th 2020
2
To explain or to predict?
DESCRIBE / EXPLAIN
• Study independent associations / predictors / risk factors
• Key: effect size per variable
• Not prediction modeling!
PREDICT
• Obtain a system that gives predictions (risk estimates)
• Aim is the use in NEW patients: it should work ‘tomorrow’, not now
• Key: quality of the predictions
3
Strengths of prediction models
• Help in (shared) clinical decision making
• Objectify predictions
• Patient counseling
• Effect on clinical workflow and outcomes
GOOD METHODOLOGY AND
GOOD REPORTING ARE ESSENTIAL!
4
Beam and Kohane. JAMA 2018;319:1317-8.
Get the objective right
5
Riley. Nature 2019;572:27-9.
Cronin & Vickers. Urology 2010;76:1298-301.
Get the objective right
• Is there a real clinical need for a new model?
• For which outcome, and for which management decision?
• When during the clinical workflow should the prediction be made?
• Does this match with the timing of the predictors?
• Do you have/can you collect data that is (really) fit for purpose?
6
Example
7
Tangiisuran et al. PLoS One 2014;9:e111254.
Too many models, too few validations
• 1060 models predicting outcomes after CVD (1990-2015) (Wessler et al, 2017)
• 363 models predicting CVD (Damen et al, 2016)
• 231 models related to Covid-19 (Wynants et al, 2020; living syst review)
ObGyn related:
• 263 models in obstetrics (Kleinrouweler et al, 2016)
• 116 models to diagnose ovarian malignancy (Kaijser et al, 2014)
 Perhaps academic CVs need help, but patients need help more
8
Thanks to @GSCollins
Wessler et al. Diagn Progn Res 2017;1:20. Damen et al. BMJ 2016;353:i2416. Wynants et al. BMJ 2020;369:m1328.
Kleinrouweler et al. AJOG 2016;214:79-90. Kaijser et al. Hum Reprod Update 2014;20:229-62.
Models in obstetrics
Only 23 of 263 models (9%) have been externally validated!
9
Kleinrouweler et al. AJOG 2016;214:79-90.
Knowledge is power (1)
Avoid dichotomization of continuous predictor variables
• Biologically implausible
• Deletes information, worse predictions (AUC ) (Collins 2016; Steyerberg 2018)
• Only clinical decisions should be binary
10
Collins et al. Stat Med 2016;35:4124-35.
Steyerberg et al. J Clin Epidemiol 2018;98:133-43.
Butts & Ng. Statistical and methodological myths and urban legends, p361-86. Routledge/Taylor & Francis, 2009.
Knowledge is power (2)
Use available knowledge, do not always ask the data!
11
Good & Hardin. Common errors in statistics (and how to avoid them). Wiley, 2006.
“Bypassing the brain to
compute by reflex is a
sure recipe for disaster”
Knowledge is power (3)
Explain how and when predictors are measured, standardize where
reasonably possible
- Units; e.g. progesterone in ng/ml or nmol/L
- How tumor volume or diameter is calculated
- What is meant by ‘hormonal therapy use’ (Which? When?)
- Smoking
- BMI: measured vs self-reported
If measurement varies across studies, model performance deteriorates
(Luijken, 2019; Luijken, 2020)
12
Luijken et al. Stat Med 2019;38:3444-59.
Luijken et al. J Clin Epidemiol 2020;119:7-18.
Knowledge is power (4): sample size
You think of buying a Porsche.
But if you do not want to pay for it,
you may get this.
The same applies for developing risk models.
13
The currency is sample size
The more complicated (or ‘fancy’) the modeling strategy,
the more you have to pay with sample size.
(counterfeit money does not help: we need good quality data)
In this respect, avoid train-test split, this reduces sample size for model
development: you’re burning your money
14
The currency is sample size
Many have heard of the “10 events per variable” rule
1. Often incorrect use: This is not about 10 patients per variable in the final model!
2. This is outdated, 10 EPV is often not enough. See new procedure (BMJ 2020).
3. Flexible algorithms are data hungry, EPV>>10 may be needed (van der Ploeg 2014).
15
Van der Ploeg et al. BMC Med Res Methodol 2014;14:137
Riley et al. BMJ 2020;368:m441.
Knowledge is power (5): missing data
Usually, “empty cells” are “full of information”!
Using only complete cases
- decreases sample size (less money)
- typically leaves a non-representative sample (biased risk estimates)
Presence of a test can be more predictive than the test result! See EHR data.
16
Agniel et al. BMJ 2018;360:k1479.
Model validation: assess calibration!
Key elements of model performance:
discrimination between patients with and without the event
calibration (correctness) of risk estimates
17
DISCRIMINATION
When it rained, was the
estimated chance of rain
higher (on average)?
CALIBRATION
For days with 80% estimated
chance of rain, did it rain on
8 out of 10 days?
Calibration: the Achilles heel
18
Van Calster & Vickers. Med Decis Making 2015;35:162-9.
Van Calster et al. BMC Med 2019;17:230.
Miscalibration: estimated risk is inaccurate
 Patient and clinician are misinformed, may lead to inappropriate decisions
(Van Calster & Vickers, 2015)
Performance depends on place and time
One external validation in one hospital does not tell much about a model!
“There is no such thing as a validated model”
 Study heterogeneity
19
Van Calster et al. BMJ 2020;370:m2614.
P-values and significance testing
Very small role in prediction modeling
- Focus is on robust predictions
- Focus is on precision of the performance estimates (e.g. AUC, calibration)
- Focus is on quantifying heterogeneity
- Focus is on qualitative difference between populations
- Focus is on a priori selection of predictors
- further data-driven selection can be based on p-values; high alpha recommended
(Steyerberg & Van Calster, 2020)
20
Steyerberg & Van Calster. Eur J Clin Invest 2020;50:e13229.
Machine learning popularity
21
“Typical machine learning algorithms are highly flexible
So will uncover associations we could not find before
Hence better predictions and management decisions”
→ One of the master keys, with guaranteed success!
Machine Learning: success guaranteed?
22
Christodoulou et al. J Clin Epidemiol 2019;110:12-22.
Poor methodology and reporting is common
23
Christodoulou et al (2019) – 71 studies:
- What was done about missing data? 100% poor or unclear
- How was performance validated? 68% unclear or biased approach
- Was calibration of risk estimates studied? 79% not at all
- Prognostic models: time horizon often ignored completely
Kleinrouweler et al (2016) – 263 models:
- Was calibration studied? 82% not at all
- Was the model fully presented so people can use it? Not for 38% of models
- Was the clinical use discussed? Not for 89% of models
FOLLOW TRIPOD GUIDELINES FOR REPORTING!
www.tripod-statement.org
Christodoulou et al. J Clin Epidemiol 2019;110:12-22.
Kleinrouweler et al. AJOG 2016;214:79-90.
Moons et al. Ann Intern Med 2015;162:w1-73.
The harm of poor methodology
24
Steyerberg et al. J Clin Epidemiol 2018;98:133-43.
Resources on prediction modeling
25
Involve a statistician with knowledge of prediction modeling!
Steyerberg EW. Clinical prediction models (2nd ed). Springer, 2019.
Riley RD et al. Prognosis research in healthcare. OUP, 2019.
Moons KGM et al. Transparent reporting of a multivariable prediction model for individual
prognosis and diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med
2015;162:W1-73.
Wynants L et al. Key steps and common pitfalls in developing and validating risk models.
BJOG 2017; 2017;124:423-432.
Prognosisresearch.com (newly launched website)

Contenu connexe

Tendances

NY Prostate Cancer Conference - A. Vickers - Session 1: Traditional statistic...
NY Prostate Cancer Conference - A. Vickers - Session 1: Traditional statistic...NY Prostate Cancer Conference - A. Vickers - Session 1: Traditional statistic...
NY Prostate Cancer Conference - A. Vickers - Session 1: Traditional statistic...European School of Oncology
 
How to establish and evaluate clinical prediction models - Statswork
How to establish and evaluate clinical prediction models - StatsworkHow to establish and evaluate clinical prediction models - Statswork
How to establish and evaluate clinical prediction models - StatsworkStats Statswork
 
850 keynote savage_using his laptop
850 keynote savage_using his laptop850 keynote savage_using his laptop
850 keynote savage_using his laptopRising Media, Inc.
 
Timeliness of Malaria Treatment in Children Under Five Years of Age in sub-Sa...
Timeliness of Malaria Treatment in Children Under Five Years of Age in sub-Sa...Timeliness of Malaria Treatment in Children Under Five Years of Age in sub-Sa...
Timeliness of Malaria Treatment in Children Under Five Years of Age in sub-Sa...MEASURE Evaluation
 
Ijp volume 4 issue 3_pages 1465-1473
Ijp volume 4 issue 3_pages 1465-1473Ijp volume 4 issue 3_pages 1465-1473
Ijp volume 4 issue 3_pages 1465-1473Ajums
 
Introduction to prediction modelling - Berlin 2018 - Part I
Introduction to prediction modelling - Berlin 2018 - Part IIntroduction to prediction modelling - Berlin 2018 - Part I
Introduction to prediction modelling - Berlin 2018 - Part IMaarten van Smeden
 
Emergency Department Triage and Digital Health
Emergency Department Triage and Digital HealthEmergency Department Triage and Digital Health
Emergency Department Triage and Digital HealthHughSingleton
 
Regression shrinkage: better answers to causal questions
Regression shrinkage: better answers to causal questionsRegression shrinkage: better answers to causal questions
Regression shrinkage: better answers to causal questionsMaarten van Smeden
 
Surrogate endpoints in global health research: still searching for killer app...
Surrogate endpoints in global health research: still searching for killer app...Surrogate endpoints in global health research: still searching for killer app...
Surrogate endpoints in global health research: still searching for killer app...SystemOne
 
Bioanalytical validation house of cards
Bioanalytical validation house of cardsBioanalytical validation house of cards
Bioanalytical validation house of cardsE. Dennis Bashaw
 
A cost effectiveness_model_of_screening_strategies.16
A cost effectiveness_model_of_screening_strategies.16A cost effectiveness_model_of_screening_strategies.16
A cost effectiveness_model_of_screening_strategies.16Yesenia Castillo Salinas
 
[Typ]Poster[Sbj]1593Synoptics[Dte]20150906
[Typ]Poster[Sbj]1593Synoptics[Dte]20150906[Typ]Poster[Sbj]1593Synoptics[Dte]20150906
[Typ]Poster[Sbj]1593Synoptics[Dte]20150906Mark Gusack
 
Predicting Diabetic Readmission Rates: Moving Beyond HbA1c
Predicting Diabetic Readmission Rates: Moving Beyond HbA1cPredicting Diabetic Readmission Rates: Moving Beyond HbA1c
Predicting Diabetic Readmission Rates: Moving Beyond HbA1cDamian R. Mingle, MBA
 
Digital platforms could disrupts how pharma companies plan and excecute clini...
Digital platforms could disrupts how pharma companies plan and excecute clini...Digital platforms could disrupts how pharma companies plan and excecute clini...
Digital platforms could disrupts how pharma companies plan and excecute clini...Jayanthi Repalli, PhD
 
The absence of a gold standard: a measurement error problem
The absence of a gold standard: a measurement error problemThe absence of a gold standard: a measurement error problem
The absence of a gold standard: a measurement error problemMaarten van Smeden
 
Glymour aaai
Glymour aaaiGlymour aaai
Glymour aaaimglymour
 
Meta Analysis of Medical Device Data Applications for Designing Studies and R...
Meta Analysis of Medical Device Data Applications for Designing Studies and R...Meta Analysis of Medical Device Data Applications for Designing Studies and R...
Meta Analysis of Medical Device Data Applications for Designing Studies and R...NAMSA
 
Optimising sepsis treatment with reinforcement learning - Matthieu Komorowski
Optimising sepsis treatment with reinforcement learning - Matthieu KomorowskiOptimising sepsis treatment with reinforcement learning - Matthieu Komorowski
Optimising sepsis treatment with reinforcement learning - Matthieu KomorowskiMads Astvad
 

Tendances (20)

NY Prostate Cancer Conference - A. Vickers - Session 1: Traditional statistic...
NY Prostate Cancer Conference - A. Vickers - Session 1: Traditional statistic...NY Prostate Cancer Conference - A. Vickers - Session 1: Traditional statistic...
NY Prostate Cancer Conference - A. Vickers - Session 1: Traditional statistic...
 
How to establish and evaluate clinical prediction models - Statswork
How to establish and evaluate clinical prediction models - StatsworkHow to establish and evaluate clinical prediction models - Statswork
How to establish and evaluate clinical prediction models - Statswork
 
850 keynote savage_using his laptop
850 keynote savage_using his laptop850 keynote savage_using his laptop
850 keynote savage_using his laptop
 
Timeliness of Malaria Treatment in Children Under Five Years of Age in sub-Sa...
Timeliness of Malaria Treatment in Children Under Five Years of Age in sub-Sa...Timeliness of Malaria Treatment in Children Under Five Years of Age in sub-Sa...
Timeliness of Malaria Treatment in Children Under Five Years of Age in sub-Sa...
 
Ijp volume 4 issue 3_pages 1465-1473
Ijp volume 4 issue 3_pages 1465-1473Ijp volume 4 issue 3_pages 1465-1473
Ijp volume 4 issue 3_pages 1465-1473
 
Introduction to prediction modelling - Berlin 2018 - Part I
Introduction to prediction modelling - Berlin 2018 - Part IIntroduction to prediction modelling - Berlin 2018 - Part I
Introduction to prediction modelling - Berlin 2018 - Part I
 
Emergency Department Triage and Digital Health
Emergency Department Triage and Digital HealthEmergency Department Triage and Digital Health
Emergency Department Triage and Digital Health
 
Regression shrinkage: better answers to causal questions
Regression shrinkage: better answers to causal questionsRegression shrinkage: better answers to causal questions
Regression shrinkage: better answers to causal questions
 
Surrogate endpoints in global health research: still searching for killer app...
Surrogate endpoints in global health research: still searching for killer app...Surrogate endpoints in global health research: still searching for killer app...
Surrogate endpoints in global health research: still searching for killer app...
 
Bioanalytical validation house of cards
Bioanalytical validation house of cardsBioanalytical validation house of cards
Bioanalytical validation house of cards
 
A cost effectiveness_model_of_screening_strategies.16
A cost effectiveness_model_of_screening_strategies.16A cost effectiveness_model_of_screening_strategies.16
A cost effectiveness_model_of_screening_strategies.16
 
[Typ]Poster[Sbj]1593Synoptics[Dte]20150906
[Typ]Poster[Sbj]1593Synoptics[Dte]20150906[Typ]Poster[Sbj]1593Synoptics[Dte]20150906
[Typ]Poster[Sbj]1593Synoptics[Dte]20150906
 
SgtSaraEdition
SgtSaraEditionSgtSaraEdition
SgtSaraEdition
 
Predicting Diabetic Readmission Rates: Moving Beyond HbA1c
Predicting Diabetic Readmission Rates: Moving Beyond HbA1cPredicting Diabetic Readmission Rates: Moving Beyond HbA1c
Predicting Diabetic Readmission Rates: Moving Beyond HbA1c
 
Digital platforms could disrupts how pharma companies plan and excecute clini...
Digital platforms could disrupts how pharma companies plan and excecute clini...Digital platforms could disrupts how pharma companies plan and excecute clini...
Digital platforms could disrupts how pharma companies plan and excecute clini...
 
The absence of a gold standard: a measurement error problem
The absence of a gold standard: a measurement error problemThe absence of a gold standard: a measurement error problem
The absence of a gold standard: a measurement error problem
 
Glymour aaai
Glymour aaaiGlymour aaai
Glymour aaai
 
Meta Analysis of Medical Device Data Applications for Designing Studies and R...
Meta Analysis of Medical Device Data Applications for Designing Studies and R...Meta Analysis of Medical Device Data Applications for Designing Studies and R...
Meta Analysis of Medical Device Data Applications for Designing Studies and R...
 
Optimising sepsis treatment with reinforcement learning - Matthieu Komorowski
Optimising sepsis treatment with reinforcement learning - Matthieu KomorowskiOptimising sepsis treatment with reinforcement learning - Matthieu Komorowski
Optimising sepsis treatment with reinforcement learning - Matthieu Komorowski
 
Austin Ophthalmology
Austin OphthalmologyAustin Ophthalmology
Austin Ophthalmology
 

Similaire à A plea for good methodology when developing clinical prediction models

Make clinical prediction models great again
Make clinical prediction models great againMake clinical prediction models great again
Make clinical prediction models great againBenVanCalster
 
Developing and validating statistical models for clinical prediction and prog...
Developing and validating statistical models for clinical prediction and prog...Developing and validating statistical models for clinical prediction and prog...
Developing and validating statistical models for clinical prediction and prog...Evangelos Kritsotakis
 
Comparison of a fall risk assessment tool with nurses’ judgment alone
Comparison of a fall risk assessment tool with nurses’ judgment aloneComparison of a fall risk assessment tool with nurses’ judgment alone
Comparison of a fall risk assessment tool with nurses’ judgment aloneDanskSygeplejeraad
 
PREDICTION OF BREAST CANCER,COMPARATIVE REVIEW OF MACHINE LEARNING TECHNIQUES...
PREDICTION OF BREAST CANCER,COMPARATIVE REVIEW OF MACHINE LEARNING TECHNIQUES...PREDICTION OF BREAST CANCER,COMPARATIVE REVIEW OF MACHINE LEARNING TECHNIQUES...
PREDICTION OF BREAST CANCER,COMPARATIVE REVIEW OF MACHINE LEARNING TECHNIQUES...IRJET Journal
 
Common statistical pitfalls & errors in biomedical research (a top-5 list)
Common statistical pitfalls & errors in biomedical research (a top-5 list)Common statistical pitfalls & errors in biomedical research (a top-5 list)
Common statistical pitfalls & errors in biomedical research (a top-5 list)Evangelos Kritsotakis
 
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019Ewout Steyerberg
 
Therapeutic_Innovation_&_Regulatory_Science-2015-Tantsyura
Therapeutic_Innovation_&_Regulatory_Science-2015-TantsyuraTherapeutic_Innovation_&_Regulatory_Science-2015-Tantsyura
Therapeutic_Innovation_&_Regulatory_Science-2015-TantsyuraVadim Tantsyura
 
David Haggstrom Slides from AHRQ Kick-Off Event
David Haggstrom Slides from AHRQ Kick-Off EventDavid Haggstrom Slides from AHRQ Kick-Off Event
David Haggstrom Slides from AHRQ Kick-Off EventShawnHoke
 
ISCB 2023 Sources of uncertainty b.pptx
ISCB 2023 Sources of uncertainty b.pptxISCB 2023 Sources of uncertainty b.pptx
ISCB 2023 Sources of uncertainty b.pptxBenVanCalster
 
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...mlaij
 
BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...
BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...
BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...mlaij
 
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...mlaij
 
Evaluating the Medical Literature
Evaluating the Medical LiteratureEvaluating the Medical Literature
Evaluating the Medical LiteratureClista Clanton
 
Measuring clinical utility: uncertainty in Net Benefit
Measuring clinical utility: uncertainty in Net BenefitMeasuring clinical utility: uncertainty in Net Benefit
Measuring clinical utility: uncertainty in Net BenefitLaure Wynants
 
20160223 patient experience2
20160223 patient experience220160223 patient experience2
20160223 patient experience2jescarra
 
Dichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatisticianDichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatisticianLaure Wynants
 
Pediatric Adverse Drug Events Presentation
Pediatric Adverse Drug Events PresentationPediatric Adverse Drug Events Presentation
Pediatric Adverse Drug Events PresentationJordan Gamart
 
Integrated ACO selected for the NAACOS Innovation Showcase
Integrated ACO selected for the NAACOS Innovation ShowcaseIntegrated ACO selected for the NAACOS Innovation Showcase
Integrated ACO selected for the NAACOS Innovation ShowcaseEric Weaver
 

Similaire à A plea for good methodology when developing clinical prediction models (20)

Make clinical prediction models great again
Make clinical prediction models great againMake clinical prediction models great again
Make clinical prediction models great again
 
Developing and validating statistical models for clinical prediction and prog...
Developing and validating statistical models for clinical prediction and prog...Developing and validating statistical models for clinical prediction and prog...
Developing and validating statistical models for clinical prediction and prog...
 
Comparison of a fall risk assessment tool with nurses’ judgment alone
Comparison of a fall risk assessment tool with nurses’ judgment aloneComparison of a fall risk assessment tool with nurses’ judgment alone
Comparison of a fall risk assessment tool with nurses’ judgment alone
 
PREDICTION OF BREAST CANCER,COMPARATIVE REVIEW OF MACHINE LEARNING TECHNIQUES...
PREDICTION OF BREAST CANCER,COMPARATIVE REVIEW OF MACHINE LEARNING TECHNIQUES...PREDICTION OF BREAST CANCER,COMPARATIVE REVIEW OF MACHINE LEARNING TECHNIQUES...
PREDICTION OF BREAST CANCER,COMPARATIVE REVIEW OF MACHINE LEARNING TECHNIQUES...
 
Common statistical pitfalls & errors in biomedical research (a top-5 list)
Common statistical pitfalls & errors in biomedical research (a top-5 list)Common statistical pitfalls & errors in biomedical research (a top-5 list)
Common statistical pitfalls & errors in biomedical research (a top-5 list)
 
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
 
Therapeutic_Innovation_&_Regulatory_Science-2015-Tantsyura
Therapeutic_Innovation_&_Regulatory_Science-2015-TantsyuraTherapeutic_Innovation_&_Regulatory_Science-2015-Tantsyura
Therapeutic_Innovation_&_Regulatory_Science-2015-Tantsyura
 
David Haggstrom Slides from AHRQ Kick-Off Event
David Haggstrom Slides from AHRQ Kick-Off EventDavid Haggstrom Slides from AHRQ Kick-Off Event
David Haggstrom Slides from AHRQ Kick-Off Event
 
Neal Lesh
Neal LeshNeal Lesh
Neal Lesh
 
ISCB 2023 Sources of uncertainty b.pptx
ISCB 2023 Sources of uncertainty b.pptxISCB 2023 Sources of uncertainty b.pptx
ISCB 2023 Sources of uncertainty b.pptx
 
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
 
BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...
BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...
BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...
 
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
 
Evaluating the Medical Literature
Evaluating the Medical LiteratureEvaluating the Medical Literature
Evaluating the Medical Literature
 
Measuring clinical utility: uncertainty in Net Benefit
Measuring clinical utility: uncertainty in Net BenefitMeasuring clinical utility: uncertainty in Net Benefit
Measuring clinical utility: uncertainty in Net Benefit
 
20160223 patient experience2
20160223 patient experience220160223 patient experience2
20160223 patient experience2
 
Dichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatisticianDichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatistician
 
Clinical prediction models
Clinical prediction modelsClinical prediction models
Clinical prediction models
 
Pediatric Adverse Drug Events Presentation
Pediatric Adverse Drug Events PresentationPediatric Adverse Drug Events Presentation
Pediatric Adverse Drug Events Presentation
 
Integrated ACO selected for the NAACOS Innovation Showcase
Integrated ACO selected for the NAACOS Innovation ShowcaseIntegrated ACO selected for the NAACOS Innovation Showcase
Integrated ACO selected for the NAACOS Innovation Showcase
 

Dernier

Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...nirzagarg
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...gajnagarg
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...gajnagarg
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制vexqp
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...SOFTTECHHUB
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangeThinkInnovation
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...nirzagarg
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubaikojalkojal131
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...gajnagarg
 
20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdfkhraisr
 
Kings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about themKings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about themeitharjee
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRajesh Mondal
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Klinik kandungan
 
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...HyderabadDolls
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNKTimothy Spann
 
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...kumargunjan9515
 

Dernier (20)

Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
 
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf
 
Kings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about themKings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about them
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for Research
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
 

A plea for good methodology when developing clinical prediction models

  • 1. A plea for good methodology: the strengths and limitations of approaches to developing prediction models in obstetrics and gynecology Ben Van Calster Department of Development and Regeneration, KU Leuven (B) Department of Biomedical Data Sciences, LUMC (NL) Research Ethics Committee, University Hospitals Leuven (B) Epi-Centre, KU Leuven (B) Glasgow/Leuven, October 16th 2020
  • 2. 2
  • 3. To explain or to predict? DESCRIBE / EXPLAIN • Study independent associations / predictors / risk factors • Key: effect size per variable • Not prediction modeling! PREDICT • Obtain a system that gives predictions (risk estimates) • Aim is the use in NEW patients: it should work ‘tomorrow’, not now • Key: quality of the predictions 3
  • 4. Strengths of prediction models • Help in (shared) clinical decision making • Objectify predictions • Patient counseling • Effect on clinical workflow and outcomes GOOD METHODOLOGY AND GOOD REPORTING ARE ESSENTIAL! 4 Beam and Kohane. JAMA 2018;319:1317-8.
  • 5. Get the objective right 5 Riley. Nature 2019;572:27-9. Cronin & Vickers. Urology 2010;76:1298-301.
  • 6. Get the objective right • Is there a real clinical need for a new model? • For which outcome, and for which management decision? • When during the clinical workflow should the prediction be made? • Does this match with the timing of the predictors? • Do you have/can you collect data that is (really) fit for purpose? 6
  • 7. Example 7 Tangiisuran et al. PLoS One 2014;9:e111254.
  • 8. Too many models, too few validations • 1060 models predicting outcomes after CVD (1990-2015) (Wessler et al, 2017) • 363 models predicting CVD (Damen et al, 2016) • 231 models related to Covid-19 (Wynants et al, 2020; living syst review) ObGyn related: • 263 models in obstetrics (Kleinrouweler et al, 2016) • 116 models to diagnose ovarian malignancy (Kaijser et al, 2014)  Perhaps academic CVs need help, but patients need help more 8 Thanks to @GSCollins Wessler et al. Diagn Progn Res 2017;1:20. Damen et al. BMJ 2016;353:i2416. Wynants et al. BMJ 2020;369:m1328. Kleinrouweler et al. AJOG 2016;214:79-90. Kaijser et al. Hum Reprod Update 2014;20:229-62.
  • 9. Models in obstetrics Only 23 of 263 models (9%) have been externally validated! 9 Kleinrouweler et al. AJOG 2016;214:79-90.
  • 10. Knowledge is power (1) Avoid dichotomization of continuous predictor variables • Biologically implausible • Deletes information, worse predictions (AUC ) (Collins 2016; Steyerberg 2018) • Only clinical decisions should be binary 10 Collins et al. Stat Med 2016;35:4124-35. Steyerberg et al. J Clin Epidemiol 2018;98:133-43. Butts & Ng. Statistical and methodological myths and urban legends, p361-86. Routledge/Taylor & Francis, 2009.
  • 11. Knowledge is power (2) Use available knowledge, do not always ask the data! 11 Good & Hardin. Common errors in statistics (and how to avoid them). Wiley, 2006. “Bypassing the brain to compute by reflex is a sure recipe for disaster”
  • 12. Knowledge is power (3) Explain how and when predictors are measured, standardize where reasonably possible - Units; e.g. progesterone in ng/ml or nmol/L - How tumor volume or diameter is calculated - What is meant by ‘hormonal therapy use’ (Which? When?) - Smoking - BMI: measured vs self-reported If measurement varies across studies, model performance deteriorates (Luijken, 2019; Luijken, 2020) 12 Luijken et al. Stat Med 2019;38:3444-59. Luijken et al. J Clin Epidemiol 2020;119:7-18.
  • 13. Knowledge is power (4): sample size You think of buying a Porsche. But if you do not want to pay for it, you may get this. The same applies for developing risk models. 13
  • 14. The currency is sample size The more complicated (or ‘fancy’) the modeling strategy, the more you have to pay with sample size. (counterfeit money does not help: we need good quality data) In this respect, avoid train-test split, this reduces sample size for model development: you’re burning your money 14
  • 15. The currency is sample size Many have heard of the “10 events per variable” rule 1. Often incorrect use: This is not about 10 patients per variable in the final model! 2. This is outdated, 10 EPV is often not enough. See new procedure (BMJ 2020). 3. Flexible algorithms are data hungry, EPV>>10 may be needed (van der Ploeg 2014). 15 Van der Ploeg et al. BMC Med Res Methodol 2014;14:137 Riley et al. BMJ 2020;368:m441.
  • 16. Knowledge is power (5): missing data Usually, “empty cells” are “full of information”! Using only complete cases - decreases sample size (less money) - typically leaves a non-representative sample (biased risk estimates) Presence of a test can be more predictive than the test result! See EHR data. 16 Agniel et al. BMJ 2018;360:k1479.
  • 17. Model validation: assess calibration! Key elements of model performance: discrimination between patients with and without the event calibration (correctness) of risk estimates 17 DISCRIMINATION When it rained, was the estimated chance of rain higher (on average)? CALIBRATION For days with 80% estimated chance of rain, did it rain on 8 out of 10 days?
  • 18. Calibration: the Achilles heel 18 Van Calster & Vickers. Med Decis Making 2015;35:162-9. Van Calster et al. BMC Med 2019;17:230. Miscalibration: estimated risk is inaccurate  Patient and clinician are misinformed, may lead to inappropriate decisions (Van Calster & Vickers, 2015)
  • 19. Performance depends on place and time One external validation in one hospital does not tell much about a model! “There is no such thing as a validated model”  Study heterogeneity 19 Van Calster et al. BMJ 2020;370:m2614.
  • 20. P-values and significance testing Very small role in prediction modeling - Focus is on robust predictions - Focus is on precision of the performance estimates (e.g. AUC, calibration) - Focus is on quantifying heterogeneity - Focus is on qualitative difference between populations - Focus is on a priori selection of predictors - further data-driven selection can be based on p-values; high alpha recommended (Steyerberg & Van Calster, 2020) 20 Steyerberg & Van Calster. Eur J Clin Invest 2020;50:e13229.
  • 21. Machine learning popularity 21 “Typical machine learning algorithms are highly flexible So will uncover associations we could not find before Hence better predictions and management decisions” → One of the master keys, with guaranteed success!
  • 22. Machine Learning: success guaranteed? 22 Christodoulou et al. J Clin Epidemiol 2019;110:12-22.
  • 23. Poor methodology and reporting is common 23 Christodoulou et al (2019) – 71 studies: - What was done about missing data? 100% poor or unclear - How was performance validated? 68% unclear or biased approach - Was calibration of risk estimates studied? 79% not at all - Prognostic models: time horizon often ignored completely Kleinrouweler et al (2016) – 263 models: - Was calibration studied? 82% not at all - Was the model fully presented so people can use it? Not for 38% of models - Was the clinical use discussed? Not for 89% of models FOLLOW TRIPOD GUIDELINES FOR REPORTING! www.tripod-statement.org Christodoulou et al. J Clin Epidemiol 2019;110:12-22. Kleinrouweler et al. AJOG 2016;214:79-90. Moons et al. Ann Intern Med 2015;162:w1-73.
  • 24. The harm of poor methodology 24 Steyerberg et al. J Clin Epidemiol 2018;98:133-43.
  • 25. Resources on prediction modeling 25 Involve a statistician with knowledge of prediction modeling! Steyerberg EW. Clinical prediction models (2nd ed). Springer, 2019. Riley RD et al. Prognosis research in healthcare. OUP, 2019. Moons KGM et al. Transparent reporting of a multivariable prediction model for individual prognosis and diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med 2015;162:W1-73. Wynants L et al. Key steps and common pitfalls in developing and validating risk models. BJOG 2017; 2017;124:423-432. Prognosisresearch.com (newly launched website)