SlideShare une entreprise Scribd logo
1  sur  41
Télécharger pour lire hors ligne
Thoughts on
Machine Learning and Artificial Intelligence
Maarten van Smeden, PhD

Leiden University Medical Center, Netherlands

STRATOS Lorenz Meeting

21/09/2018
Interested reader perspective
• Statistician by training

• Limited experience applying machine learning techniques

• Three examples that I think are illustrative for ML/AI in medicine
as it is applied nowadays

• Focus: prediction
Tech company business model
Apple Watch 4
FDA Approval
https://www.statnews.com/2018/09/13/heres-the-data-behind-the-new-apple-watch-ekg-app/?mc_cid=0fbfd65c13&mc_eid=75f1d5aea2
Impressive artificial intelligence
IBM Watson win against 2 Jeopardy’s champions in 2011
Reviewer #2
Less impressive artificial intelligence
Warning!
Statistical policing going on
Yesterday’s news
http://www.timvanderzee.com/the-wansink-dossier-an-overview/

Example 1: ML predicting mortality
• Caliber dataset (UK, EHR)

• N = 80,000 pre-existing coronary artery disease

• Predict all cause mortality (18,000 events, time horizon unclear)

• “used Cox models, random forests and elastic net regression”

• 586 candidate predictors vs 27 pre-selected variables

• Complete case / multiple imputation / missing indicator method

• Cox models: linear main effects only

• Split sample (1/3 test, 2/3 training)
Example 1: ML predicting mortality
Example 1: ML predicting mortality
Example 1: ML predicting mortality
One take
Linear regression is an example of
Machine Learning?
If so, what isn’t Machine Learning?
Perhaps more reasonable?
Beam & Kohane, JAMA, 2018
Example 2: lymph node metastases
Example 2: lymph node metastases
Example 2: lymph node metastases
• Researcher challenge competition

• Whole slide images of women diagnosed with breast cancer

• Training data: N = 270 (110 events); test data: N = 129 (49 events)

• 11 pathologists evaluating the test data

• 390 teams signed up for the competition

• 23 teams submitted 32 algorithms for evaluation
Example 2: lymph node metastases
Example 2: lymph node metastases
• Unfair comparison between pathologists and DL

• Pathologists no access to regularly available diagnostics

• AUC comparison DL (continuous) vs pathologists (5-item
scale) 

• Promising algorithms overrepresented (390 teams -> 32
algorithms submitted)
Example 2: lymph node metastases
• No attention to risk prediction / calibration

• ML: attention classification only without probability

• Hugh (often implicit) difference between the traditional (risk)
prediction modeling in medicine and (traditional ML)

• Probably fine for Netflix recommendations; not so much for
real life medical decision making
Misuse of “risk"
Example 3: 5 types of diabetes
Example 3: 5 types of diabetes
Example 3: 5 types of diabetes
• Patients with newly diagnosed diabetes (N = 8980) 

• 6 continuous variables 

• K-means clustering (‘unsupervised learning’)
Example 3: 5 types of diabetes
Example 3: 5 types of diabetes
BS detection simulation
• Data generated from 2 independent MVN-distributions with .3 equal pairwise correlations 

• “Sunday morning simulations”, code: https://github.com/MvanSmeden/DiabetesClusters
K-means clustering
“K-means finds a Voronoi partition, only if that partition coincides with a
"clustering" does it have a hope of actually doing clustering”

Max Little: https://twitter.com/MaxALittle/status/970277900871262213
Freak examples?
Probably?
Maybe?
What I observe is:
• Confusion and disagreement about what is and isn’t ML/AI 

• Analyses labeled “ML/AI” have a tendency to concentrate on
classification (exceptions exist, e.g. high dimensional PS
approaches suggested that are called “ML”) 

• Analyses labeled “ML/AI” in medicine are surprisingly often
done by people not thoroughly trained in statistics

• Basic statistical principles are often forgotten or ignored (e.g.
improper scoring rules)
Concluding remarks (1)
• Just because an algorithm is novel or flexible doesn’t mean it is
any good, obviously

• Dismissing the potential value of novel “ML/AI” algorithms out-
of-hand doesn’t make sense

• We need more realistic simulations and many applications to
compare the traditional vs more novel / flexible algorithms

• The primary issue in medical applications seems to be with the
modelers not so much with the models
Concluding remarks (2)
• Statisticians should be more involved in the application and
evaluation of novel / flexible algorithms, especially for risk
prediction

• Statisticians should be involved in studying performance of
novel / flexible algorithms (e.g. data hungriness) -> realistic
simulation studies

• Collaboration with computer scientists

• Computationally intensive -> may not be cheap

• Serious experimental design and reporting
Simulation is…
“…it is using simulation for multiplication that I find objectionable. Eight patients are
eight patients and so should remain.”
“All the impressive achievements of
deep learning amount to just curve
fitting”
Judea Pearl
Thoughts on Machine Learning and Artificial Intelligence
Thoughts on Machine Learning and Artificial Intelligence

Contenu connexe

Tendances

Clinical prediction models: development, validation and beyond
Clinical prediction models:development, validation and beyondClinical prediction models:development, validation and beyond
Clinical prediction models: development, validation and beyondMaarten van Smeden
 
The basics of prediction modeling
The basics of prediction modeling The basics of prediction modeling
The basics of prediction modeling Maarten van Smeden
 
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019Ewout Steyerberg
 
Dichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatisticianDichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatisticianLaure Wynants
 
How to establish and evaluate clinical prediction models - Statswork
How to establish and evaluate clinical prediction models - StatsworkHow to establish and evaluate clinical prediction models - Statswork
How to establish and evaluate clinical prediction models - StatsworkStats Statswork
 
Introduction to prediction modelling - Berlin 2018 - Part I
Introduction to prediction modelling - Berlin 2018 - Part IIntroduction to prediction modelling - Berlin 2018 - Part I
Introduction to prediction modelling - Berlin 2018 - Part IMaarten van Smeden
 
Why the EPV≥10 sample size rule is rubbish and what to use instead
Why the EPV≥10 sample size rule is rubbish and what to use instead Why the EPV≥10 sample size rule is rubbish and what to use instead
Why the EPV≥10 sample size rule is rubbish and what to use instead Maarten van Smeden
 
Evaluation of the clinical value of biomarkers for risk prediction
Evaluation of the clinical value of biomarkers for risk predictionEvaluation of the clinical value of biomarkers for risk prediction
Evaluation of the clinical value of biomarkers for risk predictionEwout Steyerberg
 
Calibration of risk prediction models: decision making with the lights on or ...
Calibration of risk prediction models: decision making with the lights on or ...Calibration of risk prediction models: decision making with the lights on or ...
Calibration of risk prediction models: decision making with the lights on or ...BenVanCalster
 
ML and AI: a blessing and curse for statisticians and medical doctors
ML and AI: a blessing and curse forstatisticians and medical doctorsML and AI: a blessing and curse forstatisticians and medical doctors
ML and AI: a blessing and curse for statisticians and medical doctorsMaarten van Smeden
 
Machine learning in medicine: calm down
Machine learning in medicine: calm downMachine learning in medicine: calm down
Machine learning in medicine: calm downBenVanCalster
 
Bias in covid 19 models
Bias in covid 19 modelsBias in covid 19 models
Bias in covid 19 modelsLaure Wynants
 
Development and evaluation of prediction models: pitfalls and solutions (Part...
Development and evaluation of prediction models: pitfalls and solutions (Part...Development and evaluation of prediction models: pitfalls and solutions (Part...
Development and evaluation of prediction models: pitfalls and solutions (Part...BenVanCalster
 
Str-AI-ght to heaven? Pitfalls for clinical decision support based on AI
Str-AI-ght to heaven? Pitfalls for clinical decision support based on AIStr-AI-ght to heaven? Pitfalls for clinical decision support based on AI
Str-AI-ght to heaven? Pitfalls for clinical decision support based on AIBenVanCalster
 
Open science LMU session contribution E Steyerberg 2jul20
Open science LMU session contribution E Steyerberg 2jul20Open science LMU session contribution E Steyerberg 2jul20
Open science LMU session contribution E Steyerberg 2jul20Ewout Steyerberg
 
Prediction research in a pandemic: 3 lessons from a living systematic review ...
Prediction research in a pandemic: 3 lessons from a living systematic review ...Prediction research in a pandemic: 3 lessons from a living systematic review ...
Prediction research in a pandemic: 3 lessons from a living systematic review ...Laure Wynants
 
Measurement error in medical research
Measurement error in medical researchMeasurement error in medical research
Measurement error in medical researchMaarten van Smeden
 
Prediction models for diagnosis and prognosis related to COVID-19
Prediction models for diagnosis and prognosis related to COVID-19Prediction models for diagnosis and prognosis related to COVID-19
Prediction models for diagnosis and prognosis related to COVID-19Maarten van Smeden
 
Correcting for missing data, measurement error and confounding
Correcting for missing data, measurement error and confoundingCorrecting for missing data, measurement error and confounding
Correcting for missing data, measurement error and confoundingMaarten van Smeden
 
COVID-19 related prediction models for diagnosis and prognosis - a living sys...
COVID-19 related prediction models for diagnosis and prognosis - a living sys...COVID-19 related prediction models for diagnosis and prognosis - a living sys...
COVID-19 related prediction models for diagnosis and prognosis - a living sys...Maarten van Smeden
 

Tendances (20)

Clinical prediction models: development, validation and beyond
Clinical prediction models:development, validation and beyondClinical prediction models:development, validation and beyond
Clinical prediction models: development, validation and beyond
 
The basics of prediction modeling
The basics of prediction modeling The basics of prediction modeling
The basics of prediction modeling
 
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
 
Dichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatisticianDichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatistician
 
How to establish and evaluate clinical prediction models - Statswork
How to establish and evaluate clinical prediction models - StatsworkHow to establish and evaluate clinical prediction models - Statswork
How to establish and evaluate clinical prediction models - Statswork
 
Introduction to prediction modelling - Berlin 2018 - Part I
Introduction to prediction modelling - Berlin 2018 - Part IIntroduction to prediction modelling - Berlin 2018 - Part I
Introduction to prediction modelling - Berlin 2018 - Part I
 
Why the EPV≥10 sample size rule is rubbish and what to use instead
Why the EPV≥10 sample size rule is rubbish and what to use instead Why the EPV≥10 sample size rule is rubbish and what to use instead
Why the EPV≥10 sample size rule is rubbish and what to use instead
 
Evaluation of the clinical value of biomarkers for risk prediction
Evaluation of the clinical value of biomarkers for risk predictionEvaluation of the clinical value of biomarkers for risk prediction
Evaluation of the clinical value of biomarkers for risk prediction
 
Calibration of risk prediction models: decision making with the lights on or ...
Calibration of risk prediction models: decision making with the lights on or ...Calibration of risk prediction models: decision making with the lights on or ...
Calibration of risk prediction models: decision making with the lights on or ...
 
ML and AI: a blessing and curse for statisticians and medical doctors
ML and AI: a blessing and curse forstatisticians and medical doctorsML and AI: a blessing and curse forstatisticians and medical doctors
ML and AI: a blessing and curse for statisticians and medical doctors
 
Machine learning in medicine: calm down
Machine learning in medicine: calm downMachine learning in medicine: calm down
Machine learning in medicine: calm down
 
Bias in covid 19 models
Bias in covid 19 modelsBias in covid 19 models
Bias in covid 19 models
 
Development and evaluation of prediction models: pitfalls and solutions (Part...
Development and evaluation of prediction models: pitfalls and solutions (Part...Development and evaluation of prediction models: pitfalls and solutions (Part...
Development and evaluation of prediction models: pitfalls and solutions (Part...
 
Str-AI-ght to heaven? Pitfalls for clinical decision support based on AI
Str-AI-ght to heaven? Pitfalls for clinical decision support based on AIStr-AI-ght to heaven? Pitfalls for clinical decision support based on AI
Str-AI-ght to heaven? Pitfalls for clinical decision support based on AI
 
Open science LMU session contribution E Steyerberg 2jul20
Open science LMU session contribution E Steyerberg 2jul20Open science LMU session contribution E Steyerberg 2jul20
Open science LMU session contribution E Steyerberg 2jul20
 
Prediction research in a pandemic: 3 lessons from a living systematic review ...
Prediction research in a pandemic: 3 lessons from a living systematic review ...Prediction research in a pandemic: 3 lessons from a living systematic review ...
Prediction research in a pandemic: 3 lessons from a living systematic review ...
 
Measurement error in medical research
Measurement error in medical researchMeasurement error in medical research
Measurement error in medical research
 
Prediction models for diagnosis and prognosis related to COVID-19
Prediction models for diagnosis and prognosis related to COVID-19Prediction models for diagnosis and prognosis related to COVID-19
Prediction models for diagnosis and prognosis related to COVID-19
 
Correcting for missing data, measurement error and confounding
Correcting for missing data, measurement error and confoundingCorrecting for missing data, measurement error and confounding
Correcting for missing data, measurement error and confounding
 
COVID-19 related prediction models for diagnosis and prognosis - a living sys...
COVID-19 related prediction models for diagnosis and prognosis - a living sys...COVID-19 related prediction models for diagnosis and prognosis - a living sys...
COVID-19 related prediction models for diagnosis and prognosis - a living sys...
 

Similaire à Thoughts on Machine Learning and Artificial Intelligence

Theory and Practice of Integrating Machine Learning and Conventional Statisti...
Theory and Practice of Integrating Machine Learning and Conventional Statisti...Theory and Practice of Integrating Machine Learning and Conventional Statisti...
Theory and Practice of Integrating Machine Learning and Conventional Statisti...University of Malaya
 
Developing and validating statistical models for clinical prediction and prog...
Developing and validating statistical models for clinical prediction and prog...Developing and validating statistical models for clinical prediction and prog...
Developing and validating statistical models for clinical prediction and prog...Evangelos Kritsotakis
 
Medical Informatics: Computational Analytics in Healthcare
Medical Informatics: Computational Analytics in HealthcareMedical Informatics: Computational Analytics in Healthcare
Medical Informatics: Computational Analytics in HealthcareNUS-ISS
 
staistical analysis ppt of CADD.pptx
staistical analysis ppt of CADD.pptxstaistical analysis ppt of CADD.pptx
staistical analysis ppt of CADD.pptxSailajaReddyGunnam
 
Presentation (9).pptx
Presentation (9).pptxPresentation (9).pptx
Presentation (9).pptxAmitMasand5
 
Informatics and the merging of research and quality measures with bedside care
Informatics and the merging of research and quality measures with bedside careInformatics and the merging of research and quality measures with bedside care
Informatics and the merging of research and quality measures with bedside careMike Hogarth, MD, FACMI, FACP
 
Draft AMCP 2006 Model Quality 4-4-06
Draft AMCP 2006 Model Quality 4-4-06Draft AMCP 2006 Model Quality 4-4-06
Draft AMCP 2006 Model Quality 4-4-06Joe Gricar, MS
 
Shing Lee MedicReS World Congress 2015
Shing Lee MedicReS World Congress 2015Shing Lee MedicReS World Congress 2015
Shing Lee MedicReS World Congress 2015MedicReS
 
COMPUTERS IN PHARMACEUTICAL DEVELOPMENT
COMPUTERS IN PHARMACEUTICAL DEVELOPMENTCOMPUTERS IN PHARMACEUTICAL DEVELOPMENT
COMPUTERS IN PHARMACEUTICAL DEVELOPMENTArunpandiyan59
 
Interpreting Complex Real World Data for Pharmaceutical Research
Interpreting Complex Real World Data for Pharmaceutical ResearchInterpreting Complex Real World Data for Pharmaceutical Research
Interpreting Complex Real World Data for Pharmaceutical ResearchPaul Agapow
 
The methodology for handling missing data during development of predictive model
The methodology for handling missing data during development of predictive modelThe methodology for handling missing data during development of predictive model
The methodology for handling missing data during development of predictive modelpingxiaoou
 
The methodology for handling missing data during development of predictive model
The methodology for handling missing data during development of predictive modelThe methodology for handling missing data during development of predictive model
The methodology for handling missing data during development of predictive modelpingxiaoou
 
A review on early hospital mortality prediction using vital signals
A review on early hospital mortality prediction using vital signalsA review on early hospital mortality prediction using vital signals
A review on early hospital mortality prediction using vital signalsReza Sadeghi
 
D1S1T3N4_Pratibha Jalui & Reetabrata Bhattacharyya
D1S1T3N4_Pratibha Jalui & Reetabrata BhattacharyyaD1S1T3N4_Pratibha Jalui & Reetabrata Bhattacharyya
D1S1T3N4_Pratibha Jalui & Reetabrata BhattacharyyaReetabrata Bhattacharyya
 
Chase presentation
Chase presentationChase presentation
Chase presentationReza Sadeghi
 
Health advances ai in diagnostic development
Health advances ai in diagnostic developmentHealth advances ai in diagnostic development
Health advances ai in diagnostic developmentHealth Advances
 

Similaire à Thoughts on Machine Learning and Artificial Intelligence (20)

Theory and Practice of Integrating Machine Learning and Conventional Statisti...
Theory and Practice of Integrating Machine Learning and Conventional Statisti...Theory and Practice of Integrating Machine Learning and Conventional Statisti...
Theory and Practice of Integrating Machine Learning and Conventional Statisti...
 
Developing and validating statistical models for clinical prediction and prog...
Developing and validating statistical models for clinical prediction and prog...Developing and validating statistical models for clinical prediction and prog...
Developing and validating statistical models for clinical prediction and prog...
 
Medical Informatics: Computational Analytics in Healthcare
Medical Informatics: Computational Analytics in HealthcareMedical Informatics: Computational Analytics in Healthcare
Medical Informatics: Computational Analytics in Healthcare
 
staistical analysis ppt of CADD.pptx
staistical analysis ppt of CADD.pptxstaistical analysis ppt of CADD.pptx
staistical analysis ppt of CADD.pptx
 
Presentation (9).pptx
Presentation (9).pptxPresentation (9).pptx
Presentation (9).pptx
 
Informatics and the merging of research and quality measures with bedside care
Informatics and the merging of research and quality measures with bedside careInformatics and the merging of research and quality measures with bedside care
Informatics and the merging of research and quality measures with bedside care
 
Draft AMCP 2006 Model Quality 4-4-06
Draft AMCP 2006 Model Quality 4-4-06Draft AMCP 2006 Model Quality 4-4-06
Draft AMCP 2006 Model Quality 4-4-06
 
Shing Lee MedicReS World Congress 2015
Shing Lee MedicReS World Congress 2015Shing Lee MedicReS World Congress 2015
Shing Lee MedicReS World Congress 2015
 
COMPUTERS IN PHARMACEUTICAL DEVELOPMENT
COMPUTERS IN PHARMACEUTICAL DEVELOPMENTCOMPUTERS IN PHARMACEUTICAL DEVELOPMENT
COMPUTERS IN PHARMACEUTICAL DEVELOPMENT
 
man0 ppt.pptx
man0 ppt.pptxman0 ppt.pptx
man0 ppt.pptx
 
Interpreting Complex Real World Data for Pharmaceutical Research
Interpreting Complex Real World Data for Pharmaceutical ResearchInterpreting Complex Real World Data for Pharmaceutical Research
Interpreting Complex Real World Data for Pharmaceutical Research
 
poster_Reza
poster_Rezaposter_Reza
poster_Reza
 
Overview of statistical tests: Data handling and data quality (Part II)
Overview of statistical tests: Data handling and data quality (Part II)Overview of statistical tests: Data handling and data quality (Part II)
Overview of statistical tests: Data handling and data quality (Part II)
 
The methodology for handling missing data during development of predictive model
The methodology for handling missing data during development of predictive modelThe methodology for handling missing data during development of predictive model
The methodology for handling missing data during development of predictive model
 
The methodology for handling missing data during development of predictive model
The methodology for handling missing data during development of predictive modelThe methodology for handling missing data during development of predictive model
The methodology for handling missing data during development of predictive model
 
A review on early hospital mortality prediction using vital signals
A review on early hospital mortality prediction using vital signalsA review on early hospital mortality prediction using vital signals
A review on early hospital mortality prediction using vital signals
 
D1S1T3N4_Pratibha Jalui & Reetabrata Bhattacharyya
D1S1T3N4_Pratibha Jalui & Reetabrata BhattacharyyaD1S1T3N4_Pratibha Jalui & Reetabrata Bhattacharyya
D1S1T3N4_Pratibha Jalui & Reetabrata Bhattacharyya
 
Presentation CIOMS VIII
Presentation CIOMS VIIIPresentation CIOMS VIII
Presentation CIOMS VIII
 
Chase presentation
Chase presentationChase presentation
Chase presentation
 
Health advances ai in diagnostic development
Health advances ai in diagnostic developmentHealth advances ai in diagnostic development
Health advances ai in diagnostic development
 

Plus de Maarten van Smeden

Rage against the machine learning 2023
Rage against the machine learning 2023Rage against the machine learning 2023
Rage against the machine learning 2023Maarten van Smeden
 
A gentle introduction to AI for medicine
A gentle introduction to AI for medicineA gentle introduction to AI for medicine
A gentle introduction to AI for medicineMaarten van Smeden
 
Improving epidemiological research: avoiding the statistical paradoxes and fa...
Improving epidemiological research: avoiding the statistical paradoxes and fa...Improving epidemiological research: avoiding the statistical paradoxes and fa...
Improving epidemiological research: avoiding the statistical paradoxes and fa...Maarten van Smeden
 
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...Maarten van Smeden
 
Guideline for high-quality diagnostic and prognostic applications of AI in he...
Guideline for high-quality diagnostic and prognostic applications of AI in he...Guideline for high-quality diagnostic and prognostic applications of AI in he...
Guideline for high-quality diagnostic and prognostic applications of AI in he...Maarten van Smeden
 
Prognosis-based medicine: merits and pitfalls of forecasting patient health
Prognosis-based medicine: merits and pitfalls of forecasting patient healthPrognosis-based medicine: merits and pitfalls of forecasting patient health
Prognosis-based medicine: merits and pitfalls of forecasting patient healthMaarten van Smeden
 
Algorithm based medicine: old statistics wine in new machine learning bottles?
Algorithm based medicine: old statistics wine in new machine learning bottles?Algorithm based medicine: old statistics wine in new machine learning bottles?
Algorithm based medicine: old statistics wine in new machine learning bottles?Maarten van Smeden
 
Clinical prediction models for covid-19: alarming results from a living syste...
Clinical prediction models for covid-19: alarming results from a living syste...Clinical prediction models for covid-19: alarming results from a living syste...
Clinical prediction models for covid-19: alarming results from a living syste...Maarten van Smeden
 
Five questions about artificial intelligence
Five questions about artificial intelligenceFive questions about artificial intelligence
Five questions about artificial intelligenceMaarten van Smeden
 
Living systematic reviews: now and in the future
Living systematic reviews: now and in the futureLiving systematic reviews: now and in the future
Living systematic reviews: now and in the futureMaarten van Smeden
 
The statistics of the coronavirus
The statistics of the coronavirusThe statistics of the coronavirus
The statistics of the coronavirusMaarten van Smeden
 
Anatomy of a successful science thread
Anatomy of a successful science threadAnatomy of a successful science thread
Anatomy of a successful science threadMaarten van Smeden
 

Plus de Maarten van Smeden (18)

Uncertainty in AI
Uncertainty in AIUncertainty in AI
Uncertainty in AI
 
UMC Utrecht AI Methods Lab
UMC Utrecht AI Methods LabUMC Utrecht AI Methods Lab
UMC Utrecht AI Methods Lab
 
Rage against the machine learning 2023
Rage against the machine learning 2023Rage against the machine learning 2023
Rage against the machine learning 2023
 
A gentle introduction to AI for medicine
A gentle introduction to AI for medicineA gentle introduction to AI for medicine
A gentle introduction to AI for medicine
 
Associate professor lecture
Associate professor lectureAssociate professor lecture
Associate professor lecture
 
Improving epidemiological research: avoiding the statistical paradoxes and fa...
Improving epidemiological research: avoiding the statistical paradoxes and fa...Improving epidemiological research: avoiding the statistical paradoxes and fa...
Improving epidemiological research: avoiding the statistical paradoxes and fa...
 
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
 
Guideline for high-quality diagnostic and prognostic applications of AI in he...
Guideline for high-quality diagnostic and prognostic applications of AI in he...Guideline for high-quality diagnostic and prognostic applications of AI in he...
Guideline for high-quality diagnostic and prognostic applications of AI in he...
 
Predictimands
PredictimandsPredictimands
Predictimands
 
Prognosis-based medicine: merits and pitfalls of forecasting patient health
Prognosis-based medicine: merits and pitfalls of forecasting patient healthPrognosis-based medicine: merits and pitfalls of forecasting patient health
Prognosis-based medicine: merits and pitfalls of forecasting patient health
 
Algorithm based medicine
Algorithm based medicineAlgorithm based medicine
Algorithm based medicine
 
Algorithm based medicine: old statistics wine in new machine learning bottles?
Algorithm based medicine: old statistics wine in new machine learning bottles?Algorithm based medicine: old statistics wine in new machine learning bottles?
Algorithm based medicine: old statistics wine in new machine learning bottles?
 
Clinical prediction models for covid-19: alarming results from a living syste...
Clinical prediction models for covid-19: alarming results from a living syste...Clinical prediction models for covid-19: alarming results from a living syste...
Clinical prediction models for covid-19: alarming results from a living syste...
 
Five questions about artificial intelligence
Five questions about artificial intelligenceFive questions about artificial intelligence
Five questions about artificial intelligence
 
Living systematic reviews: now and in the future
Living systematic reviews: now and in the futureLiving systematic reviews: now and in the future
Living systematic reviews: now and in the future
 
Voorspelmodellen en COVID-19
Voorspelmodellen en COVID-19Voorspelmodellen en COVID-19
Voorspelmodellen en COVID-19
 
The statistics of the coronavirus
The statistics of the coronavirusThe statistics of the coronavirus
The statistics of the coronavirus
 
Anatomy of a successful science thread
Anatomy of a successful science threadAnatomy of a successful science thread
Anatomy of a successful science thread
 

Dernier

module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learninglevieagacer
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsOrtegaSyrineMay
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceAlex Henderson
 
An introduction on sequence tagged site mapping
An introduction on sequence tagged site mappingAn introduction on sequence tagged site mapping
An introduction on sequence tagged site mappingadibshanto115
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Silpa
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...Scintica Instrumentation
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Silpa
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryAlex Henderson
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flyPRADYUMMAURYA1
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.Silpa
 
Exploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfExploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfrohankumarsinghrore1
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxRenuJangid3
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)Areesha Ahmad
 
Introduction of DNA analysis in Forensic's .pptx
Introduction of DNA analysis in Forensic's .pptxIntroduction of DNA analysis in Forensic's .pptx
Introduction of DNA analysis in Forensic's .pptxrohankumarsinghrore1
 
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...Monika Rani
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .Poonam Aher Patil
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Serviceshivanisharma5244
 

Dernier (20)

module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its Functions
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
An introduction on sequence tagged site mapping
An introduction on sequence tagged site mappingAn introduction on sequence tagged site mapping
An introduction on sequence tagged site mapping
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
 
Exploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfExploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdf
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptx
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
Introduction of DNA analysis in Forensic's .pptx
Introduction of DNA analysis in Forensic's .pptxIntroduction of DNA analysis in Forensic's .pptx
Introduction of DNA analysis in Forensic's .pptx
 
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 

Thoughts on Machine Learning and Artificial Intelligence

  • 1. Thoughts on Machine Learning and Artificial Intelligence Maarten van Smeden, PhD Leiden University Medical Center, Netherlands STRATOS Lorenz Meeting 21/09/2018
  • 2. Interested reader perspective • Statistician by training • Limited experience applying machine learning techniques • Three examples that I think are illustrative for ML/AI in medicine as it is applied nowadays • Focus: prediction
  • 4.
  • 7.
  • 8. Impressive artificial intelligence IBM Watson win against 2 Jeopardy’s champions in 2011
  • 14. Example 1: ML predicting mortality • Caliber dataset (UK, EHR) • N = 80,000 pre-existing coronary artery disease • Predict all cause mortality (18,000 events, time horizon unclear) • “used Cox models, random forests and elastic net regression” • 586 candidate predictors vs 27 pre-selected variables • Complete case / multiple imputation / missing indicator method • Cox models: linear main effects only • Split sample (1/3 test, 2/3 training)
  • 15. Example 1: ML predicting mortality
  • 16. Example 1: ML predicting mortality
  • 17. Example 1: ML predicting mortality
  • 18. One take Linear regression is an example of Machine Learning? If so, what isn’t Machine Learning?
  • 19. Perhaps more reasonable? Beam & Kohane, JAMA, 2018
  • 20. Example 2: lymph node metastases
  • 21. Example 2: lymph node metastases
  • 22. Example 2: lymph node metastases • Researcher challenge competition • Whole slide images of women diagnosed with breast cancer • Training data: N = 270 (110 events); test data: N = 129 (49 events) • 11 pathologists evaluating the test data • 390 teams signed up for the competition • 23 teams submitted 32 algorithms for evaluation
  • 23. Example 2: lymph node metastases
  • 24. Example 2: lymph node metastases • Unfair comparison between pathologists and DL • Pathologists no access to regularly available diagnostics • AUC comparison DL (continuous) vs pathologists (5-item scale) • Promising algorithms overrepresented (390 teams -> 32 algorithms submitted)
  • 25. Example 2: lymph node metastases • No attention to risk prediction / calibration • ML: attention classification only without probability • Hugh (often implicit) difference between the traditional (risk) prediction modeling in medicine and (traditional ML) • Probably fine for Netflix recommendations; not so much for real life medical decision making
  • 27. Example 3: 5 types of diabetes
  • 28. Example 3: 5 types of diabetes
  • 29. Example 3: 5 types of diabetes • Patients with newly diagnosed diabetes (N = 8980) • 6 continuous variables • K-means clustering (‘unsupervised learning’)
  • 30. Example 3: 5 types of diabetes
  • 31. Example 3: 5 types of diabetes
  • 32. BS detection simulation • Data generated from 2 independent MVN-distributions with .3 equal pairwise correlations • “Sunday morning simulations”, code: https://github.com/MvanSmeden/DiabetesClusters
  • 33. K-means clustering “K-means finds a Voronoi partition, only if that partition coincides with a "clustering" does it have a hope of actually doing clustering” Max Little: https://twitter.com/MaxALittle/status/970277900871262213
  • 35. What I observe is: • Confusion and disagreement about what is and isn’t ML/AI • Analyses labeled “ML/AI” have a tendency to concentrate on classification (exceptions exist, e.g. high dimensional PS approaches suggested that are called “ML”) • Analyses labeled “ML/AI” in medicine are surprisingly often done by people not thoroughly trained in statistics • Basic statistical principles are often forgotten or ignored (e.g. improper scoring rules)
  • 36. Concluding remarks (1) • Just because an algorithm is novel or flexible doesn’t mean it is any good, obviously • Dismissing the potential value of novel “ML/AI” algorithms out- of-hand doesn’t make sense • We need more realistic simulations and many applications to compare the traditional vs more novel / flexible algorithms • The primary issue in medical applications seems to be with the modelers not so much with the models
  • 37. Concluding remarks (2) • Statisticians should be more involved in the application and evaluation of novel / flexible algorithms, especially for risk prediction • Statisticians should be involved in studying performance of novel / flexible algorithms (e.g. data hungriness) -> realistic simulation studies • Collaboration with computer scientists • Computationally intensive -> may not be cheap • Serious experimental design and reporting
  • 38. Simulation is… “…it is using simulation for multiplication that I find objectionable. Eight patients are eight patients and so should remain.”
  • 39. “All the impressive achievements of deep learning amount to just curve fitting” Judea Pearl