Multivariate Algorithms and
Classifiers in Cancer
Micro-RNA profiles help predict distant disease-free survival in breast cancer
Bits and pieces of bioinformatics workflow

Mehis Pold, MD
October 18, 2013
Feature Selection &
algorithm development

Training
samples

Iterative process

Internal Algorithm
Validation

Validation
samples

Clinical Validation
Training and validation datasets in each step don’t
overlap
Rule of thumb: validation always produces weaker
statistics than training
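
The non-overlap requirement can be illustrated with a minimal Python sketch. It assumes a simple seeded random split; the slides do not say how samples were assigned, and the sample IDs and the `split_samples` helper are hypothetical. Only the sizes (110 training, 100 validation) come from the workflow slide.

```python
import random

def split_samples(sample_ids, n_train, seed=0):
    """Shuffle sample IDs and cut them into disjoint training and validation lists."""
    ids = list(sample_ids)
    if len(set(ids)) != len(ids):
        raise ValueError("sample IDs must be unique")
    if not 0 < n_train < len(ids):
        raise ValueError("n_train must leave at least one validation sample")
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    rng.shuffle(ids)
    return ids[:n_train], ids[n_train:]

# Hypothetical IDs standing in for the 210 samples (assumption, not from the dataset).
samples = [f"S{i:03d}" for i in range(210)]
train, valid = split_samples(samples, n_train=110)

# A sample may appear in one set only.
print(len(train), len(valid), set(train).isdisjoint(valid))
```

Slicing one shuffled list guarantees the two sets cannot share a sample, which is the property the slide asks for at every step.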
• Analysis of early primary breast cancer to identify prognostic
markers and associated pathways: mRNA and miRNA profiling
• GEO (Gene Expression Omnibus) accession ID: GSE22220
• Technology platform: ILLUMINA
• 733 micro-RNA
• 210 breast cancer samples
• 79 samples with pathological complete response (pCR) to chemotherapy;
131 samples with recurrent disease (RD)
• Data collected up to 10 years after start of chemotherapy
Buffa et al. microRNA-Associated Progression Pathways and
Potential Therapeutic Targets Identified by Integrated mRNA and
microRNA Expression Profiling in Breast
Cancer. Cancer Res. 2011, 71:5635
BIOINFORMATICS WORKFLOW
Multiple statistical
approaches to
maximize outcome

TRAINING SET:
36 RD
74 pCR

VALIDATION SET:
43 RD
57 pCR

Kaplan-Meier & ROC

Sensitivity (Se)
Specificity (Sp)
Positive Predictive Value (PPV)
Negative Predictive Value (NPV)

Comparison of two
algorithms and
classification by kNN
Custom scripting: R, VBA
Standard software: MS Excel
Medical statistics: MedCalc
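
The four classification statistics listed on this slide follow directly from the counts of a 2x2 confusion matrix. The sketch below uses the standard definitions; the function name and the counts are made up for illustration and are not results from the slides.

```python
def diagnostic_metrics(tp, fp, tn, fn):
    """Return Se, Sp, PPV and NPV from confusion-matrix counts.

    tp/fn: positives called correctly/incorrectly
    tn/fp: negatives called correctly/incorrectly
    Raises ZeroDivisionError if a denominator is empty.
    """
    return {
        "Se": tp / (tp + fn),   # sensitivity: true positives among actual positives
        "Sp": tn / (tn + fp),   # specificity: true negatives among actual negatives
        "PPV": tp / (tp + fp),  # positive predictive value
        "NPV": tn / (tn + fn),  # negative predictive value
    }

# Illustrative counts only (assumption).
m = diagnostic_metrics(tp=30, fp=10, tn=40, fn=20)
for name, value in m.items():
    print(f"{name}: {value:.2f}")
```

Se and Sp describe the test itself, whereas PPV and NPV also depend on how common each class is in the sample set.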
FEATURE SELECTION
Reduction of dimensionality from n = 733 to n = 1
Approach 1: iterative clustering

Approach 2: T-test combined with enriching for weak
inter-profile correlation
Significance of feature selection evaluated by Kaplan-Meier survival analysis and ROC (receiver operating characteristic) curves

[Figure: expression profiles; labels: RD, pCR, Up, Down]
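
Approach 2 is not spelled out in the slides, so the following Python sketch is one possible reading rather than the author's procedure: rank profiles by the absolute Welch t statistic between RD and pCR, then greedily keep a profile only if it correlates weakly with every profile already kept. The `max_abs_r` threshold, the function names, and the toy profiles are all assumptions.

```python
from math import sqrt
from statistics import mean, variance

def welch_t(a, b):
    """Welch's t statistic for two independent samples."""
    return (mean(a) - mean(b)) / sqrt(variance(a) / len(a) + variance(b) / len(b))

def pearson(x, y):
    """Pearson correlation; undefined (division by zero) for a constant profile."""
    mx, my = mean(x), mean(y)
    num = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    den = sqrt(sum((xi - mx) ** 2 for xi in x) * sum((yi - my) ** 2 for yi in y))
    return num / den

def select_features(profiles, labels, max_abs_r=0.5):
    """profiles: name -> values (one per sample); labels: 'RD' or 'pCR' per sample."""
    def abs_t(values):
        rd = [v for v, lab in zip(values, labels) if lab == "RD"]
        pcr = [v for v, lab in zip(values, labels) if lab == "pCR"]
        return abs(welch_t(rd, pcr))

    # Strongest group difference first.
    ranked = sorted(profiles, key=lambda name: abs_t(profiles[name]), reverse=True)
    kept = []
    for name in ranked:
        # Keep only profiles weakly correlated with everything already kept.
        if all(abs(pearson(profiles[name], profiles[k])) < max_abs_r for k in kept):
            kept.append(name)
    return kept

# Toy data (hypothetical names and values): miR-A and miR-B move together.
labels = ["RD"] * 4 + ["pCR"] * 4
profiles = {
    "miR-A": [5.0, 5.2, 4.9, 5.1, 3.0, 3.1, 2.9, 3.2],
    "miR-B": [10.1, 10.3, 9.9, 10.2, 6.1, 6.3, 5.9, 6.4],
    "miR-C": [1.0, 2.0, 1.5, 2.5, 1.2, 2.2, 1.4, 2.4],
}
print(select_features(profiles, labels))
```

With this toy input, the redundant member of the correlated pair is dropped while the weakly correlated profile survives, which is the enrichment effect the approach name suggests.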
KAPLAN-MEIER SURVIVAL CURVE
The Kaplan–Meier estimator, also known as the product-limit estimator, is a
non-parametric statistic that estimates the survival function from lifetime data. In medical
research, it is often used to measure the fraction of patients living for a certain
amount of time after treatment. In economics, it can be used to measure the
length of time people remain unemployed after a job loss. In engineering, it can
be used to measure the time until failure of machine parts. In ecology, it can be
used to estimate how long fleshy fruits remain on plants before they are removed
by frugivores. The estimator is named after Edward L. Kaplan and Paul Meier.
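
As a worked illustration of the product-limit idea, the sketch below multiplies, at each observed event time, the running survival by (1 - events / number at risk). It is a minimal stand-alone version with made-up follow-up data; the function name is hypothetical, and the test used to obtain the Kaplan-Meier p-values in the summary is not specified in the slides and is not covered here.

```python
def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct time with at least one event.

    times  : follow-up time per patient
    events : 1 if the event (e.g. relapse) was observed, 0 if censored
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        n_events = 0
        n_leaving = 0
        # Gather everyone whose follow-up ends at time t.
        while i < len(data) and data[i][0] == t:
            n_events += data[i][1]
            n_leaving += 1
            i += 1
        if n_events:
            survival *= 1 - n_events / at_risk
            curve.append((t, survival))
        # Censored and event cases both leave the risk set after t.
        at_risk -= n_leaving
    return curve

# Made-up follow-up times (years) and event flags, for illustration only.
times = [2, 3, 3, 5, 8, 8, 9]
events = [1, 1, 0, 1, 0, 1, 0]
for t, s in kaplan_meier(times, events):
    print(f"t={t}: S={s:.3f}")
```

Censored patients lower the number at risk for later times without producing a step in the curve, which is what lets the estimator use incomplete follow-up.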

Receiver operating characteristic (ROC)
In signal detection theory, a receiver operating characteristic (ROC), or simply
ROC curve, is a graphical plot which illustrates the performance of a binary
classifier system as its discrimination threshold is varied. It is created by plotting
the fraction of true positives out of the total actual positives (TPR = true positive
rate) vs. the fraction of false positives out of the total actual negatives (FPR =
false positive rate), at various threshold settings. TPR is also known as sensitivity
(also called recall in some fields), and FPR is one minus the specificity or true
negative rate.
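
The threshold sweep described above can be sketched in a few lines of Python. The code lowers the threshold one distinct score at a time, records (FPR, TPR) at each step, and integrates the curve with the trapezoidal rule. Scores, labels, and function names are invented for illustration.

```python
def roc_points(scores, labels):
    """Return (FPR, TPR) pairs as the decision threshold is lowered.

    scores : classifier score per sample (higher means more likely positive)
    labels : 1 for actual positives, 0 for actual negatives
    """
    pos = sum(labels)
    neg = len(labels) - pos
    ranked = sorted(zip(scores, labels), reverse=True)
    points = [(0.0, 0.0)]
    tp = fp = 0
    i = 0
    while i < len(ranked):
        s = ranked[i][0]
        # Samples with tied scores cross the threshold together.
        while i < len(ranked) and ranked[i][0] == s:
            if ranked[i][1]:
                tp += 1
            else:
                fp += 1
            i += 1
        points.append((fp / neg, tp / pos))
    return points

def auc(points):
    """Area under the ROC curve by the trapezoidal rule."""
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(points, points[1:]))

# Made-up scores and labels, for illustration only.
scores = [0.9, 0.8, 0.7, 0.6, 0.55, 0.4, 0.3, 0.2]
labels = [1, 1, 0, 1, 0, 1, 0, 0]
pts = roc_points(scores, labels)
print(pts)
print(f"AUC = {auc(pts):.4f}")
```

The area equals the fraction of positive/negative pairs in which the positive sample receives the higher score, so 0.5 corresponds to chance and 1.0 to perfect separation.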
ITERATIVE CLUSTERING TO BINARY OUTCOME
T-TEST ENRICHED TOWARD WEAK CORRELATIONS
Nearest Neighbor Classification - kNN
• Based on a measure of distance between observations (e.g.
Euclidean distance or one minus correlation).
• k-nearest neighbor rule (Fix and Hodges (1951)) classifies an
observation X as follows:
– find the k closest observations in the training data,
– predict the class by majority vote, i.e. choose the class that is
most common among those k neighbors.
[Figure: classification of data in 2D space, shown for K=3 and K=5]
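
The k-nearest-neighbor rule above can be sketched as follows, using Euclidean distance and a majority vote. The 2D points, their class labels, and the function name are hypothetical; a distance of one minus correlation could be substituted for `dist`.

```python
from collections import Counter
from math import dist  # Euclidean distance, available in Python 3.8+

def knn_predict(train_x, train_y, x, k=3):
    """Classify x by majority vote among its k nearest training points."""
    neighbours = sorted(zip(train_x, train_y), key=lambda p: dist(p[0], x))[:k]
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]

# Hypothetical 2D training data with two classes (assumption).
train_x = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.3), (3.0, 3.2), (3.1, 2.9), (2.8, 3.0)]
train_y = ["pCR", "pCR", "pCR", "RD", "RD", "RD"]

print(knn_predict(train_x, train_y, (1.1, 1.0), k=3))
print(knn_predict(train_x, train_y, (2.9, 3.1), k=3))
print(knn_predict(train_x, train_y, (1.1, 1.0), k=5))
```

With two classes, an odd k such as 3 or 5 avoids tied votes.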
SUMMARY
ITERATIVE CLUSTERING TO BINARY OUTCOME
Rows: TRAINING (Kaplan-Meier, ROC); VALIDATION (Kaplan-Meier, ROC); CLASSIFICATION (kNN)
Columns: p-value, AUC, Sensitivity, Specificity, PPV, NPV
Values in extraction order (cell alignment lost): 0.0001, <.0001, 0.773, 72, 0.67, 65, 0.61, 0.50, 0.63, 65, 0.51, 68, 0.0002, 0.0024

T-TEST ENRICHED FOR WEAK CORRELATIONS
Rows: TRAINING (Kaplan-Meier, ROC); VALIDATION (Kaplan-Meier, ROC); CLASSIFICATION (kNN)
Columns: p-value, AUC, Sensitivity, Specificity, PPV, NPV
Values in extraction order (cell alignment lost): <.0001, <.0001, 0.898, 83, 0.624, 58, 0.86, 0.65, 0.64, 56, 0.35, 82, 0.012, 0.0334
CONCLUDING REMARKS
• There is no single ‘right’ approach to algorithm development.
• Validation always produces weaker statistics than training.
• The significance of training statistics and that of validation
statistics do not correlate well.
• Algorithms are only as stable and significant as the upstream
R&D data. The better standardized and controlled the wet-bench work,
the more stable and significant the algorithms and the eventual
clinical validation.
