Radiomics Analysis of Pulmonary Nodules in Low Dose CT for Early Detection of Lung Cancer

Wookjin Choi, PhD, Jung Hun Oh, PhD, Sadegh Riyahi, PhD, Feng Jiang, MD, PhD, Wengen Chen, MD, PhD,
Joseph O. Deasy, PhD, and Wei Lu, PhD
Department of Medical Physics, Memorial Sloan Kettering Cancer Center, New York, NY 10065
Department of Pathology, University of Maryland School of Medicine, Baltimore, MD 21201
Department of Diagnostic Radiology and Nuclear Medicine, University of Maryland School of Medicine, Baltimore, MD 21201
Radiomics and Quantitative Imaging
TH-AB-201-10

Lung Cancer Screening
• Early detection of lung cancer by LDCT can reduce
mortality
– LDCT dramatically increases the number of
indeterminate pulmonary nodules (PNs)
• Known features correlated with PN malignancy
– Size, growth rate
– Calcification, enhancement, solidity → texture features
– Boundary margins (spiculation, lobulation) → shape and
appearance features
[Figure panels: benign pattern of calcification; malignant nodules; benign nodules]
Images from radiologyassistant.nl, AJR Am J Roentgenol. 2003 May;180(5):1255-63, and AJR Am J Roentgenol. 2002 May;178(5):1053-7.

Data set
A subset of LIDC-IDRI from TCIA
• Multi-institution data
• Four radiologists detected and contoured
PNs
• Consensus contour: generated by STAPLE
using 2 or more contours of PN
• Biopsy-proven ground-truth or 2 years of
stable PN
• 36 benign and 43 malignant cases, 7 missing
contours (5 benign and 2 malignant)
• 72 cases evaluated (31 benign and 41
malignant cases)
LIDC-IDRI: Lung Image Database Consortium image collection, TCIA: The Cancer Imaging Archive,
STAPLE: the simultaneous truth and performance level estimation
Data From LIDC-IDRI. The Cancer Imaging Archive. http://doi.org/10.7937/K9/TCIA.2015.LO9QL9SX
Group                              # Pts
Total                              1,010
Having diagnosis data                157
  Primary cancer                      43
    biopsy-proven                     42
    progression                        1
  Benign                              36
    biopsy-proven                      7
    2 yrs of stable PN                26
    progression                        3
  Metastatic cancer or unknown        78
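The consensus-contour step on this slide can be illustrated with a minimal sketch of binary STAPLE (expectation-maximization over per-rater sensitivity and specificity). It is not the authors' implementation: the flat voxel-list input, the initial value of 0.9, the fixed iteration count, and the name `staple_binary` are all assumptions made for illustration.

```python
def staple_binary(ratings, iterations=50, init=0.9):
    """Minimal binary STAPLE sketch (hypothetical helper).

    ratings[j][i] is 1 if rater j marked voxel i as nodule, else 0.
    Returns (w, p, q): per-voxel consensus probability, per-rater
    sensitivity, per-rater specificity.
    """
    n_raters, n_voxels = len(ratings), len(ratings[0])
    # Global prior: fraction of all rater decisions that are "nodule".
    prior = sum(map(sum, ratings)) / (n_raters * n_voxels)
    p = [init] * n_raters  # sensitivities
    q = [init] * n_raters  # specificities
    w = [prior] * n_voxels
    for _ in range(iterations):
        # E-step: probability that each voxel truly belongs to the nodule.
        for i in range(n_voxels):
            a, b = prior, 1.0 - prior
            for j in range(n_raters):
                if ratings[j][i]:
                    a *= p[j]
                    b *= 1.0 - q[j]
                else:
                    a *= 1.0 - p[j]
                    b *= q[j]
            w[i] = a / (a + b) if a + b > 0 else prior
        # M-step: re-estimate each rater's sensitivity and specificity.
        fg = sum(w)
        bg = sum(1.0 - x for x in w)
        for j in range(n_raters):
            if fg > 0:
                hit = sum(w[i] for i in range(n_voxels) if ratings[j][i])
                p[j] = min(1.0, hit / fg)
            if bg > 0:
                rej = sum(1.0 - w[i] for i in range(n_voxels) if not ratings[j][i])
                q[j] = min(1.0, rej / bg)
    return w, p, q


# Toy example: three raters, eight voxels; rater 3 misses voxel 3.
raters = [
    [1, 1, 1, 1, 0, 0, 0, 0],
    [1, 1, 1, 1, 0, 0, 0, 0],
    [1, 1, 1, 0, 0, 0, 0, 0],
]
w, p, q = staple_binary(raters)
consensus = [1 if x >= 0.5 else 0 for x in w]
```

Thresholding the probabilities at 0.5 gives a consensus mask; in the toy case the disputed voxel is kept because the two agreeing raters are estimated to be more sensitive than the third.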

ACR Lung-RADS
Category: baseline screening findings, assessment, and malignancy risk
• 1: No PNs; PNs with calcification
– Negative, <1% chance of malignancy
• 2: Solid/part-solid: <6 mm; GGN: <20 mm
– Benign appearance, <1% chance of malignancy
• 3: Solid: ≥6 to <8 mm; Part-solid: ≥6 mm with solid component <6 mm; GGN: ≥20 mm
– Probably benign, 1-2% chance of malignancy
• 4A: Solid: ≥8 to <15 mm; Part-solid: ≥8 mm with solid component ≥6 and <8 mm
– Suspicious, 5-15% chance of malignancy
• 4B: Solid: ≥15 mm; Part-solid: solid component ≥8 mm
– >15% chance of malignancy
• 4X: Category 3 or 4 PNs with suspicious features (e.g. enlarged lymph nodes) or suspicious imaging findings (e.g. spiculation)
– >15% chance of malignancy
Summary of Lung-RADS categorization for baseline screening
ACR: American College of Radiology
Lung-RADS: Lung CT Screening Reporting and Data System
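The size thresholds summarized on this slide can be written as a small lookup function. This is only a sketch of the summary above, not the full ACR specification: the function name, the argument names, and the handling of part-solid nodules whose total size is 6 to 8 mm with a solid component of 6 mm or more (a case the summary does not list) are assumptions.

```python
def lung_rads_baseline(kind, size_mm=0.0, solid_mm=0.0,
                       calcified=False, suspicious=False):
    """Return the Lung-RADS baseline category per the slide summary.

    kind: None (no nodule), "solid", "part-solid", or "ggn".
    size_mm: nodule size; solid_mm: solid component size (part-solid only).
    """
    if kind is None or calcified:
        return "1"
    if kind == "solid":
        if size_mm < 6:
            category = "2"
        elif size_mm < 8:
            category = "3"
        elif size_mm < 15:
            category = "4A"
        else:
            category = "4B"
    elif kind == "part-solid":
        if size_mm < 6:
            category = "2"
        elif solid_mm < 6:
            category = "3"
        elif solid_mm < 8:
            # Assumption: graded by the solid component once total size >= 6 mm.
            category = "4A"
        else:
            category = "4B"
    elif kind == "ggn":
        category = "2" if size_mm < 20 else "3"
    else:
        raise ValueError(f"unknown nodule type: {kind!r}")
    # 4X: category 3 or 4 nodules with additional suspicious features.
    if suspicious and category in ("3", "4A", "4B"):
        return "4X"
    return category


examples = [
    lung_rads_baseline("solid", 5),
    lung_rads_baseline("solid", 10),
    lung_rads_baseline("part-solid", 10, solid_mm=7),
    lung_rads_baseline("ggn", 25),
    lung_rads_baseline("solid", 7, suspicious=True),
]
```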

Radiomics for Lung Cancer Screening
• Radiomic features from 3D volume and 2D axial slice with largest area
(n=103)
– Shape: 40 features (3D: 26 and 2D: 14)
– Texture: 36 features (GLCM: 16 and GLRM: 20)
– Intensity: 18 features (3D: 9 and 2D: 9)
– Shape+Intensity: 9 features, shape features weighted by intensity using image
moment (3D: 5 and 2D: 4)
GLCM: gray level co-occurrence matrix, GLRM: gray level run-length matrix
[Figure: examples of shape features (3D and 2D), texture features (GLCM and GLRM), and intensity features]
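As a minimal illustration of one GLCM texture feature, the sketch below builds a normalized co-occurrence matrix for a single pixel offset on a small 2D image and computes the inverse difference moment (IDM), a measure of local homogeneity. The gray-level discretization, the offsets, and whether the authors used a symmetric matrix are not stated on the slides, so those choices and both function names are assumptions.

```python
def glcm(image, offset=(0, 1), levels=None, symmetric=True):
    """Normalized gray level co-occurrence matrix for one offset.

    image: 2D list of non-negative integer gray levels.
    offset: (row step, column step) between the paired pixels.
    """
    rows, cols = len(image), len(image[0])
    if levels is None:
        levels = max(max(row) for row in image) + 1
    counts = [[0] * levels for _ in range(levels)]
    dr, dc = offset
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dr, c + dc
            if 0 <= r2 < rows and 0 <= c2 < cols:
                i, j = image[r][c], image[r2][c2]
                counts[i][j] += 1
                if symmetric:
                    counts[j][i] += 1
    total = sum(map(sum, counts))
    if total == 0:
        raise ValueError("no pixel pairs for this offset")
    return [[v / total for v in row] for row in counts]


def inverse_difference_moment(p):
    """IDM = sum over (i, j) of p(i, j) / (1 + (i - j)^2)."""
    n = len(p)
    return sum(p[i][j] / (1 + (i - j) ** 2) for i in range(n) for j in range(n))


uniform = [[1, 1], [1, 1]]
checker = [[0, 1], [1, 0]]
idm_uniform = inverse_difference_moment(glcm(uniform))  # 1.0
idm_checker = inverse_difference_moment(glcm(checker))  # 0.5
```

A perfectly uniform patch gives an IDM of 1, and the value drops as neighboring gray levels differ more.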

Prediction model
• Distinctive features (n=50)
– Hierarchical clustering using Pearson
correlation
– 9 shape, 26 texture, 8 intensity, and 7
shape+intensity features
– 15 significant features after Bonferroni
correction
• SVM classification coupled with LASSO
feature selection
– Selected 10 most important features by 10-
fold CV of the LASSO
– Radial basis function kernel
(γ = 0.001 and C = 64)
– 10 times 10-fold CV
SVM: Support vector machine, LASSO: Least absolute shrinkage and selection operator,
CV: Cross validation
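The feature-reduction steps on this slide can be sketched in plain Python. The grouping below links features whose absolute Pearson correlation reaches a threshold, which is equivalent to cutting a single-linkage hierarchical clustering at that level; the linkage method and the threshold the authors used are not given on the slide, so both are assumptions, as are the function names and the toy feature values. The Bonferroni helper multiplies each p-value by the number of tests.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def correlation_clusters(features, threshold=0.9):
    """Group features connected by |r| >= threshold (single linkage).

    features: dict mapping feature name -> list of per-case values.
    The threshold value is hypothetical.
    """
    names = list(features)
    parent = {name: name for name in names}

    def find(name):
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    for idx, a in enumerate(names):
        for b in names[idx + 1:]:
            if abs(pearson(features[a], features[b])) >= threshold:
                parent[find(a)] = find(b)
    clusters = {}
    for name in names:
        clusters.setdefault(find(name), []).append(name)
    return list(clusters.values())


def bonferroni(p_values):
    """Bonferroni-adjusted p-values: p * m, capped at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


# Toy values for three features over five cases (made up for illustration).
toy = {
    "diameter": [1, 2, 3, 4, 5],
    "bb_ap": [2, 4, 6, 8, 11],
    "idm": [5, 1, 4, 2, 3],
}
groups = correlation_clusters(toy)
adjusted = bonferroni([0.01, 0.04, 0.5])
```

One representative per group would then be kept as a distinctive feature before testing for significance.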

Performance of the SVM-LASSO model with increasing number of features in the 10×10-fold CV
CV: Cross validation, SVM: Support Vector Machine
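The repeated cross-validation protocol (for example 10 repetitions of 10-fold CV) can be sketched as an index generator. Whether the authors stratified the folds by class is not stated, so stratification, the seed, and the function name are assumptions; the class counts in the example are the 31 benign and 41 malignant cases from the data set slide.

```python
import random


def repeated_stratified_kfold(labels, k, repeats, seed=0):
    """Return a list of (train_indices, test_indices) splits.

    Each repetition reshuffles the cases within each class and deals them
    into k folds in turn, so class proportions stay roughly equal per fold.
    """
    rng = random.Random(seed)
    n = len(labels)
    splits = []
    for _ in range(repeats):
        folds = [[] for _ in range(k)]
        position = 0  # keeps dealing where the previous class stopped
        for cls in sorted(set(labels)):
            members = [i for i, y in enumerate(labels) if y == cls]
            rng.shuffle(members)
            for i in members:
                folds[position % k].append(i)
                position += 1
        for test in folds:
            held_out = set(test)
            train = [i for i in range(n) if i not in held_out]
            splits.append((train, sorted(test)))
    return splits


labels = [0] * 31 + [1] * 41  # 31 benign, 41 malignant
splits = repeated_stratified_kfold(labels, k=10, repeats=10)
```

With this scheme, 10×10-fold, 20×5-fold, and 50×2-fold CV each produce 100 train/test splits, differing only in how much data is held out per split.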

Performance of the SVM-LASSO models using the two important features and compared with Lung-RADS
Prediction Model  CV          Sensitivity  Specificity  Accuracy   AUC        # of Features
Lung-RADS         -           73.3%        70.4%        72.2%      0.74       4
SVM-LASSO         10×10-fold  87.9±2.5%    78.2±1.6%    83.7±1.7%  0.86±0.01  2
SVM-LASSO         20×5-fold   86.0±3.3%    75.9±3.9%    81.6±2.6%  0.85±0.02  2
SVM-LASSO         50×2-fold   83.4±4.9%    71.9±8.8%    78.5±5.1%  0.84±0.03  2
BB: Bounding Box, AP: Anterior-Posterior, SD: Standard Deviation, IDM: Inverse Difference Moment
• BB_AP
– Highly correlated with the axial longest diameter and its
perpendicular diameter (r = 0.96, larger – more malignant)
• SD_IDM
– Directional variation of local homogeneity (smaller – more
malignant)
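The two selected features can be illustrated with a short sketch: the bounding-box extent of a nodule mask along each axis, and the standard deviation of a texture value across directions. The (LR, AP, SI) axis order, the use of the sample standard deviation, the function names, and all numbers in the example are assumptions made for illustration.

```python
from statistics import stdev


def bounding_box_extent_mm(voxels, spacing_mm):
    """Axis-aligned bounding-box size of a nodule mask in millimeters.

    voxels: list of (x, y, z) voxel indices inside the nodule.
    spacing_mm: voxel spacing per axis, assumed ordered (LR, AP, SI).
    """
    extents = []
    for axis in range(3):
        coords = [v[axis] for v in voxels]
        extents.append((max(coords) - min(coords) + 1) * spacing_mm[axis])
    return tuple(extents)


def directional_sd(values):
    """Spread of a texture feature (e.g. IDM) computed along several directions."""
    return stdev(values)


# Hypothetical mask voxels and spacing.
mask = [(0, 0, 0), (1, 3, 0), (2, 1, 1)]
bb_lr, bb_ap, bb_si = bounding_box_extent_mm(mask, (0.5, 0.5, 2.0))

# Hypothetical per-direction IDM values: similar in every direction vs. not.
sd_isotropic = directional_sd([0.20, 0.20, 0.20])
sd_anisotropic = directional_sd([0.10, 0.20, 0.30])
```

A lower directional spread means the local homogeneity looks similar in every direction, which the slide associates with malignant PNs.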

Scatter plot of the two features and the classification curve by the SVM-LASSO model

Cases misclassified by Lung-RADS but correctly classified by the SVM-LASSO model
BB: Bounding Box, SD: Standard Deviation, AP: Anterior-Posterior, SI: Superior-Inferior, IDM: Inverse Difference Moment
Scale bar is 10 mm, Spiculation: 1(no)-5(marked) scale

Comparison with recent models
• Hawkins et al. (2016)
– Dataset: baseline CT scans of 261 pts in NLST; biopsy-proven ground truth or 2 years of stable PN
– Model: 23 RIDER stable radiomic features; random forest classifier; 10×10-fold CV
• Ma et al. (2016)
– Dataset: LIDC, 72 pts; biopsy-proven ground truth or 2 years of stable PN
– Model: 583 radiomic features; 10-fold CV
• Buty et al. (2016)
– Dataset: LIDC, 2054 PNs; ground truth by radiologist's assessment
– Model: spherical harmonics (100, 150, and 400 shape features) and AlexNet (4096 appearance features); 10-fold CV
• Kumar et al. (2015)
– Dataset: LIDC, 97 pts, including metastatic tumors; biopsy-proven ground truth or 2 years of stable PN
– Model: deep convolutional neural network model (5000 features); 10-fold CV
• Proposed
– Dataset: LIDC, 72 pts; biopsy-proven ground truth or 2 years of stable PN
– Model: 2 important features; LASSO feature selection and SVM classification; 10×10-fold CV

Comparison with recent models
Model                  Sensitivity  Specificity  Accuracy  AUC
Hawkins et al. (2016)  51.7%        92.9%        80.0%     0.83
Ma et al. (2016)       80.0%        85.5%        82.7%     -
Buty et al. (2016)     -            -            82.4%     -
Kumar et al. (2015)    79.1%        76.1%        77.5%     -
Proposed               87.9%        78.2%        83.7%     0.86
• A large number of features relative to the number of patients
– May cause model overfitting
• No discussion of how the selected features might have contributed to
the prediction of malignancy
• Deep learning needs large amounts of training data to avoid overfitting,
and transfer learning is questionable
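For reference, the sensitivity, specificity, and accuracy compared in the table above follow directly from a confusion matrix, with malignant taken as the positive class. The counts in the example are hypothetical, chosen only to match the 41 malignant and 31 benign cases in this study; they are not results reported by the authors.

```python
def screening_metrics(tp, fn, tn, fp):
    """Sensitivity, specificity, and accuracy from confusion-matrix counts."""
    return {
        "sensitivity": tp / (tp + fn),            # malignant cases detected
        "specificity": tn / (tn + fp),            # benign cases correctly cleared
        "accuracy": (tp + tn) / (tp + fn + tn + fp),
    }


# Hypothetical counts: 41 malignant (36 found), 31 benign (24 cleared).
metrics = screening_metrics(tp=36, fn=5, tn=24, fp=7)
```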

Future Work
• Candidate feature approach
– Quantification of spiculated or lobulated margins
– Calcification, attachment, solidity and cavitation of PNs
• Integrate plasma biomarkers in the SVM-LASSO model
– Difficult to diagnose small PNs, 50% accuracy when PN size
< 15 mm
– Combining plasma biomarkers with clinical variables and image
features (AUC = 0.95)
• Deep learning - Data Science Bowl 2017, Predicting Lung Cancer
– 3D Fully Convolutional Neural Network model
– Ranked 99th out of 1972 teams (Top 6%, Bronze medal)
Jiang et al. Int J Cancer. 2017. [published online ahead of print 2017/06/06].

Conclusion
• Developed an SVM-LASSO model to predict
malignancy of the indeterminate PNs
– Two important features: the bounding box anterior-
posterior dimension and the standard deviation of local
homogeneity
– The proposed model outperformed Lung-RADS
• A multicenter clinical trial in a large population is
required
– To prospectively and rigorously validate the radiomic
features
– So that the model can be translated into clinical practice

Acknowledgements
• NIH Grant R01CA172638

Significant features
Rank Feature name Type P-value AUC Correlation
1 BB_AP Shape 0.00070 0.81 +
2 BB_SI Shape 0.0012 0.80 +
3 SD_IDM Texture 0.0018 0.79 -
4 Weighted Principal Moments2 Shape+Intensity 0.0022 0.78 +
5 Grey Level Nonuniformity Texture 0.0026 0.78 +
6 Oriented BB_SI Shape 0.0027 0.79 +
7 Weighted Principal Moments3 Shape+Intensity 0.0030 0.78 +
8 Low Grey Level Run Emphasis Texture 0.0031 0.78 -
9 SD Run Length Nonuniformity Texture 0.0033 0.78 +
10 SD Low Grey Level Run Emphasis Texture 0.017 0.75 -
11 Correlation Texture 0.018 0.75 +
12 IDM Texture 0.020 0.75 +
13 SD Long Run Emphasis Texture 0.024 0.75 -
14 Long Run Low Grey Level Emphasis Texture 0.028 0.75 -
15 Inertia Texture 0.035 0.74 -

Results
ROC curve analysis on the best model
of SVM-LASSO and Lung-RADS
The box plots show the difference between benign and malignant
PNs for the selected features (BB_AP and SD_IDM) and the
largest diameter. P-values were obtained by the Wilcoxon rank
sum test and adjusted using Bonferroni correction
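The AUC of a single feature is closely tied to the Wilcoxon rank sum statistic: it is the fraction of (malignant, benign) pairs in which the malignant case has the higher value, with ties counted as half. The sketch below computes it by direct pair counting; the function name and the example values are made up for illustration.

```python
def auc_from_scores(positive, negative):
    """AUC as the normalized Mann-Whitney U statistic.

    positive: feature values (or model scores) of malignant cases.
    negative: feature values (or model scores) of benign cases.
    """
    wins = 0.0
    for p in positive:
        for n in negative:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5  # ties count as half a win
    return wins / (len(positive) * len(negative))


# Made-up feature values, e.g. a size-like feature in mm.
auc_overlap = auc_from_scores([3, 4, 5], [1, 2, 3])   # 8.5 / 9
auc_perfect = auc_from_scores([10, 12], [4, 6, 8])    # 1.0
```

For a feature that is smaller in malignant PNs (such as SD_IDM), the same calculation on negated values gives the corresponding AUC.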

BB_AP 10mm
Benign
IDM_LR: 0.172
IDM_AP: 0.182
IDM_SI: 0.284
Mean_IDM: 0.174
SD_IDM: 0.033
Malignant
IDM_LR: 0.116
IDM_AP: 0.136
IDM_SI: 0.138
Mean_IDM: 0.111
SD_IDM: 0.014
[Figure: panels a-f showing axial, sagittal, and coronal views of the two PNs, with LR, AP, and SI axes labeled]

BB_AP 17mm
Benign
IDM_LR: 0.276
IDM_AP: 0.316
IDM_SI: 0.210
Mean_IDM: 0.220
SD_IDM: 0.030
Malignant
IDM_LR: 0.234
IDM_AP: 0.215
IDM_SI: 0.236
Mean_IDM: 0.203
SD_IDM: 0.020
Axial Sagittal Coronal
d e f
LR
AP
AP
SI
LR
SI
a b c
LR
AP
AP
SI
LR
SI
