2. Outline
● Background on Deep learning
○ Definition
○ Common Models (AE, DBN, CNN and RNN)
○ Transfer Learning
● Paper 1: Dermatologist-level classification of skin
cancer with deep neural networks
● Paper 2: Deep learning for healthcare: review,
opportunities and challenges
2
3. Deep learning Definition
Deep learning is a class of machine learning algorithms that:
● use a cascade of many layers of nonlinear processing units for feature
extraction and transformation.
● Each successive layer uses the output from the previous layer as input.
Higher level features are derived from lower level features to form a
hierarchical representation.
3
4. H = WT
. X + b -- Logistic regression --- Multiple perceptron
Output nodes: label; input; ….
Can be applied on supervised / unsupervised / semi-supervised problems
samples
Features
4
5. Becoming popular (2006 -- )
● Big Data
● Strong
computing
capability
● Bottleneck
solution in DL
https://beamandrew.github.io/deeplearning/2017/02/23/deep_learning_101_part1.html 5
6. Breakthrough in areas
● Computer vision /image recognition
● Speech recognition
● Text analysis
● Natural Language Processing
● Self-driving cars
● Drug development and genomics
● …..
6
23. Skin cancer
● 5.4 m new cases in US every year
● Melanomas(< 5%) causing 75% skin-cancer-related deaths
● 5-year survival 99% for early stage diagnosis and 14% for late
stage diagnosis
● Classification from dermoscopy and histological image lacking
generalization capability
23
24. Analysis flow
Image data: 129,450 with 2032 labels
(dermoscopy and histological)
CNN
Google inception v3
Performance evaluation
ACC; AUC;
specificity and sensitivity
*Pre-trained DL
architecture,
including 1.28 m
images with 1000
labels
24
27. t-SNE visualization of the last hidden layer representations in the CNN for
four disease classes.
27
28. Conclusions
● Similar performance of deep learning model with 21
dermatologists
● The method can be deployable on mobile device to track skin
cancer
28
33. Take-home messages
● What can we conclude from several successful cases?
● Is DL appropriate in molecule-level biological data?
● What is the potential hierarchical architecture in gene
representation?
● How much data is enough for gene expression-based DL training?
● How to consider the one-shot information from transcriptome?
● How to combine omic data from different resources to DL models?
33