SlideShare une entreprise Scribd logo
1  sur  34
Télécharger pour lire hors ligne
Multi-Domain Image Completion
for Random Missing Input Data
Stanford University, NVIDIA, and National Institutes of Health
Yonsei University Severance Hospital CCIDS
Choi Dongmin
Introduction
• Multi-domain images could provide complementary knowledge

- ex. Four MRI modalities (T1, T1CE, T2, FLAIR) provide distinct features to locate tumor
boundaries from different diagnosis perspective

- ex. Person re-identification across different cameras or times

• However, some image domains might be missing in practice

- Solution 1. Nearest neighbor approach : lack of semantic consistency

- Solution 2. Generative models

• ReMIC (Representational disentanglement schemes for Multi-domain
Image Completion)

- -to- image completion framework 

- utilized for the high-level task by joint training (ex. segmentation)

- completes the missing domains given random distributed numbers of visible domains

- consistent performance improvement on three datasets
n n
Related Works
Image-to-Image

Translation
J.Y Zhu et al. Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks. ICCV 2017
- Impressive performance via cycle-consistency loss

- only 1-to-1 mapping
• CycleGAN
Related Works
Image-to-Image

Translation
Y Choi et al. StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation. CVPR 2018

J Yoon et al. RadialGAN: Leveraging multiple datasets to improve target-specific predictive models using GANs. ICML 2018
- Multi-domain image generation

- only 1-to- mapping (generation is always conditioned on the single input
image as the only source domain)
n
• StarGAN & RadialGAN
StarGAN RadialGAN
Related Works
Image-to-Image

Translation
D Lee et al. CollaGAN: Collaborative GAN for Missing Image Data Imputation. CVPR 2019
- Collaborative model to incorporate multiple domains for generating one
missing domain

- only -to-1 mappingn
• CollaGAN
Related Works
Image-to-Image

Translation
Red boxes are missing-domain images
Only ReMIC is -to- mappingn n
Related Works
Learning

Disentangled

Representations
• Learning Disentangled Representations

- to capture the full distribution of possible outputs by introducing a random style code

- to transfer information across domains for adaptation

- InfoGAN and -VAE learn the disentangled representation in unsupervised mannerβ
Xi Chen et al. InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets. NIPS 2016

I Higgins et al. beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework. ICLR 2017
https://www.slideshare.net/NaverEngineering/ss-96581209
Related Works
Learning

Disentangled

Representations
• DRIT and MUNIT

- disentangles content and attribute features in image translation

- However, only 1-to-1 translation
H.Y Lee et al. Diverse image-to- image translation via disentangled representations. ECCV 2018

X Huang et al. Multimodal Unsupervised Image-to-Image Translation. ECCV 2018
Related Works
Learning

Disentangled

Representations
• Liu et al

- tackles multi-domain learning cross-domain latent code

- less discussion about the domain-specific style code
Liu et al. A Unified Feature Disentangler for Multi-Domain Image Translation and Manipulation. NIPS 2018
Related Works Medical

Image Synthesis
• Previous works also discuss how to extract representations from multi-
modalities especially for segmentation with missing modalities

- However, fuse the features from multiple modalities but not from the perspective of
representation disentanglement
V Nguyen et al. Cross-domain synthesis of medical images using efficient location-sensitive deep network. MICCAI 2015

M Havaei et al. HeMIS: Hetero-Modal Image Segmentation. MICCAI 2016

A Chartsias et al.Multimodal mr synthesis via modality-invariant latent representation. IEEE transactions on medical imaging 2017
Method
Red boxes are

missing-domain images
Method
- Image decomposition

: Shared content structure (skeleton) + Unique characteristics (flesh)

- Missing image reconstruction during the testing

: Shared skeleton from available domains + Sampled flesh from the learned model
Style code (Domain-specific)

- Style encoder Es
i (xi) = si (1 ≤ i ≤ N)
Content code (Shared)

- Content encoder Ec
(x1, x2, …, xN) = c
Method
- Content codes visualization (randomly selected 8 out of 256 channels) of BraTs

: Various focuses on different anatomical structures (ex. tumor, brain, skull) are
demonstrated by different channel-wise feature maps
Input images
Method
- Generation : Style codes from a prior distribution + Content Code 

-
si c
Gi(c, si) = ˜xi
Image Generation Process
Method
Segmentation Branch
- Segmentation generator after content codes

- Assumption : The content codes contain essential image structure information

- Joint training (generation loss + segmentation Dice loss)

: adaptively learn how to generate missing images
GS
Method
Training Loss
Image Consistency Loss
Latent Consistency Loss
Adversarial Loss
Reconstruction Loss
Segmentation Loss
Method
Total Loss
Training Loss
Adversarial

(λadv = 1)
Image

Consistency

(λx
cyc = 10)
Style Latent

Consistency

(λs
cyc = 1)
Reconstruction

(λrec = 20)
Content Latent

Consistency

(λc
cyc = 1)
Segmentation

(λseg = 1)
Experiments
• BraTS 2018 dataset

- Multi-modal brain MRI with four modalities : T1, T1Gd, T2, FLAIR

- Following CollaGAN, 218 training and 28 testing samples randomly selected

- A set of 2D slices (40,148 training / 5,340 test) extracted from 3D volumes

- Resized to 256 256

- Three tumor categories

: Enhancing tumor (ET), tumor core (TC), and whole tumor (WT)
×
D Lee et al. CollaGAN: Collaborative GAN for Missing Image Data Imputation. CVPR 2019

B.H Menze et al. The Multimodal Brain Tumor Image Segmentation Benchmark (BRATS). IEEE TMI
Experiments
• ProstateX dataset

- Multi-parametric prostate MR scans for 98 subjects : T2, ADC, HighB

- 78 training and 20 testing samples randomly selected

- A set of 2D slices (3,540 training / 840 test) extracted from 3D volumes

- Resized to 256 256

- Prostate regions are manually labeled as the whole prostate (WP)
×
G Litjens et al. Computer-aided detection of prostate cancer in MRI. IEEE TMI
Experiments
• RaFD (Radboud Faces Database)

- Eight facial expressions

: neutral, angry, contemptuous, disgusted, fearful, happy, sand, and surprised

- Following StarGAN, adopt image from three camera angles with three gaze
directions

- 3,888 training (54 participants) / 936 test (13 participants)

- Cropped with the face in the center and Resized to 128 128×
Y Choi et al. StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation. CVPR 2018
http://www.socsci.ru.nl:8180/RaFD2/RaFD
Results
• Multi-Domain Image Completion on Domains

- Only One Missing Domain ( -to-1)

* Training : the one missing domain is randomly distributed

* Testing : Fix the one missing domain and generate outputs only on that 



- More than One Missing Domains ( -to- )

* Training : randomly selected visible domains

* Testing : Fix while these domains are randomly selected visible 

domains. Evaluate all the generated images



- Evaluation metrics

* NRMSE (Normalized Root Mean Squared Error)

* SSIM (Structural Similarity) 

* PSNR (Peak Signal-to-Noise Ratio)
N
n
n n
k (k ∈ {1,…, N − 1})
k k
N
Results
• Multi-Domain Image Completion

- Comparison with MUNIT, StarGAN, and CollaGAN

- ReMIC w/o Recon : ReMIC without reconstruction loss (single missing domain)

- ReMIC-Random : random visible domains (multiple missing domains)(k = * ) k
Results
• Multi-Domain Image Completion

- Comparison with MUNIT, StarGAN, and CollaGAN

- ReMIC w/o Recon : ReMIC without reconstruction loss (single missing domain)

- ReMIC-Random : random visible domains (multiple missing domains)(k = * ) k
Results
• Multi-Domain Image Completion
Results
• Multi-Domain Image Completion
Results
• Multi-Domain Image Completion
Results
• Multi-Domain Image Completion
Results
• Multi-Domain Segmentation

- Oracle : fully supervised 2D U-Net variation without missing images

- Oracle+* : the missing images generated from the “*” method

with the pre-trained “Oracle” model (All : without any missing domains)

- ReMIC+Seg : separate content encoders for image generation and 

segmentation tasks

- ReMIC+Joint : sharing the weights of content encoder for the two tasks
Conclusion
• A general framework for multi-domain image completion, given
that one or more input domains are missing

• Learning shared content and domain-specific style encoding
across multiple domains

• Well generalized to both natural and medical images

• Extended for a unified image generation and segmentation
framework for missing-domain segmentation task
Question
• According to this paper, “different modalities provide distinct
features to locate tumor boundaries from differential diagnosis
perspectives”.

But ReMIC uses a content code, which encodes the shared
skeleton, as an input for the connected segmentation generator.
Isn’t is a contradiction?
ICLR 2020 Reviews
• The main contribution is representational disentanglement,
namely the content and style separation, but there is no explicit
evidence that this separation is really happened 

• Evaluation on high-resolution dataset such as CelebHQ and other
conventional metrics such as FID
https://openreview.net/forum?id=rkg_wREYDS
Appendix. A : Implementation Details
• A.1 Hyperparameters

- Adam optimizer 

- Batch size 1 and 100,000 iterations

- Style code dimension : 8

- During testing, a fixed style code of 0.5 in each dimension

• A.2 Network Architectures (Check details in paper)

- ReMIC is developed on the backbone of MUNIT

- Unified Content Encoder : Down-sampling module + Residual Blocks (IN)

- Style Encoder : Down-sampling module + Residual Blocks + GAP + FC

- Generator : Four residual blocks + Up-sampling + AdaIN*

- Discriminator : Four convolutional blocks

- Segmentor : U-Net shaped network
(β1 = 0.5, β2 = 0.999)
X Huang et al. Arbitrary style transfer in real-time with adaptive instance normalization. ICCV 2017
Appendix. C : Extended Ablation Study and
Results for Multi-domain Segmentation
• C.4 Analysis of missing-domain segmentation results
Thank you

Contenu connexe

Tendances

Deformable DETR Review [CDM]
Deformable DETR Review [CDM]Deformable DETR Review [CDM]
Deformable DETR Review [CDM]Dongmin Choi
 
Vision Transformer(ViT) / An Image is Worth 16*16 Words: Transformers for Ima...
Vision Transformer(ViT) / An Image is Worth 16*16 Words: Transformers for Ima...Vision Transformer(ViT) / An Image is Worth 16*16 Words: Transformers for Ima...
Vision Transformer(ViT) / An Image is Worth 16*16 Words: Transformers for Ima...changedaeoh
 
ViT (Vision Transformer) Review [CDM]
ViT (Vision Transformer) Review [CDM]ViT (Vision Transformer) Review [CDM]
ViT (Vision Transformer) Review [CDM]Dongmin Choi
 
Transformer in Vision
Transformer in VisionTransformer in Vision
Transformer in VisionSangmin Woo
 
Shai Avidan's Support vector tracking and ensemble tracking
Shai Avidan's Support vector tracking and ensemble trackingShai Avidan's Support vector tracking and ensemble tracking
Shai Avidan's Support vector tracking and ensemble trackingwolf
 
PR-232: AutoML-Zero:Evolving Machine Learning Algorithms From Scratch
PR-232:  AutoML-Zero:Evolving Machine Learning Algorithms From ScratchPR-232:  AutoML-Zero:Evolving Machine Learning Algorithms From Scratch
PR-232: AutoML-Zero:Evolving Machine Learning Algorithms From ScratchSunghoon Joo
 
Road Network Extraction using Satellite Imagery.
Road Network Extraction using Satellite Imagery.Road Network Extraction using Satellite Imagery.
Road Network Extraction using Satellite Imagery.SUMITRAJ312049
 
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...Jinwon Lee
 
Recent Breakthroughs in AI + Learning Visual-Linguistic Representation in the...
Recent Breakthroughs in AI + Learning Visual-Linguistic Representation in the...Recent Breakthroughs in AI + Learning Visual-Linguistic Representation in the...
Recent Breakthroughs in AI + Learning Visual-Linguistic Representation in the...Sangmin Woo
 
Shallow vs. Deep Image Representations: A Comparative Study with Enhancements...
Shallow vs. Deep Image Representations: A Comparative Study with Enhancements...Shallow vs. Deep Image Representations: A Comparative Study with Enhancements...
Shallow vs. Deep Image Representations: A Comparative Study with Enhancements...CSCJournals
 
[Paper] Multiscale Vision Transformers(MVit)
[Paper] Multiscale Vision Transformers(MVit)[Paper] Multiscale Vision Transformers(MVit)
[Paper] Multiscale Vision Transformers(MVit)Susang Kim
 
Review : Adaptive Consistency Regularization for Semi-Supervised Transfer Lea...
Review : Adaptive Consistency Regularization for Semi-Supervised Transfer Lea...Review : Adaptive Consistency Regularization for Semi-Supervised Transfer Lea...
Review : Adaptive Consistency Regularization for Semi-Supervised Transfer Lea...Dongmin Choi
 
Transformer in Computer Vision
Transformer in Computer VisionTransformer in Computer Vision
Transformer in Computer VisionDongmin Choi
 
Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Deep Learning for Computer Vision: A comparision between Convolutional Neural...Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Deep Learning for Computer Vision: A comparision between Convolutional Neural...Vincenzo Lomonaco
 
Comparing Incremental Learning Strategies for Convolutional Neural Networks
Comparing Incremental Learning Strategies for Convolutional Neural NetworksComparing Incremental Learning Strategies for Convolutional Neural Networks
Comparing Incremental Learning Strategies for Convolutional Neural NetworksVincenzo Lomonaco
 
Image Segmentation Using Deep Learning : A survey
Image Segmentation Using Deep Learning : A surveyImage Segmentation Using Deep Learning : A survey
Image Segmentation Using Deep Learning : A surveyNUPUR YADAV
 
Visual Object Tracking: review
Visual Object Tracking: reviewVisual Object Tracking: review
Visual Object Tracking: reviewDmytro Mishkin
 
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015Jia-Bin Huang
 
Transfer Learning in NLP: A Survey
Transfer Learning in NLP: A SurveyTransfer Learning in NLP: A Survey
Transfer Learning in NLP: A SurveyNUPUR YADAV
 
Object Detection using Deep Neural Networks
Object Detection using Deep Neural NetworksObject Detection using Deep Neural Networks
Object Detection using Deep Neural NetworksUsman Qayyum
 

Tendances (20)

Deformable DETR Review [CDM]
Deformable DETR Review [CDM]Deformable DETR Review [CDM]
Deformable DETR Review [CDM]
 
Vision Transformer(ViT) / An Image is Worth 16*16 Words: Transformers for Ima...
Vision Transformer(ViT) / An Image is Worth 16*16 Words: Transformers for Ima...Vision Transformer(ViT) / An Image is Worth 16*16 Words: Transformers for Ima...
Vision Transformer(ViT) / An Image is Worth 16*16 Words: Transformers for Ima...
 
ViT (Vision Transformer) Review [CDM]
ViT (Vision Transformer) Review [CDM]ViT (Vision Transformer) Review [CDM]
ViT (Vision Transformer) Review [CDM]
 
Transformer in Vision
Transformer in VisionTransformer in Vision
Transformer in Vision
 
Shai Avidan's Support vector tracking and ensemble tracking
Shai Avidan's Support vector tracking and ensemble trackingShai Avidan's Support vector tracking and ensemble tracking
Shai Avidan's Support vector tracking and ensemble tracking
 
PR-232: AutoML-Zero:Evolving Machine Learning Algorithms From Scratch
PR-232:  AutoML-Zero:Evolving Machine Learning Algorithms From ScratchPR-232:  AutoML-Zero:Evolving Machine Learning Algorithms From Scratch
PR-232: AutoML-Zero:Evolving Machine Learning Algorithms From Scratch
 
Road Network Extraction using Satellite Imagery.
Road Network Extraction using Satellite Imagery.Road Network Extraction using Satellite Imagery.
Road Network Extraction using Satellite Imagery.
 
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
 
Recent Breakthroughs in AI + Learning Visual-Linguistic Representation in the...
Recent Breakthroughs in AI + Learning Visual-Linguistic Representation in the...Recent Breakthroughs in AI + Learning Visual-Linguistic Representation in the...
Recent Breakthroughs in AI + Learning Visual-Linguistic Representation in the...
 
Shallow vs. Deep Image Representations: A Comparative Study with Enhancements...
Shallow vs. Deep Image Representations: A Comparative Study with Enhancements...Shallow vs. Deep Image Representations: A Comparative Study with Enhancements...
Shallow vs. Deep Image Representations: A Comparative Study with Enhancements...
 
[Paper] Multiscale Vision Transformers(MVit)
[Paper] Multiscale Vision Transformers(MVit)[Paper] Multiscale Vision Transformers(MVit)
[Paper] Multiscale Vision Transformers(MVit)
 
Review : Adaptive Consistency Regularization for Semi-Supervised Transfer Lea...
Review : Adaptive Consistency Regularization for Semi-Supervised Transfer Lea...Review : Adaptive Consistency Regularization for Semi-Supervised Transfer Lea...
Review : Adaptive Consistency Regularization for Semi-Supervised Transfer Lea...
 
Transformer in Computer Vision
Transformer in Computer VisionTransformer in Computer Vision
Transformer in Computer Vision
 
Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Deep Learning for Computer Vision: A comparision between Convolutional Neural...Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Deep Learning for Computer Vision: A comparision between Convolutional Neural...
 
Comparing Incremental Learning Strategies for Convolutional Neural Networks
Comparing Incremental Learning Strategies for Convolutional Neural NetworksComparing Incremental Learning Strategies for Convolutional Neural Networks
Comparing Incremental Learning Strategies for Convolutional Neural Networks
 
Image Segmentation Using Deep Learning : A survey
Image Segmentation Using Deep Learning : A surveyImage Segmentation Using Deep Learning : A survey
Image Segmentation Using Deep Learning : A survey
 
Visual Object Tracking: review
Visual Object Tracking: reviewVisual Object Tracking: review
Visual Object Tracking: review
 
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
 
Transfer Learning in NLP: A Survey
Transfer Learning in NLP: A SurveyTransfer Learning in NLP: A Survey
Transfer Learning in NLP: A Survey
 
Object Detection using Deep Neural Networks
Object Detection using Deep Neural NetworksObject Detection using Deep Neural Networks
Object Detection using Deep Neural Networks
 

Similaire à Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]

Obscenity Detection in Images
Obscenity Detection in ImagesObscenity Detection in Images
Obscenity Detection in ImagesAnil Kumar Gupta
 
Weave-D - 2nd Progress Evaluation Presentation
Weave-D - 2nd Progress Evaluation PresentationWeave-D - 2nd Progress Evaluation Presentation
Weave-D - 2nd Progress Evaluation Presentationlasinducharith
 
Searching Images with MPEG-7 (& MPEG-7 Like) Powered Localized dEscriptors (S...
Searching Images with MPEG-7 (& MPEG-7 Like) Powered Localized dEscriptors (S...Searching Images with MPEG-7 (& MPEG-7 Like) Powered Localized dEscriptors (S...
Searching Images with MPEG-7 (& MPEG-7 Like) Powered Localized dEscriptors (S...Savvas Chatzichristofis
 
Image processing and alignment with RNiftyReg and mmand
Image processing and alignment with RNiftyReg and mmandImage processing and alignment with RNiftyReg and mmand
Image processing and alignment with RNiftyReg and mmandJonathan Clayden
 
Computer vision-nit-silchar-hackathon
Computer vision-nit-silchar-hackathonComputer vision-nit-silchar-hackathon
Computer vision-nit-silchar-hackathonAditya Bhattacharya
 
2019 cvpr paper_overview
2019 cvpr paper_overview2019 cvpr paper_overview
2019 cvpr paper_overviewLEE HOSEONG
 
2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong LeeMoazzem Hossain
 
OReilly AI Transfer Learning
OReilly AI Transfer LearningOReilly AI Transfer Learning
OReilly AI Transfer LearningDanielle Dean
 
How to use transfer learning to bootstrap image classification and question a...
How to use transfer learning to bootstrap image classification and question a...How to use transfer learning to bootstrap image classification and question a...
How to use transfer learning to bootstrap image classification and question a...Wee Hyong Tok
 
TIP_TAViT_presentation.pdf
TIP_TAViT_presentation.pdfTIP_TAViT_presentation.pdf
TIP_TAViT_presentation.pdfBoahKim2
 
Lecture_16_Self-supervised_Learning.pptx
Lecture_16_Self-supervised_Learning.pptxLecture_16_Self-supervised_Learning.pptx
Lecture_16_Self-supervised_Learning.pptxKarimdabbabi
 
Rapid object detection using boosted cascade of simple features
Rapid object detection using boosted  cascade of simple featuresRapid object detection using boosted  cascade of simple features
Rapid object detection using boosted cascade of simple featuresHirantha Pradeep
 
AGE AND GENDER DETECTION.pptx
AGE AND GENDER DETECTION.pptxAGE AND GENDER DETECTION.pptx
AGE AND GENDER DETECTION.pptxssuserb4a9ba
 
ageandgenderdetection-220802061020-9ee5a2cd.pptx
ageandgenderdetection-220802061020-9ee5a2cd.pptxageandgenderdetection-220802061020-9ee5a2cd.pptx
ageandgenderdetection-220802061020-9ee5a2cd.pptxdhaliwalharsh055
 
Recent Progress on Object Detection_20170331
Recent Progress on Object Detection_20170331Recent Progress on Object Detection_20170331
Recent Progress on Object Detection_20170331Jihong Kang
 
Deep learning in Computer Vision
Deep learning in Computer VisionDeep learning in Computer Vision
Deep learning in Computer VisionDavid Dao
 
Automated_attendance_system_project.pptx
Automated_attendance_system_project.pptxAutomated_attendance_system_project.pptx
Automated_attendance_system_project.pptxNaveensai51
 
Unsupervised/Self-supervvised visual object tracking
Unsupervised/Self-supervvised visual object trackingUnsupervised/Self-supervvised visual object tracking
Unsupervised/Self-supervvised visual object trackingYu Huang
 

Similaire à Review : Multi-Domain Image Completion for Random Missing Input Data [cdm] (20)

lec6a.ppt
lec6a.pptlec6a.ppt
lec6a.ppt
 
Obscenity Detection in Images
Obscenity Detection in ImagesObscenity Detection in Images
Obscenity Detection in Images
 
Weave-D - 2nd Progress Evaluation Presentation
Weave-D - 2nd Progress Evaluation PresentationWeave-D - 2nd Progress Evaluation Presentation
Weave-D - 2nd Progress Evaluation Presentation
 
Searching Images with MPEG-7 (& MPEG-7 Like) Powered Localized dEscriptors (S...
Searching Images with MPEG-7 (& MPEG-7 Like) Powered Localized dEscriptors (S...Searching Images with MPEG-7 (& MPEG-7 Like) Powered Localized dEscriptors (S...
Searching Images with MPEG-7 (& MPEG-7 Like) Powered Localized dEscriptors (S...
 
Image processing and alignment with RNiftyReg and mmand
Image processing and alignment with RNiftyReg and mmandImage processing and alignment with RNiftyReg and mmand
Image processing and alignment with RNiftyReg and mmand
 
Computer vision-nit-silchar-hackathon
Computer vision-nit-silchar-hackathonComputer vision-nit-silchar-hackathon
Computer vision-nit-silchar-hackathon
 
2019 cvpr paper_overview
2019 cvpr paper_overview2019 cvpr paper_overview
2019 cvpr paper_overview
 
2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee
 
OReilly AI Transfer Learning
OReilly AI Transfer LearningOReilly AI Transfer Learning
OReilly AI Transfer Learning
 
How to use transfer learning to bootstrap image classification and question a...
How to use transfer learning to bootstrap image classification and question a...How to use transfer learning to bootstrap image classification and question a...
How to use transfer learning to bootstrap image classification and question a...
 
CBIR_white.ppt
CBIR_white.pptCBIR_white.ppt
CBIR_white.ppt
 
TIP_TAViT_presentation.pdf
TIP_TAViT_presentation.pdfTIP_TAViT_presentation.pdf
TIP_TAViT_presentation.pdf
 
Lecture_16_Self-supervised_Learning.pptx
Lecture_16_Self-supervised_Learning.pptxLecture_16_Self-supervised_Learning.pptx
Lecture_16_Self-supervised_Learning.pptx
 
Rapid object detection using boosted cascade of simple features
Rapid object detection using boosted  cascade of simple featuresRapid object detection using boosted  cascade of simple features
Rapid object detection using boosted cascade of simple features
 
AGE AND GENDER DETECTION.pptx
AGE AND GENDER DETECTION.pptxAGE AND GENDER DETECTION.pptx
AGE AND GENDER DETECTION.pptx
 
ageandgenderdetection-220802061020-9ee5a2cd.pptx
ageandgenderdetection-220802061020-9ee5a2cd.pptxageandgenderdetection-220802061020-9ee5a2cd.pptx
ageandgenderdetection-220802061020-9ee5a2cd.pptx
 
Recent Progress on Object Detection_20170331
Recent Progress on Object Detection_20170331Recent Progress on Object Detection_20170331
Recent Progress on Object Detection_20170331
 
Deep learning in Computer Vision
Deep learning in Computer VisionDeep learning in Computer Vision
Deep learning in Computer Vision
 
Automated_attendance_system_project.pptx
Automated_attendance_system_project.pptxAutomated_attendance_system_project.pptx
Automated_attendance_system_project.pptx
 
Unsupervised/Self-supervvised visual object tracking
Unsupervised/Self-supervvised visual object trackingUnsupervised/Self-supervvised visual object tracking
Unsupervised/Self-supervvised visual object tracking
 

Plus de Dongmin Choi

YolactEdge Review [cdm]
YolactEdge Review [cdm]YolactEdge Review [cdm]
YolactEdge Review [cdm]Dongmin Choi
 
Review : Inter-slice Context Residual Learning for 3D Medical Image Segmentation
Review : Inter-slice Context Residual Learning for 3D Medical Image SegmentationReview : Inter-slice Context Residual Learning for 3D Medical Image Segmentation
Review : Inter-slice Context Residual Learning for 3D Medical Image SegmentationDongmin Choi
 
Pyradiomics Customization [CDM]
Pyradiomics Customization [CDM]Pyradiomics Customization [CDM]
Pyradiomics Customization [CDM]Dongmin Choi
 
Seeing What a GAN Cannot Generate [cdm]
Seeing What a GAN Cannot Generate [cdm]Seeing What a GAN Cannot Generate [cdm]
Seeing What a GAN Cannot Generate [cdm]Dongmin Choi
 
Neural network pruning with residual connections and limited-data review [cdm]
Neural network pruning with residual connections and limited-data review [cdm]Neural network pruning with residual connections and limited-data review [cdm]
Neural network pruning with residual connections and limited-data review [cdm]Dongmin Choi
 
Network Deconvolution review [cdm]
Network Deconvolution review [cdm]Network Deconvolution review [cdm]
Network Deconvolution review [cdm]Dongmin Choi
 
Objects as points (CenterNet) review [CDM]
Objects as points (CenterNet) review [CDM]Objects as points (CenterNet) review [CDM]
Objects as points (CenterNet) review [CDM]Dongmin Choi
 
Augmix review [cdm]
Augmix review [cdm]Augmix review [cdm]
Augmix review [cdm]Dongmin Choi
 
Bag of tricks for image classification with convolutional neural networks r...
Bag of tricks for image classification with convolutional neural networks   r...Bag of tricks for image classification with convolutional neural networks   r...
Bag of tricks for image classification with convolutional neural networks r...Dongmin Choi
 
ICCV 2019 REVIEW [CDM]
ICCV 2019 REVIEW [CDM]ICCV 2019 REVIEW [CDM]
ICCV 2019 REVIEW [CDM]Dongmin Choi
 
[Review] High-performance medicine: the convergence of human and artificial i...
[Review] High-performance medicine: the convergence of human and artificial i...[Review] High-performance medicine: the convergence of human and artificial i...
[Review] High-performance medicine: the convergence of human and artificial i...Dongmin Choi
 

Plus de Dongmin Choi (11)

YolactEdge Review [cdm]
YolactEdge Review [cdm]YolactEdge Review [cdm]
YolactEdge Review [cdm]
 
Review : Inter-slice Context Residual Learning for 3D Medical Image Segmentation
Review : Inter-slice Context Residual Learning for 3D Medical Image SegmentationReview : Inter-slice Context Residual Learning for 3D Medical Image Segmentation
Review : Inter-slice Context Residual Learning for 3D Medical Image Segmentation
 
Pyradiomics Customization [CDM]
Pyradiomics Customization [CDM]Pyradiomics Customization [CDM]
Pyradiomics Customization [CDM]
 
Seeing What a GAN Cannot Generate [cdm]
Seeing What a GAN Cannot Generate [cdm]Seeing What a GAN Cannot Generate [cdm]
Seeing What a GAN Cannot Generate [cdm]
 
Neural network pruning with residual connections and limited-data review [cdm]
Neural network pruning with residual connections and limited-data review [cdm]Neural network pruning with residual connections and limited-data review [cdm]
Neural network pruning with residual connections and limited-data review [cdm]
 
Network Deconvolution review [cdm]
Network Deconvolution review [cdm]Network Deconvolution review [cdm]
Network Deconvolution review [cdm]
 
Objects as points (CenterNet) review [CDM]
Objects as points (CenterNet) review [CDM]Objects as points (CenterNet) review [CDM]
Objects as points (CenterNet) review [CDM]
 
Augmix review [cdm]
Augmix review [cdm]Augmix review [cdm]
Augmix review [cdm]
 
Bag of tricks for image classification with convolutional neural networks r...
Bag of tricks for image classification with convolutional neural networks   r...Bag of tricks for image classification with convolutional neural networks   r...
Bag of tricks for image classification with convolutional neural networks r...
 
ICCV 2019 REVIEW [CDM]
ICCV 2019 REVIEW [CDM]ICCV 2019 REVIEW [CDM]
ICCV 2019 REVIEW [CDM]
 
[Review] High-performance medicine: the convergence of human and artificial i...
[Review] High-performance medicine: the convergence of human and artificial i...[Review] High-performance medicine: the convergence of human and artificial i...
[Review] High-performance medicine: the convergence of human and artificial i...
 

Dernier

What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 

Dernier (20)

What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 

Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]

  • 1. Multi-Domain Image Completion for Random Missing Input Data Stanford University, NVIDIA, and National Institutes of Health Yonsei University Severance Hospital CCIDS Choi Dongmin
  • 2. Introduction • Multi-domain images could provide complementary knowledge
 - ex. Four MRI modalities (T1, T1CE, T2, FLAIR) provide distinct features to locate tumor boundaries from different diagnosis perspective
 - ex. Person re-identification across different cameras or times • However, some image domains might be missing in practice
 - Solution 1. Nearest neighbor approach : lack of semantic consistency
 - Solution 2. Generative models • ReMIC (Representational disentanglement schemes for Multi-domain Image Completion)
 - -to- image completion framework 
 - utilized for the high-level task by joint training (ex. segmentation)
 - completes the missing domains given random distributed numbers of visible domains
 - consistent performance improvement on three datasets n n
  • 3. Related Works Image-to-Image
 Translation J.Y Zhu et al. Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks. ICCV 2017 - Impressive performance via cycle-consistency loss - only 1-to-1 mapping • CycleGAN
  • 4. Related Works Image-to-Image
 Translation Y Choi et al. StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation. CVPR 2018 J Yoon et al. RadialGAN: Leveraging multiple datasets to improve target-specific predictive models using GANs. ICML 2018 - Multi-domain image generation - only 1-to- mapping (generation is always conditioned on the single input image as the only source domain) n • StarGAN & RadialGAN StarGAN RadialGAN
  • 5. Related Works Image-to-Image
 Translation D Lee et al. CollaGAN: Collaborative GAN for Missing Image Data Imputation. CVPR 2019 - Collaborative model to incorporate multiple domains for generating one missing domain - only -to-1 mappingn • CollaGAN
  • 6. Related Works Image-to-Image
 Translation Red boxes are missing-domain images Only ReMIC is -to- mappingn n
  • 7. Related Works Learning
 Disentangled
 Representations • Learning Disentangled Representations
 - to capture the full distribution of possible outputs by introducing a random style code
 - to transfer information across domains for adaptation
 - InfoGAN and -VAE learn the disentangled representation in unsupervised mannerβ Xi Chen et al. InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets. NIPS 2016 I Higgins et al. beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework. ICLR 2017 https://www.slideshare.net/NaverEngineering/ss-96581209
  • 8. Related Works Learning
 Disentangled
 Representations • DRIT and MUNIT
 - disentangles content and attribute features in image translation
 - However, only 1-to-1 translation H.Y Lee et al. Diverse image-to- image translation via disentangled representations. ECCV 2018 X Huang et al. Multimodal Unsupervised Image-to-Image Translation. ECCV 2018
  • 9. Related Works Learning
 Disentangled
 Representations • Liu et al
 - tackles multi-domain learning cross-domain latent code
 - less discussion about the domain-specific style code Liu et al. A Unified Feature Disentangler for Multi-Domain Image Translation and Manipulation. NIPS 2018
  • 10. Related Works Medical
 Image Synthesis • Previous works also discuss how to extract representations from multi- modalities especially for segmentation with missing modalities
 - However, fuse the features from multiple modalities but not from the perspective of representation disentanglement V Nguyen et al. Cross-domain synthesis of medical images using efficient location-sensitive deep network. MICCAI 2015 M Havaei et al. HeMIS: Hetero-Modal Image Segmentation. MICCAI 2016
 A Chartsias et al.Multimodal mr synthesis via modality-invariant latent representation. IEEE transactions on medical imaging 2017
  • 12. Method - Image decomposition
 : Shared content structure (skeleton) + Unique characteristics (flesh) - Missing image reconstruction during the testing
 : Shared skeleton from available domains + Sampled flesh from the learned model Style code (Domain-specific)
 - Style encoder Es i (xi) = si (1 ≤ i ≤ N) Content code (Shared)
 - Content encoder Ec (x1, x2, …, xN) = c
  • 13. Method - Content codes visualization (randomly selected 8 out of 256 channels) of BraTs
 : Various focuses on different anatomical structures (ex. tumor, brain, skull) are demonstrated by different channel-wise feature maps Input images
  • 14. Method - Generation : Style codes from a prior distribution + Content Code - si c Gi(c, si) = ˜xi Image Generation Process
  • 15. Method Segmentation Branch - Segmentation generator after content codes - Assumption : The content codes contain essential image structure information - Joint training (generation loss + segmentation Dice loss)
 : adaptively learn how to generate missing images GS
  • 16. Method Training Loss Image Consistency Loss Latent Consistency Loss Adversarial Loss Reconstruction Loss Segmentation Loss
  • 17. Method Total Loss Training Loss Adversarial
 (λadv = 1) Image
 Consistency
 (λx cyc = 10) Style Latent
 Consistency
 (λs cyc = 1) Reconstruction
 (λrec = 20) Content Latent
 Consistency
 (λc cyc = 1) Segmentation
 (λseg = 1)
  • 18. Experiments • BraTS 2018 dataset
 - Multi-modal brain MRI with four modalities : T1, T1Gd, T2, FLAIR
 - Following CollaGAN, 218 training and 28 testing samples randomly selected
 - A set of 2D slices (40,148 training / 5,340 test) extracted from 3D volumes
 - Resized to 256 256
 - Three tumor categories
 : Enhancing tumor (ET), tumor core (TC), and whole tumor (WT) × D Lee et al. CollaGAN: Collaborative GAN for Missing Image Data Imputation. CVPR 2019 B.H Menze et al. The Multimodal Brain Tumor Image Segmentation Benchmark (BRATS). IEEE TMI
  • 19. Experiments • ProstateX dataset
 - Multi-parametric prostate MR scans for 98 subjects : T2, ADC, HighB
 - 78 training and 20 testing samples randomly selected
 - A set of 2D slices (3,540 training / 840 test) extracted from 3D volumes
 - Resized to 256 256
 - Prostate regions are manually labeled as the whole prostate (WP) × G Litjens et al. Computer-aided detection of prostate cancer in MRI. IEEE TMI
  • 20. Experiments • RaFD (Radboud Faces Database)
 - Eight facial expressions
 : neutral, angry, contemptuous, disgusted, fearful, happy, sand, and surprised
 - Following StarGAN, adopt image from three camera angles with three gaze directions
 - 3,888 training (54 participants) / 936 test (13 participants)
 - Cropped with the face in the center and Resized to 128 128× Y Choi et al. StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation. CVPR 2018 http://www.socsci.ru.nl:8180/RaFD2/RaFD
  • 21. Results • Multi-Domain Image Completion on Domains
 - Only One Missing Domain ( -to-1)
 * Training : the one missing domain is randomly distributed
 * Testing : Fix the one missing domain and generate outputs only on that 
 
 - More than One Missing Domains ( -to- )
 * Training : randomly selected visible domains
 * Testing : Fix while these domains are randomly selected visible 
 domains. Evaluate all the generated images
 
 - Evaluation metrics
 * NRMSE (Normalized Root Mean Squared Error)
 * SSIM (Structural Similarity) 
 * PSNR (Peak Signal-to-Noise Ratio) N n n n k (k ∈ {1,…, N − 1}) k k N
  • 22. Results • Multi-Domain Image Completion
 - Comparison with MUNIT, StarGAN, and CollaGAN
 - ReMIC w/o Recon : ReMIC without reconstruction loss (single missing domain)
 - ReMIC-Random : random visible domains (multiple missing domains)(k = * ) k
  • 23. Results • Multi-Domain Image Completion
 - Comparison with MUNIT, StarGAN, and CollaGAN
 - ReMIC w/o Recon : ReMIC without reconstruction loss (single missing domain)
 - ReMIC-Random : random visible domains (multiple missing domains)(k = * ) k
  • 28. Results • Multi-Domain Segmentation
 - Oracle : fully supervised 2D U-Net variation without missing images
 - Oracle+* : the missing images generated from the “*” method
 with the pre-trained “Oracle” model (All : without any missing domains)
 - ReMIC+Seg : separate content encoders for image generation and 
 segmentation tasks
 - ReMIC+Joint : sharing the weights of content encoder for the two tasks
  • 29. Conclusion • A general framework for multi-domain image completion, given that one or more input domains are missing • Learning shared content and domain-specific style encoding across multiple domains • Well generalized to both natural and medical images • Extended for a unified image generation and segmentation framework for missing-domain segmentation task
  • 30. Question • According to this paper, “different modalities provide distinct features to locate tumor boundaries from differential diagnosis perspectives”.
 But ReMIC uses a content code, which encodes the shared skeleton, as an input for the connected segmentation generator. Isn’t is a contradiction?
  • 31. ICLR 2020 Reviews • The main contribution is representational disentanglement, namely the content and style separation, but there is no explicit evidence that this separation is really happened • Evaluation on high-resolution dataset such as CelebHQ and other conventional metrics such as FID https://openreview.net/forum?id=rkg_wREYDS
  • 32. Appendix. A : Implementation Details • A.1 Hyperparameters
 - Adam optimizer 
 - Batch size 1 and 100,000 iterations
 - Style code dimension : 8
 - During testing, a fixed style code of 0.5 in each dimension • A.2 Network Architectures (Check details in paper)
 - ReMIC is developed on the backbone of MUNIT
 - Unified Content Encoder : Down-sampling module + Residual Blocks (IN)
 - Style Encoder : Down-sampling module + Residual Blocks + GAP + FC
 - Generator : Four residual blocks + Up-sampling + AdaIN*
 - Discriminator : Four convolutional blocks
 - Segmentor : U-Net shaped network (β1 = 0.5, β2 = 0.999) X Huang et al. Arbitrary style transfer in real-time with adaptive instance normalization. ICCV 2017
  • 33. Appendix. C : Extended Ablation Study and Results for Multi-domain Segmentation • C.4 Analysis of missing-domain segmentation results