텐서플로우 2.0 튜토리얼 - CNN

TensorFlow 2.0 Tutorial
CNN
김환희
2019.04.27

목차
• CNN 소개
• Tensorflow 2.0 - CNN
• Tensorflow 2.0 Sample code - CNN

CNN
• State of The Art – ImageNet
• 1,400 만 장의 이미지를 올바른 카테고리로 분류하는 대회
• 2015 년에 ResNet 이 인간의 퍼포먼스를 뛰어넘음
• 2017 년 이후로 더 이상 대회가 열리지 않음
http://blog.a-stack.com/2018/07/06/ImageNet-DataSet/
https://alexisbcook.github.io/2017/using-transfer-learning-to-classify-images-with-keras/
에러율 (%)

CNN
• Convolutional Neural Network
https://terms.naver.com/entry.nhn?docId=864931&cid=42346&categoryId=42346

CNN
• Convolutional Neural Network
= Convolutional Layer(Feature Extractor)
+ Dense Layer(Classifier)
https://www.completegate.com/2017022864/blog/deep-machine-learning-images-lenet-alexnet-cnn/general-architecture-of-a-convolutional-neural-network

Convolution
• 원본 이미지에 일정 크기의 필터를 합성(곱)해서 더한 값으로 새
로운 이미지를 생성
http://deeplearning.stanford.edu/wiki/index.php/Feature_extraction_using_convolution

Convolution
• 간단한 Convolution 필터와 결과 이미지
https://en.wikipedia.org/wiki/Kernel_(image_processing)
수직선 검출 수평선 검출
Box blur Sharpen

전통적 Convolution
• 결과 이미지에서 Feature 를 얻을 수 있음
• 다양한 문제 – object detection, object recognition, denoising – 를 풀 수 있음
https://kr.mathworks.com/help/vision/ug/local-feature-detection-and-extraction.html
https://www.researchgate.net/figure/Sobel-operator-applied-to-the-image-Fig-22-Prewitt-operator-applied-to-image_fig15_221581664
BRISK - blob FAST - corner Sobel Filter - edge

Convolution
• 한계점 : Feature 의 사전 정의 필요
• 전문적 지식 필요
• 수행에 시간이 오래 걸림
• 다른 도메인에 일반화하기 힘듦
https://www.slideshare.net/zukun/eccv2010-feature-learning-for-image-classification-part-0

CNN
• Feature Extraction
• 학습 과정에서 네트워크가 Feature 를 자동 생성
https://opensource.googleblog.com/2018/03/the-building-blocks-of-interpretability.html

Feature Extraction
• Low-level : 모서리, 면
• Mid-level : 바퀴, 큰 부품
• High-level : 헤드라이트 램프, 작은 부품
https://technodocbox.com/3D_Graphics/72864833-Deep-learning-in-radiology-recent-advances-challenges-and-future-trends.html

Feature Extraction
https://adeshpande3.github.io/The-9-Deep-Learning-Papers-You-Need-To-Know-About.html

일반적인 CNN 의 구조
• Conv 와 Pool 이 교차되며 배치됨
• 오버피팅을 막기 위해 Dense 뒤에 Dropout 사용
Conv
Layer
Pool
Layer
Conv
Layer
Pool
Layer
Flat
Layer
Dense
Layer
Dropout
Layer
Dense
Layer
Cat : 0.99
Dog : 0.1
Input
Output
Feature Extractor
Classifier

Conv Layer
• Filter : 데이터의 feature 를 자동 추출하는 역할
• Filter 는 이미지의 모든 영역에 대해서 Convolution을 계산
• Filter 를 재사용하기 때문에 파라미터 수는 많지 않은 편
https://datascience.stackexchange.com/questions/23183/why-convolutions-always-use-odd-numbers-as-filter-size

Pool Layer
• Subsampling 기법
• 인접한 셀은 비슷한 정보를 갖기 때문에 압축으로 효율을 높임
• Max Pooling, Average Pooling 기법이 자주 쓰임
https://www.quora.com/What-is-max-pooling-in-convolutional-neural-networks
https://computersciencewiki.org/index.php/Max-pooling_/_Pooling

Flat Layer
• Conv, Pool 뒤에 배치됨
• Conv, Pool 의 결과를 Flat 이 받아서 Dense(FC) 를 배치할 수 있도록
1차원으로 길게 나열
https://towardsdatascience.com/convolutional-neural-networks-from-the-ground-up-c67bb41454e1

Dense Layer
• Fully-Connected Layer 라고도 함
• 모든 뉴런이 1대 1로 대응되는 가장 기본적인 Layer
• 뉴런 개수에 따라 파라미터 수가 급증
https://github.com/drewnoff/spark-notebook-ml-labs/tree/master/labs/DLFramework

Dropout Layer
• 학습 과정에서 랜덤하게 일정 뉴런을 off 하는 Layer
• 뉴런 사이의 상호 의존성(codependency) 을 줄여서 오버피팅을 방
지하기 위한 목적
https://medium.com/@amarbudhiraja/https-medium-com-amarbudhiraja-learning-less-to-learn-better-dropout-in-deep-machine-learning-74334da4bfc5

퍼포먼스를 높이기 위한 노력
• 더 많은 (더 깊은) Conv Layer
• Image Augmentation

더 많은 (더 깊은) Conv Layer
• 딥러닝에서 네트워크 구조를 깊게 쌓
는 것이 가능해진 이후, Conv Layer 가
중첩된 더 깊은 구조가 계속 나타남
https://arxiv.org/abs/1709.01921

더 많은 (더 깊은) Conv Layer
• 깊이가 깊어질수록 퍼포먼스
는 상승
• 학습 속도 느려짐, 오버피팅
위험
https://medium.com/finc-engineering/cnn-do-we-need-to-go-deeper-afe1041e263e

Image Augmentation
• Train data 의 편향을 보완하기 위한 기법
• 이미지 기울이기, flip, zoom in/out, crop 등으로 train data 수를 늘려 네
트워크의 대응성을 높임
• 학습 속도 느려짐
https://becominghuman.ai/data-augmentation-using-fastai-aefa88ca03f1

Conv2D
• 컬러 채널이 있는 2D 이미지(3D 데이터)를 input 으로 받음
• 높이 x 너비 x 컬러 채널
• Output 도 3D 데이터가 됨
• 높이 x 너비 x 필터 개수
https://medium.com/@udemeudofia01/basic-overview-of-convolutional-neural-network-cnn-4fcc7dbb4f17

Conv2D
• tf.keras.layers 에서 import 할 수 있음

kernel_size
• Filter 가 한 step 마다 계산하는 영역의 크기
• Receptive field 라고도 함
https://github.com/vdumoulin/conv_arithmetic
kernel_size = (3, 3) kernel_size = (4, 4)

strides
• Filter 가 한 step 마다 이동하는 크기
• Strides 가 커지면 출력 이미지가 작아짐
strides = (1, 1) strides = (2, 2)

padding
• valid : filter window 가 input 안에만 위치하도록 함
• same : output 의 size 가 input 과 같아지도록 함
• zero padding : 남는 공간을 0 으로 채워서 계산
padding = ‘valid’ padding = ‘same’

filters
• Conv Layer 를 구성하는 filter 의 수
• Filter 의 수는 네트워크가 얼마나 많은 feature 를 추출할 수 있는지
를 결정

https://www.researchgate.net/figure/Deep-features-visualization-a-output-feature-maps-and-b-convolution-kernels-of-the_fig3_320461081

Tensorflow 2.0 sample code - CNN

Hello World – MNIST
• 0~9로 라벨링된 28x28 pixel의 손글씨 이미지 70,000장을 분류
• 1998년부터 머신러닝의 benchmark 중 하나로 활용됨
https://theanets.readthedocs.io/en/stable/examples/mnist-classifier.html

• Performance : Error rate
• CNN 이 가장 좋은 퍼포먼스를 보임
https://en.wikipedia.org/wiki/MNIST_database

• Tensorflow 홈페이지의 <Get started with TensorFlow 2.0 for beginners> 를
기반으로 수정

CNN with MNIST Dataset
• Google Colab 코드 링크

텐서플로우 2.0 튜토리얼 - CNN

Recommandé

Recommandé

Contenu connexe

Tendances

Tendances (20)

Similaire à 텐서플로우 2.0 튜토리얼 - CNN

Similaire à 텐서플로우 2.0 튜토리얼 - CNN (20)

텐서플로우 2.0 튜토리얼 - CNN

Notes de l'éditeur