Lecture conducted by me on Deep Learning concepts and applications. Discussed FNNs, CNNs, simple RNNs, and LSTM networks in detail. Finally, conducted a hands-on session on deep learning using Keras and scikit-learn.
4. Artificial Neural Network
• Computational model based on the structure and functions of biological neural networks
[Figures: the structure of a single artificial neuron; the structure of a basic biological neuron]
5. A Neuron - Function
• Receiving information: the processing unit obtains the information as inputs x1, x2, ..., xn.
• Weighting: each input is weighted by its corresponding weight, denoted w1, w2, ..., wn (w0 typically serves as the bias weight).
• Activation: an activation function f is applied to the sum z of all the weighted inputs.
• Output: an output y = f(z) is generated depending on z (see the code sketch below).
The structure of a single artificial neuron
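As a minimal sketch of this computation in NumPy (the sigmoid activation and all values here are illustrative assumptions, not from the slides):

```python
import numpy as np

def sigmoid(z):
    # Activation function f: squashes z into (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

def neuron(x, w, b):
    # Weighted sum of the inputs plus bias, then activation
    z = np.dot(w, x) + b
    return sigmoid(z)

x = np.array([0.5, -1.0, 2.0])   # inputs x1..xn
w = np.array([0.1, 0.4, -0.2])   # weights w1..wn
b = 0.3                          # bias weight (w0)
y = neuron(x, w, b)              # output y = f(z)
```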
10. Issues with Hand-Engineered Features
• Most critical for accuracy
• Most time-consuming in development
• What is the best feature?
• What is next? Keep on crafting better features?
• Let's learn feature representations directly from data.
11. Learning Features and Classifier Together
• A non-linear mapping that takes raw pixels directly to labels
• How to build it? By combining simple building blocks (i.e. layers in a neural network)
[Figure: two candidate designs, Option 1 and Option 2 ("Hmmm… which is better?"); Option 2 is better]
12. Intuition behind Deep Neural Nets
• Each layer has parameters subject to learning
• Composition makes a highly non-linear system
• In the case of classification, the final layer outputs a probability distribution over categories (see the softmax sketch below)
[Figure: a stack of layers, with the final layer producing the category distribution]
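The standard way to turn the final layer's raw scores into a probability distribution is a softmax; a minimal sketch (the input values are illustrative):

```python
import numpy as np

def softmax(logits):
    # Subtract the max for numerical stability, then normalise
    exp = np.exp(logits - np.max(logits))
    return exp / exp.sum()

logits = np.array([2.0, 1.0, 0.1])  # raw scores from the final layer
probs = softmax(logits)             # sums to 1: a distribution over categories
```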
13. Training a Deep Neural Network
• Compute the loss on small batches (forward propagation)
• Compute the gradient w.r.t. the parameters (backpropagation)
• Use the gradient to update the parameters (see the Keras sketch below)
[Diagram: a network mapping input 𝑋 to a prediction, with the error measured against the target 𝑦. Design choices: number of hidden units, number of hidden layers, type of layer, loss function]
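Since the hands-on session uses Keras, here is a minimal sketch of this training loop; the data, architecture, and hyperparameters are all illustrative assumptions:

```python
import numpy as np
from tensorflow import keras

# Illustrative random data: 100 samples, 20 features, 3 classes
X = np.random.rand(100, 20)
y = keras.utils.to_categorical(np.random.randint(3, size=100), 3)

model = keras.Sequential([
    keras.Input(shape=(20,)),
    keras.layers.Dense(32, activation="relu"),    # hidden layer
    keras.layers.Dense(3, activation="softmax"),  # category distribution
])

# fit() runs the loop described above: forward pass on mini-batches,
# gradients via backpropagation, parameter updates via the optimizer
model.compile(optimizer="sgd", loss="categorical_crossentropy")
model.fit(X, y, batch_size=16, epochs=5)
```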
17. Reduce connections to local regions
Example: a 1000 x 1000 image with 1M hidden units
Filter size: 10 x 10
100M parameters: 10^6 units x 100 weights each (versus 10^12 weights for full connectivity); the arithmetic is sketched below
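The parameter count, as a quick sketch in Python (the image size, unit count, and filter size come straight from the slide):

```python
image_pixels = 1000 * 1000           # 10**6 input pixels
hidden_units = 10**6                 # 1M hidden units

fully_connected = hidden_units * image_pixels  # 10**12 weights: infeasible
locally_connected = hidden_units * (10 * 10)   # 10**8 = 100M weights
```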
18. Reuse the same kernel everywhere
Why? Because interesting features (e.g. edges) can occur anywhere in the image.
Share the same parameters across different locations: this is convolution with learned kernels (a code sketch follows below).
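A minimal sketch of sliding one shared kernel over an image (technically a cross-correlation, which is what CNN layers compute; the kernel and image values are illustrative):

```python
import numpy as np

def conv2d(image, kernel):
    # Apply the same kernel at every location (shared parameters)
    kh, kw = kernel.shape
    oh, ow = image.shape[0] - kh + 1, image.shape[1] - kw + 1
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            out[i, j] = np.sum(image[i:i+kh, j:j+kw] * kernel)
    return out

image = np.random.rand(8, 8)
edge_kernel = np.array([[1.0, -1.0]])   # a crude horizontal edge detector
response = conv2d(image, edge_kernel)   # high where neighbouring pixels differ
```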
20. Handling Multiple Channels
• An image may contain multiple channels, e.g. a 3-channel (R, G, B) image
• A separate k x k filter is applied to each channel, and the per-channel responses are summed into a single output map (see the sketch below)
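Extending the earlier conv2d sketch to multiple channels: one k x k filter per channel, with the responses summed (the shapes are illustrative):

```python
import numpy as np

def conv2d_multichannel(image, kernels):
    # image: (H, W, C); kernels: (k, k, C), one k x k filter per channel
    k = kernels.shape[0]
    oh, ow = image.shape[0] - k + 1, image.shape[1] - k + 1
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            # Sum the per-channel filter responses into one output value
            out[i, j] = np.sum(image[i:i+k, j:j+k, :] * kernels)
    return out

rgb = np.random.rand(8, 8, 3)       # 3-channel (R, G, B) image
filters = np.random.rand(3, 3, 3)   # k = 3: one 3 x 3 filter per channel
feature_map = conv2d_multichannel(rgb, filters)   # single 6 x 6 output map
```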
21. Translation Invariance
Assume we are going to build an eye detector.
Problem: how do we make the detection robust to the exact eye location?
22. Translation Invariance
Solution: use pooling (max / average) on the filter responses
• Provides robustness to the exact spatial location of features
• Also sub-samples the image, allowing the next layer to look at larger spatial regions (see the sketch below)
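A minimal sketch of max pooling over non-overlapping 2 x 2 windows (the window size and input are illustrative):

```python
import numpy as np

def max_pool(feature_map, size=2):
    # Keep the maximum response in each non-overlapping size x size window
    h, w = feature_map.shape
    out = np.zeros((h // size, w // size))
    for i in range(0, h - size + 1, size):
        for j in range(0, w - size + 1, size):
            out[i // size, j // size] = feature_map[i:i+size, j:j+size].max()
    return out

responses = np.random.rand(8, 8)   # filter responses from a conv layer
pooled = max_pool(responses)       # 4 x 4: sub-sampled and location-robust
```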
23. Summary of Complete CNN
• Doing all of this (convolution, pooling, normalization) constitutes one layer
• Pooling and normalization are optional
• Stack such layers up and train just like multilayer neural nets
• Multiple conv layers can be used to learn high-level features
• The final layer is usually a fully connected neural net with output size equal to the number of classes (see the Keras sketch below)
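Putting the pieces together in Keras, a minimal sketch of a complete CNN; the input shape, layer sizes, and class count are illustrative assumptions:

```python
from tensorflow import keras

num_classes = 10   # assumption for illustration

model = keras.Sequential([
    keras.Input(shape=(28, 28, 1)),                      # e.g. grayscale images
    keras.layers.Conv2D(32, (3, 3), activation="relu"),  # learned kernels
    keras.layers.MaxPooling2D((2, 2)),                   # pooling (optional)
    keras.layers.Conv2D(64, (3, 3), activation="relu"),  # higher-level features
    keras.layers.MaxPooling2D((2, 2)),
    keras.layers.Flatten(),
    keras.layers.Dense(num_classes, activation="softmax"),  # output size == classes
])
model.compile(optimizer="adam", loss="categorical_crossentropy")
model.summary()
```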
24. Recurrent neural network (RNN)
• Considers sequences
• Used in forecasting
• Applications:
  • Language modelling
  • Machine translation
  • Conversation bots
  • Image description
  • Image search
25. Structure of RNN
• Performs the same task for every element of a sequence, with the output depending on the previous computations
• Has a "memory" which captures information about what has been calculated so far
An unrolled recurrent neural network.
26. A Simple RNN
• The same computation is applied at every time step; each output depends on the current input and on the previous computations (the recurrence is written out below)
Unrolled RNN
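The per-step computation can be written as a recurrence; this is the standard simple-RNN formulation, with conventional symbol names rather than ones taken from the slides:

```latex
h_t = \tanh(W_h h_{t-1} + W_x x_t + b_h), \qquad y_t = g(W_y h_t + b_y)
```

The hidden state h_t is the "memory" carried forward from step to step, and the same weights W_h, W_x, W_y are reused at every step.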
27. The Problem of Long-Term Dependencies
• Consider a language model trying to predict the next word based on the previous ones
• The larger the gap between the relevant information and the point where it is needed, the less able a simple RNN is to learn that dependency
• Theoretically this should be possible, but in practice simple RNNs are not capable of representing long-term dependencies
"The clouds are in the sky"
"I grew up in France … I speak fluent French"
28. LSTM - Hochreiter & Schmidhuber (1997)
• A special kind of RNN
• Capable of learning long-term dependencies
• Remembering information for long periods of time is practically their default behaviour, not something they struggle to learn!
An unrolled LSTM
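In Keras, an LSTM layer is a drop-in replacement for a SimpleRNN layer; a minimal sketch for binary sequence classification (the sequence length, feature count, and unit sizes are illustrative assumptions):

```python
from tensorflow import keras

# Sequences of 50 time steps with 8 features each; one binary label per sequence
model = keras.Sequential([
    keras.Input(shape=(50, 8)),
    # A keras.layers.SimpleRNN(32) here would struggle with long-term
    # dependencies; the LSTM's gating handles them by design
    keras.layers.LSTM(32),
    keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy")
model.summary()
```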
31. Practical Session
• See https://online.mrt.ac.lk/mod/folder/view.php?id=65448
• Follow the instructions in Moodle to get started with Colab
• Then follow the instructions in the Python notebook
Similar to what we discussed in the past few slides. However, the traditional deep network, in which every unit takes all the inputs from the previous layer, is not scalable. Images, one of the things that led to this kind of network, are large and complex to learn from; that is, traditional fully connected neural networks do not scale to them.
In the second sentence, recent information suggests that the next word is probably the name of a language, but if we want to narrow down which language, we need the context of France, from further back. It’s entirely possible for the gap between the relevant information and the point where it is needed to become very large.