SlideShare une entreprise Scribd logo
1  sur  39
Euler angle and gimbal lock
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 1
Euler angle and gimbal lock
Loss of a degree of freedom with Euler angles
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 2
When 𝛽 =
𝜋
2
then cos
𝜋
2
= 0 and sin
𝜋
2
= 1
Euler angle and gimbal lock
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 3
Euler angle and gimbal lock
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 4
Loss of a degree of freedom with Euler angles
Resolve gimbal lock (Loss of a degree of freedom )
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 5
1. Change 𝛽
2. Use different orientation representation
=> quaternion
Rotation don’t commute
𝑅 𝑥 𝑅 𝑦 ≠ 𝑅 𝑦 𝑅 𝑥
Quaternion (四元數)
The history
Complex number
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 6
https://www.youtube.com/watch?v=mHVwd8gYLnI&t=2s
Extend Complex number
What is 𝑏𝑐 𝑖𝑗 ?
How to define 𝑖𝑗 ?
Quaternion (四元數)
Forget about 𝑖𝑗, how about define another one 𝑘 ?
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 7
https://www.youtube.com/watch?v=mHVwd8gYLnI&t=2s
𝑖
𝑗 𝑘
Double cover of quaternion
There are two distinct quaternions for each distinct orientation frame in 3D space.
The belt trick reflects this double-valued relationship, distinguishing a one-circuit 360-degree rotation
from the equivalent two-circuit 720-degree rotation.*
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 8
When applying regression on similar image,
we may get distinct quaternions.
* Andrew J. Hanson (6 February 2006). Visualizing Quaternions. Elsevier. pp. 114–. ISBN 978-0-08-047477-9.
National Chung Cheng University, Taiwan
Robot Vision Laboratory
2017/12/03
Jacky Liu
(Research Note)
Delving deeper into convolutional neural
networks for camera relocalization
About this work
Delving deeper into convolutional neural networks
for camera relocalization
Wu, Jian1 , Ma, Liwei2 , Hu, Xiaolin1
ICRA2017 - IEEE International Conference on Robotics and Automation
1. Tsinghua National Laboratory for Information Science and Technology (TNList), De- partm
ent of Computer Science and Technology, Tsinghua Univer- sity, 100084, Beijing, China
2. Intel Labs China, Intel Corporation, 100090, Beijing, China liwei.ma@intel.com
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 10
Contributions
1. Good rotation representation that solve
the double cover problem of quaternion
(which used by PoseNet)
=> Euler6
2. Camera poses in training set are
always very sparse in the whole pose
space.
=> pose synthesis
3. Regressing orientation & translation
together might not be optimal
=> BranchNet
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 11
Related work
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 12
Camera relocalization
Keypoints
SIFT ORB SCoRe
Keyframes
G.
Klein2008
A. P.
Gee2012
However, these methods only provide a coarse estimation to the
camera pose because of the sparsity of poses in training set.
Camera relocalization Multi-task CNNs
Related work
Camera relocalization - CNN
PoseNet (keyframes-based approach)
• Encodes the key frames in training set into the parameters of models.
SE3-Net
• Point cloud data limits this algorithm to RGB-D
• The number of predicted objects must be specified in training
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 13
Camera relocalization Multi-task CNNs
Related work
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 14
Multi-task CNNs
TCDCN
1. Facial landmark detection
2. Appearance attribute and expression
HyperFace
1. Faces detection
2. Localizaing landmarks
3. Head pose
4. Gender
• Sharing lower layer for low level
common knowledge
• Separate higher layer for specific
predictions
R-CNN
1. Human pose estimation
2. Action detection
MCNNs
1. Attribute relationships
2. Attribute classifiers
Camera relocalization Multi-task CNNs
Related work
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 15
• Sharing lower layer for low level
common knowledge
• Separate higher layer for specific
predictions
Camera relocalization Multi-task CNNs
Input
Task1
Task2
Method
Summary
A. Orientation Representation
B. Pose Synthesis
C. Mutli-task CNN for Camera Relocalization
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 16
Method - Orientation Representation
Predict
Q = [0,1,0,0]
Ground truth
Q’ = [0,-1,0,0]
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 17
translation orientation
orientation
Even if we got the right orientation,
we still have large error
Quoternion Euler6
Pose Synthesis
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 18
Overfitting on sparse trajectory
2017/12/13
Delving deeper into convolutional neural ne
tworks for camera relocalization
19
How to resolve overfitting?
(Hint: 2 methods)
Pose Synthesis
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 20
Overfitting on sparse trajectory
Method
Mutli-task CNN for Camera Relocalization
To quantitatively understand relationship between orientation and translation
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 21
Translation
Rotation
6𝐷𝑜𝐹 = |𝑋, 𝑌, 𝑍, 𝜙, 𝜃, 𝜓|
Intra group correlations
• Orientation:0.391
• Translation:0.293
(self-correlations are not
involved)
Inter group correlations
• 0.256
Method
Mutli-task CNN for Camera Relocalization
Learn from statictic
• In the extreme case, regressing orientation
and translation separately by two individual
networks may also give better results.
High computation cost of individual network
• But regressing orientation and translation
individually significantly increases the
computing cost.
Balance - branching
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 22
translation
orientation
translation
orientation
Method
Summary
A. Orientation Representation
B. Pose Synthesis
C. Mutli-task CNN for Camera Relocalization
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 23
Experiment
Dataset: 7Scenes
• Each sequence (seq-XX.zip) consists of 500-1000 frames
• RGBD: 640x480 => 343x256
• Initial learning rate 10−5
(dropped by 90% every 10000 iter.)
• End iteration at 45000
Hardware
• 2 Nvidia Titan X GPU
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 24
Network design
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 25
Inception network (googlenet)
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 26
Inception network (googlenet)
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 27
Network design
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 28
PoseNet / BranchNet
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 29
Euler6
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 30
Data augmentation
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 31
Data augmentation
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 32
Branch
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 33
Pretrain
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 34
Surprisingly pretain on ImageNet increase error
Did FCN helps?
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 35
Efficiency of the BranchNet
Storing weights took 46 MB for BranchNet-Euler6.
Branching networks slowed down the forward speed from 5ms to 6ms per
frame on a NVIDIA Titan X GPU.
BranchNet-Euler6 in the GPU of an Intel NUC mobile platform (Intel CoreTM
i5-6260U) with clCaffe [24], and reached a speed of 43 fps, which meets the
real-time requirement of many robotic applications.
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 36
Conclusion
CNN-based camera relocalization
1. A new orientation representation Euler6.
2. The pose synthesis for data augmentation.
3. The BranchNet for multi-task regression.
Experiments showed that all of the above techniques improved the
relocalization accuracy, and
they together reduced the error of previous methods by a significant margin.
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 37
Conclusion
• Work well on monocular image => RGBD => SCoRe Forests [2] still
perform better
• They attempted to utilize the depth information by simply add the depth
image as the fourth channel to the original input which has RGB channels
but did not obtain much better results than our current results.
• How to utilize the depth information to improve the performance of CNN
remains to be an
open problem.
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 38
Recap
1. Euler => Quoternion => Euler6
2. Correlation analysis => important for multi-task CNN
3. Separate network / Branching => efficiency
4. Data augmentation (pose synthesis)
5. Do we need FC (or other layer)?
6. Did pretrain data set always help?
2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 39

Contenu connexe

Tendances

A simple framework for contrastive learning of visual representations
A simple framework for contrastive learning of visual representationsA simple framework for contrastive learning of visual representations
A simple framework for contrastive learning of visual representations
Devansh16
 
The Importance of Time in Visual Attention Models
The Importance of Time in Visual Attention ModelsThe Importance of Time in Visual Attention Models
The Importance of Time in Visual Attention Models
Universitat Politècnica de Catalunya
 

Tendances (20)

Enhanced Deep Residual Networks for Single Image Super-Resolution
Enhanced Deep Residual Networks for Single Image Super-ResolutionEnhanced Deep Residual Networks for Single Image Super-Resolution
Enhanced Deep Residual Networks for Single Image Super-Resolution
 
CNNs: from the Basics to Recent Advances
CNNs: from the Basics to Recent AdvancesCNNs: from the Basics to Recent Advances
CNNs: from the Basics to Recent Advances
 
Dd net
Dd netDd net
Dd net
 
(Paper Review) Abnormal Event Detection in Videos using Generative Adversaria...
(Paper Review) Abnormal Event Detection in Videos using Generative Adversaria...(Paper Review) Abnormal Event Detection in Videos using Generative Adversaria...
(Paper Review) Abnormal Event Detection in Videos using Generative Adversaria...
 
Convolutional Neural Network
Convolutional Neural NetworkConvolutional Neural Network
Convolutional Neural Network
 
Survey of Super Resolution Task (SISR Only)
Survey of Super Resolution Task (SISR Only)Survey of Super Resolution Task (SISR Only)
Survey of Super Resolution Task (SISR Only)
 
Convolutional Neural Networks for Computer vision Applications
Convolutional Neural Networks for Computer vision ApplicationsConvolutional Neural Networks for Computer vision Applications
Convolutional Neural Networks for Computer vision Applications
 
(Paper Review) Reconstruction of Monte Carlo Image Sequences using a Recurren...
(Paper Review) Reconstruction of Monte Carlo Image Sequences using a Recurren...(Paper Review) Reconstruction of Monte Carlo Image Sequences using a Recurren...
(Paper Review) Reconstruction of Monte Carlo Image Sequences using a Recurren...
 
Deep Visual Saliency - Kevin McGuinness - UPC Barcelona 2017
Deep Visual Saliency - Kevin McGuinness - UPC Barcelona 2017Deep Visual Saliency - Kevin McGuinness - UPC Barcelona 2017
Deep Visual Saliency - Kevin McGuinness - UPC Barcelona 2017
 
Region-oriented Convolutional Networks for Object Retrieval
Region-oriented Convolutional Networks for Object RetrievalRegion-oriented Convolutional Networks for Object Retrieval
Region-oriented Convolutional Networks for Object Retrieval
 
Object Detection (D2L5 Insight@DCU Machine Learning Workshop 2017)
Object Detection (D2L5 Insight@DCU Machine Learning Workshop 2017)Object Detection (D2L5 Insight@DCU Machine Learning Workshop 2017)
Object Detection (D2L5 Insight@DCU Machine Learning Workshop 2017)
 
A simple framework for contrastive learning of visual representations
A simple framework for contrastive learning of visual representationsA simple framework for contrastive learning of visual representations
A simple framework for contrastive learning of visual representations
 
[PR12] PR-050: Convolutional LSTM Network: A Machine Learning Approach for Pr...
[PR12] PR-050: Convolutional LSTM Network: A Machine Learning Approach for Pr...[PR12] PR-050: Convolutional LSTM Network: A Machine Learning Approach for Pr...
[PR12] PR-050: Convolutional LSTM Network: A Machine Learning Approach for Pr...
 
Deep LearningフレームワークChainerと最近の技術動向
Deep LearningフレームワークChainerと最近の技術動向Deep LearningフレームワークChainerと最近の技術動向
Deep LearningフレームワークChainerと最近の技術動向
 
DLD meetup 2017, Efficient Deep Learning
DLD meetup 2017, Efficient Deep LearningDLD meetup 2017, Efficient Deep Learning
DLD meetup 2017, Efficient Deep Learning
 
The Importance of Time in Visual Attention Models
The Importance of Time in Visual Attention ModelsThe Importance of Time in Visual Attention Models
The Importance of Time in Visual Attention Models
 
Human Behavior Understanding: From Human-Oriented Analysis to Action Recognit...
Human Behavior Understanding: From Human-Oriented Analysis to Action Recognit...Human Behavior Understanding: From Human-Oriented Analysis to Action Recognit...
Human Behavior Understanding: From Human-Oriented Analysis to Action Recognit...
 
A brief introduction to recent segmentation methods
A brief introduction to recent segmentation methodsA brief introduction to recent segmentation methods
A brief introduction to recent segmentation methods
 
(Msc Thesis) Sparse Coral Classification Using Deep Convolutional Neural Netw...
(Msc Thesis) Sparse Coral Classification Using Deep Convolutional Neural Netw...(Msc Thesis) Sparse Coral Classification Using Deep Convolutional Neural Netw...
(Msc Thesis) Sparse Coral Classification Using Deep Convolutional Neural Netw...
 
Medical Imaging at DCU - Kevin McGuinness - UPC Barcelona 2018
Medical Imaging at DCU - Kevin McGuinness - UPC Barcelona 2018Medical Imaging at DCU - Kevin McGuinness - UPC Barcelona 2018
Medical Imaging at DCU - Kevin McGuinness - UPC Barcelona 2018
 

Similaire à (Research Note) Delving deeper into convolutional neural networks for camera relocalization

Transfer Learning and Fine-tuning Deep Neural Networks
 Transfer Learning and Fine-tuning Deep Neural Networks Transfer Learning and Fine-tuning Deep Neural Networks
Transfer Learning and Fine-tuning Deep Neural Networks
PyData
 
Details of Lazy Deep Learning for Images Recognition in ZZ Photo app
Details of Lazy Deep Learning for Images Recognition in ZZ Photo appDetails of Lazy Deep Learning for Images Recognition in ZZ Photo app
Details of Lazy Deep Learning for Images Recognition in ZZ Photo app
PAY2 YOU
 
imageclassification-160206090009.pdf
imageclassification-160206090009.pdfimageclassification-160206090009.pdf
imageclassification-160206090009.pdf
KammetaJoshna
 

Similaire à (Research Note) Delving deeper into convolutional neural networks for camera relocalization (20)

Transfer Learning and Fine-tuning Deep Neural Networks
 Transfer Learning and Fine-tuning Deep Neural Networks Transfer Learning and Fine-tuning Deep Neural Networks
Transfer Learning and Fine-tuning Deep Neural Networks
 
[OSGeo-KR Tech Workshop] Deep Learning for Single Image Super-Resolution
[OSGeo-KR Tech Workshop] Deep Learning for Single Image Super-Resolution[OSGeo-KR Tech Workshop] Deep Learning for Single Image Super-Resolution
[OSGeo-KR Tech Workshop] Deep Learning for Single Image Super-Resolution
 
Convolutional neural network
Convolutional neural network Convolutional neural network
Convolutional neural network
 
Details of Lazy Deep Learning for Images Recognition in ZZ Photo app
Details of Lazy Deep Learning for Images Recognition in ZZ Photo appDetails of Lazy Deep Learning for Images Recognition in ZZ Photo app
Details of Lazy Deep Learning for Images Recognition in ZZ Photo app
 
A presentation on the Convolutional Neural Network (CNN)
A presentation on the Convolutional Neural Network (CNN)A presentation on the Convolutional Neural Network (CNN)
A presentation on the Convolutional Neural Network (CNN)
 
Scene recognition using Convolutional Neural Network
Scene recognition using Convolutional Neural NetworkScene recognition using Convolutional Neural Network
Scene recognition using Convolutional Neural Network
 
Recent advances of AI for medical imaging : Engineering perspectives
Recent advances of AI for medical imaging : Engineering perspectivesRecent advances of AI for medical imaging : Engineering perspectives
Recent advances of AI for medical imaging : Engineering perspectives
 
Surveillance scene classification using machine learning
Surveillance scene classification using machine learningSurveillance scene classification using machine learning
Surveillance scene classification using machine learning
 
最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に - 最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に -
 
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
 
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
 
A Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detectionA Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detection
 
Recent Progress on Object Detection_20170331
Recent Progress on Object Detection_20170331Recent Progress on Object Detection_20170331
Recent Progress on Object Detection_20170331
 
imageclassification-160206090009.pdf
imageclassification-160206090009.pdfimageclassification-160206090009.pdf
imageclassification-160206090009.pdf
 
Small Deep-Neural-Networks: Their Advantages and Their Design
Small Deep-Neural-Networks: Their Advantages and Their DesignSmall Deep-Neural-Networks: Their Advantages and Their Design
Small Deep-Neural-Networks: Their Advantages and Their Design
 
A Survey on Image Processing using CNN in Deep Learning
A Survey on Image Processing using CNN in Deep LearningA Survey on Image Processing using CNN in Deep Learning
A Survey on Image Processing using CNN in Deep Learning
 
kanimozhi2019.pdf
kanimozhi2019.pdfkanimozhi2019.pdf
kanimozhi2019.pdf
 
Obscenity Detection in Images
Obscenity Detection in ImagesObscenity Detection in Images
Obscenity Detection in Images
 
Semantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network ApproachesSemantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network Approaches
 
IRJET- Real-Time Object Detection using Deep Learning: A Survey
IRJET- Real-Time Object Detection using Deep Learning: A SurveyIRJET- Real-Time Object Detection using Deep Learning: A Survey
IRJET- Real-Time Object Detection using Deep Learning: A Survey
 

Dernier

Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Kandungan 087776558899
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
dharasingh5698
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 

Dernier (20)

Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdf
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
 
2016EF22_0 solar project report rooftop projects
2016EF22_0 solar project report rooftop projects2016EF22_0 solar project report rooftop projects
2016EF22_0 solar project report rooftop projects
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
Unit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfUnit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdf
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
 
Unit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfUnit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdf
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
 

(Research Note) Delving deeper into convolutional neural networks for camera relocalization

  • 1. Euler angle and gimbal lock 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 1
  • 2. Euler angle and gimbal lock Loss of a degree of freedom with Euler angles 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 2 When 𝛽 = 𝜋 2 then cos 𝜋 2 = 0 and sin 𝜋 2 = 1
  • 3. Euler angle and gimbal lock 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 3
  • 4. Euler angle and gimbal lock 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 4 Loss of a degree of freedom with Euler angles
  • 5. Resolve gimbal lock (Loss of a degree of freedom ) 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 5 1. Change 𝛽 2. Use different orientation representation => quaternion Rotation don’t commute 𝑅 𝑥 𝑅 𝑦 ≠ 𝑅 𝑦 𝑅 𝑥
  • 6. Quaternion (四元數) The history Complex number 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 6 https://www.youtube.com/watch?v=mHVwd8gYLnI&t=2s Extend Complex number What is 𝑏𝑐 𝑖𝑗 ? How to define 𝑖𝑗 ?
  • 7. Quaternion (四元數) Forget about 𝑖𝑗, how about define another one 𝑘 ? 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 7 https://www.youtube.com/watch?v=mHVwd8gYLnI&t=2s 𝑖 𝑗 𝑘
  • 8. Double cover of quaternion There are two distinct quaternions for each distinct orientation frame in 3D space. The belt trick reflects this double-valued relationship, distinguishing a one-circuit 360-degree rotation from the equivalent two-circuit 720-degree rotation.* 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 8 When applying regression on similar image, we may get distinct quaternions. * Andrew J. Hanson (6 February 2006). Visualizing Quaternions. Elsevier. pp. 114–. ISBN 978-0-08-047477-9.
  • 9. National Chung Cheng University, Taiwan Robot Vision Laboratory 2017/12/03 Jacky Liu (Research Note) Delving deeper into convolutional neural networks for camera relocalization
  • 10. About this work Delving deeper into convolutional neural networks for camera relocalization Wu, Jian1 , Ma, Liwei2 , Hu, Xiaolin1 ICRA2017 - IEEE International Conference on Robotics and Automation 1. Tsinghua National Laboratory for Information Science and Technology (TNList), De- partm ent of Computer Science and Technology, Tsinghua Univer- sity, 100084, Beijing, China 2. Intel Labs China, Intel Corporation, 100090, Beijing, China liwei.ma@intel.com 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 10
  • 11. Contributions 1. Good rotation representation that solve the double cover problem of quaternion (which used by PoseNet) => Euler6 2. Camera poses in training set are always very sparse in the whole pose space. => pose synthesis 3. Regressing orientation & translation together might not be optimal => BranchNet 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 11
  • 12. Related work 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 12 Camera relocalization Keypoints SIFT ORB SCoRe Keyframes G. Klein2008 A. P. Gee2012 However, these methods only provide a coarse estimation to the camera pose because of the sparsity of poses in training set. Camera relocalization Multi-task CNNs
  • 13. Related work Camera relocalization - CNN PoseNet (keyframes-based approach) • Encodes the key frames in training set into the parameters of models. SE3-Net • Point cloud data limits this algorithm to RGB-D • The number of predicted objects must be specified in training 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 13 Camera relocalization Multi-task CNNs
  • 14. Related work 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 14 Multi-task CNNs TCDCN 1. Facial landmark detection 2. Appearance attribute and expression HyperFace 1. Faces detection 2. Localizaing landmarks 3. Head pose 4. Gender • Sharing lower layer for low level common knowledge • Separate higher layer for specific predictions R-CNN 1. Human pose estimation 2. Action detection MCNNs 1. Attribute relationships 2. Attribute classifiers Camera relocalization Multi-task CNNs
  • 15. Related work 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 15 • Sharing lower layer for low level common knowledge • Separate higher layer for specific predictions Camera relocalization Multi-task CNNs Input Task1 Task2
  • 16. Method Summary A. Orientation Representation B. Pose Synthesis C. Mutli-task CNN for Camera Relocalization 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 16
  • 17. Method - Orientation Representation Predict Q = [0,1,0,0] Ground truth Q’ = [0,-1,0,0] 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 17 translation orientation orientation Even if we got the right orientation, we still have large error Quoternion Euler6
  • 18. Pose Synthesis 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 18 Overfitting on sparse trajectory
  • 19. 2017/12/13 Delving deeper into convolutional neural ne tworks for camera relocalization 19 How to resolve overfitting? (Hint: 2 methods)
  • 20. Pose Synthesis 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 20 Overfitting on sparse trajectory
  • 21. Method Mutli-task CNN for Camera Relocalization To quantitatively understand relationship between orientation and translation 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 21 Translation Rotation 6𝐷𝑜𝐹 = |𝑋, 𝑌, 𝑍, 𝜙, 𝜃, 𝜓| Intra group correlations • Orientation:0.391 • Translation:0.293 (self-correlations are not involved) Inter group correlations • 0.256
  • 22. Method Mutli-task CNN for Camera Relocalization Learn from statictic • In the extreme case, regressing orientation and translation separately by two individual networks may also give better results. High computation cost of individual network • But regressing orientation and translation individually significantly increases the computing cost. Balance - branching 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 22 translation orientation translation orientation
  • 23. Method Summary A. Orientation Representation B. Pose Synthesis C. Mutli-task CNN for Camera Relocalization 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 23
  • 24. Experiment Dataset: 7Scenes • Each sequence (seq-XX.zip) consists of 500-1000 frames • RGBD: 640x480 => 343x256 • Initial learning rate 10−5 (dropped by 90% every 10000 iter.) • End iteration at 45000 Hardware • 2 Nvidia Titan X GPU 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 24
  • 25. Network design 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 25
  • 26. Inception network (googlenet) 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 26
  • 27. Inception network (googlenet) 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 27
  • 28. Network design 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 28
  • 29. PoseNet / BranchNet 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 29
  • 30. Euler6 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 30
  • 31. Data augmentation 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 31
  • 32. Data augmentation 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 32
  • 33. Branch 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 33
  • 34. Pretrain 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 34 Surprisingly pretain on ImageNet increase error
  • 35. Did FCN helps? 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 35
  • 36. Efficiency of the BranchNet Storing weights took 46 MB for BranchNet-Euler6. Branching networks slowed down the forward speed from 5ms to 6ms per frame on a NVIDIA Titan X GPU. BranchNet-Euler6 in the GPU of an Intel NUC mobile platform (Intel CoreTM i5-6260U) with clCaffe [24], and reached a speed of 43 fps, which meets the real-time requirement of many robotic applications. 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 36
  • 37. Conclusion CNN-based camera relocalization 1. A new orientation representation Euler6. 2. The pose synthesis for data augmentation. 3. The BranchNet for multi-task regression. Experiments showed that all of the above techniques improved the relocalization accuracy, and they together reduced the error of previous methods by a significant margin. 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 37
  • 38. Conclusion • Work well on monocular image => RGBD => SCoRe Forests [2] still perform better • They attempted to utilize the depth information by simply add the depth image as the fourth channel to the original input which has RGB channels but did not obtain much better results than our current results. • How to utilize the depth information to improve the performance of CNN remains to be an open problem. 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 38
  • 39. Recap 1. Euler => Quoternion => Euler6 2. Correlation analysis => important for multi-task CNN 3. Separate network / Branching => efficiency 4. Data augmentation (pose synthesis) 5. Do we need FC (or other layer)? 6. Did pretrain data set always help? 2017/12/13 Delving deeper into convolutional neural networks for camera relocalization 39