#denatechcon
Building HD Maps with Dashcams
Kosuke Kuzuoka
AI System Group
DeNA Co., Ltd.
#denatechcon
Agenda
• Who I am
• Our Goal
• Intro to DL and SfM
• 3D Point Reconstruction
• Recognizing Objects
• Putting It All Together
#denatechcon
Who I am
• Profile
• Kosuke Kuzuoka
• 22 years old
• Experience
• June 2018 - Present
AI Research Engineer at DeNA Co., Ltd.
• March 2017 - June 2018
R&D manager at CONCORE’S, inc.
• Interests
• Self Driving Cars
• Computer Vision
Facebook: https://www.facebook.com/kousuke.kuzuoka.9
LinkedIn: https://www.linkedin.com/in/kousuke-kuzuoka-4101ba160/
#denatechcon
What I have done before
Detecting objects from construction
plans using deep learning algorithms
Patent-pending algorithm that I
developed for detecting pillars across
multiple tiled images
#denatechcon
Our Goal
● To create high definition maps at
a lower price
● 3D point reconstruction and
object detection in dashcam
images
● No use of expensive equipment,
such as LiDAR
https://medium.com/@surmenok/hd-maps-for-self-driving-cars-c41bc01e0d40
#denatechcon
Isn’t it like Google Maps?
● Google Maps is a map designed for humans
● It has useful information for
humans
● An HD map is a map designed for machines
● It has useful information for cars,
such as where traffic signs are
#denatechcon
Is it for self-driving cars?
● It’s extensively used in self-driving cars,
such as for localization and path planning
● Therefore, the location accuracy of HD
maps needs to be within a few centimeters
● A self-driving car needs to know which
direction the lane is leading, where the
traffic signs are, etc.
https://www.youtube.com/watch?time_continue=207&v=EUq5DlPQdhg
#denatechcon
Introduction to Deep Learning
● The ideas behind deep learning date back to the late 1950s, when Frank Rosenblatt introduced the perceptron.
● The perceptron was able to solve linearly separable problems.
● Later, it turned out that a simple perceptron cannot solve problems that are not linearly
separable, such as XOR.
https://becominghuman.ai/deep-learning-made-easy-with-deep-cognition-403fbe445351
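The perceptron's limitation can be seen in a few lines. The following is a minimal sketch, not code from the talk: the function names, learning rate, and epoch count are illustrative assumptions. It trains a single-layer perceptron on AND (linearly separable) and on XOR (not linearly separable).

```python
def train_perceptron(samples, epochs=20, lr=1.0):
    """Train a single-layer perceptron with the classic error-driven update."""
    w = [0.0, 0.0]
    b = 0.0
    for _ in range(epochs):
        for (x1, x2), target in samples:
            pred = 1 if w[0] * x1 + w[1] * x2 + b > 0 else 0
            err = target - pred          # -1, 0, or +1
            w[0] += lr * err * x1
            w[1] += lr * err * x2
            b += lr * err
    return w, b


def predict(w, b, x):
    """Step activation on the weighted sum."""
    return 1 if w[0] * x[0] + w[1] * x[1] + b > 0 else 0


def accuracy(samples, w, b):
    return sum(predict(w, b, x) == t for x, t in samples) / len(samples)


# Toy truth tables (illustrative data, not from the slides).
AND = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
XOR = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 0)]

and_w, and_b = train_perceptron(AND)
xor_w, xor_b = train_perceptron(XOR)
and_acc = accuracy(AND, and_w, and_b)   # reaches every AND sample
xor_acc = accuracy(XOR, xor_w, xor_b)   # no single line separates XOR
print(and_acc, xor_acc)
```

AND is learned perfectly, while XOR never is, because no single straight line separates its two classes.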
#denatechcon
Why is deep learning popular nowadays?
● Large scale datasets such as ImageNet have been made public for research purposes
● Powerful computing resources such as GPUs are more accessible than ever before
https://en.wikipedia.org/wiki/Nvidia
http://www.image-net.org/
#denatechcon
Okay, but what can you do with DL?
● Using deep learning, we can
solve image classification and
image localization problems
● Image classification identifies
what is in the image, while
image localization identifies
both what it is and where it is
in the image
● Using deep learning, we can
also solve object detection and
instance segmentation
problems
● Object detection finds
multiple objects in the image,
while instance segmentation
also outlines the boundary of
each object
https://medium.com/comet-app/review-of-deep-learning-algorithms-for-object-detection-c1f3d437b852
#denatechcon
Okay, let’s sum that up
• Deep learning is not new
• Data is important for deep learning
• High computational resources are necessary
• You can do so many things with deep learning
#denatechcon
Introduction to SfM
SfM stands for Structure from
Motion. It is an algorithm that
reconstructs 3D points (the
structure) from images taken
from different angles or positions
(the motion). Large-scale
applications include, for example,
reconstructing all of Rome using
only images found on the web.
https://grail.cs.washington.edu/rome/rome_paper.pdf
#denatechcon
How does SfM work?
https://www.mathworks.com/help/vision/ug/structure-from-motion.html
● Extracts features, e.g. corners or
edges, from each image
● Matches the features across images
taken from different positions
● Calculates the 3D coordinates of the
matched points using triangulation
● Estimates the camera positions and
optimizes the reconstructed 3D points
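The triangulation step can be sketched in pure Python. This is an illustrative simplification, not the method used in the talk: it assumes an ideal pinhole camera, two axis-aligned cameras with known centers, and the midpoint method (the point halfway between the two viewing rays at their closest approach). Real SfM pipelines typically use linear triangulation followed by bundle adjustment. All names and numbers are hypothetical.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def pixel_to_ray(u, v, f, cx, cy):
    """Direction of the viewing ray through pixel (u, v) for a pinhole camera
    with focal length f and principal point (cx, cy), in camera coordinates."""
    return ((u - cx) / f, (v - cy) / f, 1.0)


def triangulate_midpoint(c1, d1, c2, d2):
    """Midpoint of the shortest segment between rays c1 + s*d1 and c2 + t*d2."""
    w0 = tuple(p - q for p, q in zip(c1, c2))
    a, b, c = dot(d1, d1), dot(d1, d2), dot(d2, d2)
    d, e = dot(d1, w0), dot(d2, w0)
    denom = a * c - b * b
    if abs(denom) < 1e-12:
        raise ValueError("rays are parallel; no baseline to triangulate from")
    s = (b * e - c * d) / denom
    t = (a * e - b * d) / denom
    p1 = tuple(ci + s * di for ci, di in zip(c1, d1))
    p2 = tuple(ci + t * di for ci, di in zip(c2, d2))
    return tuple((x + y) / 2 for x, y in zip(p1, p2))


# Hypothetical setup: the same feature is matched at pixel (740, 560) in the
# first image and at (540, 560) in the second, taken 2 m further to the right.
f, cx, cy = 1000.0, 640.0, 360.0
cam1, cam2 = (0.0, 0.0, 0.0), (2.0, 0.0, 0.0)
ray1 = pixel_to_ray(740, 560, f, cx, cy)
ray2 = pixel_to_ray(540, 560, f, cx, cy)
point = triangulate_midpoint(cam1, ray1, cam2, ray2)
print(point)
```

With noisy matches the two rays no longer intersect exactly, which is why the final optimization step in the list above matters.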
#denatechcon
What can you do with SfM?
https://grail.cs.washington.edu/rome/rome_paper.pdf
The project built a 3D representation of Rome within a day from images found on the web. It used
150k images, and processing took around 21 hours on 496 CPU cores.
#denatechcon
Let’s sum that up
• SfM can reconstruct 3D shapes from 2D images
• 3D representation of Rome can be built in a day
using images from the web
#denatechcon
So we have tools. What now?
● Dashcam images are used for reconstructing 3D points by SfM
● The same images are used for detecting objects in 2D space
● Both results are integrated to get 3D representations of each object
#denatechcon
3D Point Reconstruction
● Images are taken by driving in the
highlighted region in Minatomirai
● Dashcam images are used for SfM
and object detection
#denatechcon
Overall shape looks good
● 3D reconstruction of a relatively small
region in Minatomirai
● Reconstructed shape matches the
highlighted region in the map
#denatechcon
Slightly larger region, still good
● Red arrows indicate the direction
the car was driving
● The reconstructed shape matches
the highlighted region in the map
#denatechcon
Hooray, view from top is good
● SfM was applied in a larger region
in the Minatomirai area
● Overall shape still matches the map
#denatechcon
What about the closer view?
The details of road markings and speed
limit signs are visible, though some of the
reconstructed information is unnecessary
Lanes are reconstructed well on the left
side, but the center lane markings on
the right are missing. This is caused by
the divider
#denatechcon
Some findings with SfM are:
• Reconstructed 3D points contain small details
• A GPU can reduce the processing time significantly
• The more images, the better the result
#denatechcon
Recognizing Objects
● We chose Faster R-CNN for detecting
traffic signs
● Faster R-CNN was a state-of-the-art
detector in 2016
● Faster R-CNN is more accurate than
real-time detectors, but it’s slower
https://arxiv.org/abs/1506.01497
#denatechcon
Objects are detected correctly
● Most traffic signs are detected correctly, though
one small traffic sign is missed by the detector
● The network predicts the category for each box,
and there are more than 100 categories to choose
from
#denatechcon
Another example for traffic signs
#denatechcon
What now for lane detection?
https://arxiv.org/abs/1802.05591
● We chose LaneNet, published in 2018, as the lane detector
● LaneNet transforms the original image into a bird’s eye image using learned parameters
● It can detect multiple lane instances at real-time speed with high accuracy
#denatechcon
Deep learning can detect lanes!
● Different colors indicate different instances
● You can see that the lanes are detected correctly
● It can detect curved lanes as well, though none
appear in this image
#denatechcon
Another example for lane detection
#denatechcon
What about road markings?
● Bird’s eye transformation is applied
to the original image
● Faster R-CNN is run on the
bird’s eye image
● Inverse transformation maps the
detections on the bird’s eye image
back to the original image
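A bird's eye transformation and its inverse can be expressed as a 3x3 homography. The sketch below is only an illustration under assumptions: the matrix `H` is a made-up example, not the transform used in the talk, and in practice it would come from calibration or be learned. It maps a point into the bird's eye view and then maps the corners of a hypothetical detection box back into the original image.

```python
def apply_homography(H, point):
    """Map a 2D point through a 3x3 homography (with perspective division)."""
    x, y = point
    u = H[0][0] * x + H[0][1] * y + H[0][2]
    v = H[1][0] * x + H[1][1] * y + H[1][2]
    w = H[2][0] * x + H[2][1] * y + H[2][2]
    return (u / w, v / w)


def invert_3x3(M):
    """Invert a 3x3 matrix using the adjugate formula."""
    (a, b, c), (d, e, f), (g, h, i) = M
    det = a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)
    if abs(det) < 1e-12:
        raise ValueError("matrix is singular; the transform cannot be inverted")
    return [
        [(e * i - f * h) / det, (c * h - b * i) / det, (b * f - c * e) / det],
        [(f * g - d * i) / det, (a * i - c * g) / det, (c * d - a * f) / det],
        [(d * h - e * g) / det, (b * g - a * h) / det, (a * e - b * d) / det],
    ]


# Hypothetical homography from the original image to the bird's eye image.
H = [
    [1.0, 0.5, 10.0],
    [0.0, 2.0, 5.0],
    [0.0, 0.002, 1.0],
]
H_inv = invert_3x3(H)

original = (100.0, 200.0)
birds_eye = apply_homography(H, original)
restored = apply_homography(H_inv, birds_eye)

# Corners of a hypothetical box detected in the bird's eye image,
# mapped back into the original image.
box_corners = [(150.0, 280.0), (170.0, 280.0), (170.0, 300.0), (150.0, 300.0)]
corners_in_original = [apply_homography(H_inv, c) for c in box_corners]
print(birds_eye, restored, corners_in_original)
```

Because a rectangle in the bird's eye view generally maps back to a non-rectangular quadrilateral, the restored detection may need to be stored as four corners rather than as an axis-aligned box.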
#denatechcon
Deep learning works for road markings!
● Road markings are detected correctly.
● It distinguishes the lane from the stop sign
● The detected boxes fit the objects, though not perfectly
#denatechcon
Another example for road markings
#denatechcon
The result is impressive
#denatechcon
Objects are detected precisely
#denatechcon
Let’s sum that up
• Traffic sign recognition with more than 100
categories can be solved with deep learning
• Deep learning works well on complicated tasks
such as lane and road marking detection
• The more data, the better the results
#denatechcon
Putting It All Together
● Green points indicate the region used for 3D
reconstruction
● The detection has to be done in frames where
the objects are highlighted in green
#denatechcon
Results are now integrated
We can get a 3D representation of the
detected objects by integrating both
results. The final result looks like the
image above.
#denatechcon
Now, objects are represented in 3D
● Detected traffic signs and road markings are
converted to 3D
● Each object has a 3D representation after
integrating both SfM and object detection results
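The slides do not describe how the two results are combined. One plausible approach, sketched here purely as an assumption, is to project each reconstructed 3D point into the frame where detection was run and give it the label of any 2D box it lands in. The pinhole parameters, point coordinates, box, and label name below are all hypothetical, and the points are assumed to be already expressed in that frame's camera coordinates.

```python
def project(point, f, cx, cy):
    """Project a 3D point in camera coordinates to pixel coordinates.
    Returns None for points behind the camera."""
    X, Y, Z = point
    if Z <= 0:
        return None
    return (f * X / Z + cx, f * Y / Z + cy)


def label_points(points, detections, f, cx, cy):
    """Attach the label of the first detection box containing each projected point."""
    labeled = []
    for p in points:
        uv = project(p, f, cx, cy)
        label = None
        if uv is not None:
            for name, (x0, y0, x1, y1) in detections:
                if x0 <= uv[0] <= x1 and y0 <= uv[1] <= y1:
                    label = name
                    break
        labeled.append((p, label))
    return labeled


# Hypothetical SfM points (camera coordinates, meters) and one 2D detection
# given as (label, (x_min, y_min, x_max, y_max)) in pixels.
points = [(1.0, 2.0, 10.0), (-3.0, 0.0, 10.0), (0.0, 0.0, -5.0)]
detections = [("speed_limit", (700, 520, 780, 600))]

labeled = label_points(points, detections, f=1000.0, cx=640.0, cy=360.0)
labels = [label for _, label in labeled]
print(labels)
```

A real pipeline would also need the camera pose of each frame and some way to handle points that are occluded but still project into a box.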
#denatechcon
We are done!
● Reconstructed 3D view seen from above
● You can see the detected lanes and road
markings now have a 3D representation
#denatechcon
Using this technique, we could:
• Automate the map creation process
• Create HD maps for other services
• Detect changes automatically
#denatechcon
Thanks!
#denatechcon
#denatechcon

Contenu connexe

Tendances

コンピューテーショナルフォトグラフィ
コンピューテーショナルフォトグラフィコンピューテーショナルフォトグラフィ
コンピューテーショナルフォトグラフィNorishige Fukushima
 
【DL輪読会】EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Pointsfor...
【DL輪読会】EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Pointsfor...【DL輪読会】EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Pointsfor...
【DL輪読会】EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Pointsfor...Deep Learning JP
 
三次元点群を取り扱うニューラルネットワークのサーベイ
三次元点群を取り扱うニューラルネットワークのサーベイ三次元点群を取り扱うニューラルネットワークのサーベイ
三次元点群を取り扱うニューラルネットワークのサーベイNaoya Chiba
 
三次元表現まとめ(深層学習を中心に)
三次元表現まとめ(深層学習を中心に)三次元表現まとめ(深層学習を中心に)
三次元表現まとめ(深層学習を中心に)Tomohiro Motoda
 
【論文紹介】U-GAT-IT
【論文紹介】U-GAT-IT【論文紹介】U-GAT-IT
【論文紹介】U-GAT-ITmeownoisy
 
ガイデットフィルタとその周辺
ガイデットフィルタとその周辺ガイデットフィルタとその周辺
ガイデットフィルタとその周辺Norishige Fukushima
 
[DLHacks 実装] DeepPose: Human Pose Estimation via Deep Neural Networks
[DLHacks 実装] DeepPose: Human Pose Estimation via Deep Neural Networks[DLHacks 実装] DeepPose: Human Pose Estimation via Deep Neural Networks
[DLHacks 実装] DeepPose: Human Pose Estimation via Deep Neural NetworksDeep Learning JP
 
SSII2019企画: 点群深層学習の研究動向
SSII2019企画: 点群深層学習の研究動向SSII2019企画: 点群深層学習の研究動向
SSII2019企画: 点群深層学習の研究動向SSII
 
20160724_cv_sfm_revisited
20160724_cv_sfm_revisited20160724_cv_sfm_revisited
20160724_cv_sfm_revisitedKyohei Unno
 
画像処理ライブラリ OpenCV で 出来ること・出来ないこと
画像処理ライブラリ OpenCV で 出来ること・出来ないこと画像処理ライブラリ OpenCV で 出来ること・出来ないこと
画像処理ライブラリ OpenCV で 出来ること・出来ないことNorishige Fukushima
 
確率モデルを用いた3D点群レジストレーション
確率モデルを用いた3D点群レジストレーション確率モデルを用いた3D点群レジストレーション
確率モデルを用いた3D点群レジストレーションKenta Tanaka
 
Swin Transformer (ICCV'21 Best Paper) を完璧に理解する資料
Swin Transformer (ICCV'21 Best Paper) を完璧に理解する資料Swin Transformer (ICCV'21 Best Paper) を完璧に理解する資料
Swin Transformer (ICCV'21 Best Paper) を完璧に理解する資料Yusuke Uchida
 
[DL輪読会]Neural Radiance Flow for 4D View Synthesis and Video Processing (NeRF...
[DL輪読会]Neural Radiance Flow for 4D View Synthesis and Video  Processing (NeRF...[DL輪読会]Neural Radiance Flow for 4D View Synthesis and Video  Processing (NeRF...
[DL輪読会]Neural Radiance Flow for 4D View Synthesis and Video Processing (NeRF...Deep Learning JP
 
[DL輪読会]End-to-End Object Detection with Transformers
[DL輪読会]End-to-End Object Detection with Transformers[DL輪読会]End-to-End Object Detection with Transformers
[DL輪読会]End-to-End Object Detection with TransformersDeep Learning JP
 
30th コンピュータビジョン勉強会@関東 DynamicFusion
30th コンピュータビジョン勉強会@関東 DynamicFusion30th コンピュータビジョン勉強会@関東 DynamicFusion
30th コンピュータビジョン勉強会@関東 DynamicFusionHiroki Mizuno
 
You Only Look One-level Featureの解説と見せかけた物体検出のよもやま話
You Only Look One-level Featureの解説と見せかけた物体検出のよもやま話You Only Look One-level Featureの解説と見せかけた物体検出のよもやま話
You Only Look One-level Featureの解説と見せかけた物体検出のよもやま話Yusuke Uchida
 
近年のHierarchical Vision Transformer
近年のHierarchical Vision Transformer近年のHierarchical Vision Transformer
近年のHierarchical Vision TransformerYusuke Uchida
 
SSII2019TS: 実践カメラキャリブレーション ~カメラを用いた実世界計測の基礎と応用~
SSII2019TS: 実践カメラキャリブレーション ~カメラを用いた実世界計測の基礎と応用~SSII2019TS: 実践カメラキャリブレーション ~カメラを用いた実世界計測の基礎と応用~
SSII2019TS: 実践カメラキャリブレーション ~カメラを用いた実世界計測の基礎と応用~SSII
 
[DL輪読会]Learning Transferable Visual Models From Natural Language Supervision
[DL輪読会]Learning Transferable Visual Models From Natural Language Supervision[DL輪読会]Learning Transferable Visual Models From Natural Language Supervision
[DL輪読会]Learning Transferable Visual Models From Natural Language SupervisionDeep Learning JP
 

Tendances (20)

コンピューテーショナルフォトグラフィ
コンピューテーショナルフォトグラフィコンピューテーショナルフォトグラフィ
コンピューテーショナルフォトグラフィ
 
【DL輪読会】EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Pointsfor...
【DL輪読会】EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Pointsfor...【DL輪読会】EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Pointsfor...
【DL輪読会】EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Pointsfor...
 
三次元点群を取り扱うニューラルネットワークのサーベイ
三次元点群を取り扱うニューラルネットワークのサーベイ三次元点群を取り扱うニューラルネットワークのサーベイ
三次元点群を取り扱うニューラルネットワークのサーベイ
 
三次元表現まとめ(深層学習を中心に)
三次元表現まとめ(深層学習を中心に)三次元表現まとめ(深層学習を中心に)
三次元表現まとめ(深層学習を中心に)
 
【論文紹介】U-GAT-IT
【論文紹介】U-GAT-IT【論文紹介】U-GAT-IT
【論文紹介】U-GAT-IT
 
ガイデットフィルタとその周辺
ガイデットフィルタとその周辺ガイデットフィルタとその周辺
ガイデットフィルタとその周辺
 
[DLHacks 実装] DeepPose: Human Pose Estimation via Deep Neural Networks
[DLHacks 実装] DeepPose: Human Pose Estimation via Deep Neural Networks[DLHacks 実装] DeepPose: Human Pose Estimation via Deep Neural Networks
[DLHacks 実装] DeepPose: Human Pose Estimation via Deep Neural Networks
 
SSII2019企画: 点群深層学習の研究動向
SSII2019企画: 点群深層学習の研究動向SSII2019企画: 点群深層学習の研究動向
SSII2019企画: 点群深層学習の研究動向
 
20160724_cv_sfm_revisited
20160724_cv_sfm_revisited20160724_cv_sfm_revisited
20160724_cv_sfm_revisited
 
Depth Estimation論文紹介
Depth Estimation論文紹介Depth Estimation論文紹介
Depth Estimation論文紹介
 
画像処理ライブラリ OpenCV で 出来ること・出来ないこと
画像処理ライブラリ OpenCV で 出来ること・出来ないこと画像処理ライブラリ OpenCV で 出来ること・出来ないこと
画像処理ライブラリ OpenCV で 出来ること・出来ないこと
 
確率モデルを用いた3D点群レジストレーション
確率モデルを用いた3D点群レジストレーション確率モデルを用いた3D点群レジストレーション
確率モデルを用いた3D点群レジストレーション
 
Swin Transformer (ICCV'21 Best Paper) を完璧に理解する資料
Swin Transformer (ICCV'21 Best Paper) を完璧に理解する資料Swin Transformer (ICCV'21 Best Paper) を完璧に理解する資料
Swin Transformer (ICCV'21 Best Paper) を完璧に理解する資料
 
[DL輪読会]Neural Radiance Flow for 4D View Synthesis and Video Processing (NeRF...
[DL輪読会]Neural Radiance Flow for 4D View Synthesis and Video  Processing (NeRF...[DL輪読会]Neural Radiance Flow for 4D View Synthesis and Video  Processing (NeRF...
[DL輪読会]Neural Radiance Flow for 4D View Synthesis and Video Processing (NeRF...
 
[DL輪読会]End-to-End Object Detection with Transformers
[DL輪読会]End-to-End Object Detection with Transformers[DL輪読会]End-to-End Object Detection with Transformers
[DL輪読会]End-to-End Object Detection with Transformers
 
30th コンピュータビジョン勉強会@関東 DynamicFusion
30th コンピュータビジョン勉強会@関東 DynamicFusion30th コンピュータビジョン勉強会@関東 DynamicFusion
30th コンピュータビジョン勉強会@関東 DynamicFusion
 
You Only Look One-level Featureの解説と見せかけた物体検出のよもやま話
You Only Look One-level Featureの解説と見せかけた物体検出のよもやま話You Only Look One-level Featureの解説と見せかけた物体検出のよもやま話
You Only Look One-level Featureの解説と見せかけた物体検出のよもやま話
 
近年のHierarchical Vision Transformer
近年のHierarchical Vision Transformer近年のHierarchical Vision Transformer
近年のHierarchical Vision Transformer
 
SSII2019TS: 実践カメラキャリブレーション ~カメラを用いた実世界計測の基礎と応用~
SSII2019TS: 実践カメラキャリブレーション ~カメラを用いた実世界計測の基礎と応用~SSII2019TS: 実践カメラキャリブレーション ~カメラを用いた実世界計測の基礎と応用~
SSII2019TS: 実践カメラキャリブレーション ~カメラを用いた実世界計測の基礎と応用~
 
[DL輪読会]Learning Transferable Visual Models From Natural Language Supervision
[DL輪読会]Learning Transferable Visual Models From Natural Language Supervision[DL輪読会]Learning Transferable Visual Models From Natural Language Supervision
[DL輪読会]Learning Transferable Visual Models From Natural Language Supervision
 

Similaire à Building HD maps with dashcams

Synthetic Data and Graphics Techniques in Robotics
Synthetic Data and Graphics Techniques in RoboticsSynthetic Data and Graphics Techniques in Robotics
Synthetic Data and Graphics Techniques in RoboticsPrabindh Sundareson
 
Introduction to 3D Computer Vision and Differentiable Rendering
Introduction to 3D Computer Vision and Differentiable RenderingIntroduction to 3D Computer Vision and Differentiable Rendering
Introduction to 3D Computer Vision and Differentiable RenderingPreferred Networks
 
2 D3 D Concersion Swaggmedia
2 D3 D Concersion   Swaggmedia2 D3 D Concersion   Swaggmedia
2 D3 D Concersion SwaggmediaCraig Nobles
 
What is point cloud annotation?
What is point cloud annotation?What is point cloud annotation?
What is point cloud annotation?Annotation Support
 
3D Laser Scanning for Oil & Gas Facilities
3D Laser Scanning for Oil & Gas Facilities3D Laser Scanning for Oil & Gas Facilities
3D Laser Scanning for Oil & Gas FacilitiesYasser Eldegwy
 
Can We Make Maps from Videos? ~From AI Algorithm to Engineering for Continuou...
Can We Make Maps from Videos? ~From AI Algorithm to Engineering for Continuou...Can We Make Maps from Videos? ~From AI Algorithm to Engineering for Continuou...
Can We Make Maps from Videos? ~From AI Algorithm to Engineering for Continuou...DeNA
 
From 2D Map to Mobile 3D Mirror World
From 2D Map to Mobile 3D Mirror WorldFrom 2D Map to Mobile 3D Mirror World
From 2D Map to Mobile 3D Mirror WorldYu You
 
Mirko Lucchese - Deep Image Processing
Mirko Lucchese - Deep Image ProcessingMirko Lucchese - Deep Image Processing
Mirko Lucchese - Deep Image ProcessingMeetupDataScienceRoma
 
Deep Learning Projects - Anomaly Detection Using Deep Learning
Deep Learning Projects - Anomaly Detection Using Deep LearningDeep Learning Projects - Anomaly Detection Using Deep Learning
Deep Learning Projects - Anomaly Detection Using Deep LearningDezyreAcademy
 
Driving Assistant Solutions with Android
Driving Assistant Solutions with AndroidDriving Assistant Solutions with Android
Driving Assistant Solutions with AndroidGiorgio Natili
 
“Accident Reconstruction” by Aleksis Liekna from Scope Technologies at Auto f...
“Accident Reconstruction” by Aleksis Liekna from Scope Technologies at Auto f...“Accident Reconstruction” by Aleksis Liekna from Scope Technologies at Auto f...
“Accident Reconstruction” by Aleksis Liekna from Scope Technologies at Auto f...DevClub_lv
 
CNTK Object Detection
CNTK Object DetectionCNTK Object Detection
CNTK Object DetectionAndy Huang
 
Enhanced real time semantic segmentation
Enhanced real time semantic segmentationEnhanced real time semantic segmentation
Enhanced real time semantic segmentationAkankshaRawat42
 
Mi 291 chapter 3 (reverse engineering)(1)
Mi 291 chapter 3 (reverse engineering)(1)Mi 291 chapter 3 (reverse engineering)(1)
Mi 291 chapter 3 (reverse engineering)(1)varun teja G.V.V
 
detailed experience
detailed experiencedetailed experience
detailed experienceBryan Yan
 
Desktop Softwares for Unmanned Aerial Systems(UAS))
Desktop Softwares for Unmanned Aerial Systems(UAS))Desktop Softwares for Unmanned Aerial Systems(UAS))
Desktop Softwares for Unmanned Aerial Systems(UAS))Kamal Shahi
 
Dataset creation for Deep Learning-based Geometric Computer Vision problems
Dataset creation for Deep Learning-based Geometric Computer Vision problemsDataset creation for Deep Learning-based Geometric Computer Vision problems
Dataset creation for Deep Learning-based Geometric Computer Vision problemsPetteriTeikariPhD
 

Similaire à Building HD maps with dashcams (20)

Synthetic Data and Graphics Techniques in Robotics
Synthetic Data and Graphics Techniques in RoboticsSynthetic Data and Graphics Techniques in Robotics
Synthetic Data and Graphics Techniques in Robotics
 
Introduction to 3D Computer Vision and Differentiable Rendering
Introduction to 3D Computer Vision and Differentiable RenderingIntroduction to 3D Computer Vision and Differentiable Rendering
Introduction to 3D Computer Vision and Differentiable Rendering
 
2 D3 D Concersion Swaggmedia
2 D3 D Concersion   Swaggmedia2 D3 D Concersion   Swaggmedia
2 D3 D Concersion Swaggmedia
 
What is point cloud annotation?
What is point cloud annotation?What is point cloud annotation?
What is point cloud annotation?
 
3D Laser Scanning for Oil & Gas Facilities
3D Laser Scanning for Oil & Gas Facilities3D Laser Scanning for Oil & Gas Facilities
3D Laser Scanning for Oil & Gas Facilities
 
Photomodeler
PhotomodelerPhotomodeler
Photomodeler
 
Can We Make Maps from Videos? ~From AI Algorithm to Engineering for Continuou...
Can We Make Maps from Videos? ~From AI Algorithm to Engineering for Continuou...Can We Make Maps from Videos? ~From AI Algorithm to Engineering for Continuou...
Can We Make Maps from Videos? ~From AI Algorithm to Engineering for Continuou...
 
From 2D Map to Mobile 3D Mirror World
From 2D Map to Mobile 3D Mirror WorldFrom 2D Map to Mobile 3D Mirror World
From 2D Map to Mobile 3D Mirror World
 
Mirko Lucchese - Deep Image Processing
Mirko Lucchese - Deep Image ProcessingMirko Lucchese - Deep Image Processing
Mirko Lucchese - Deep Image Processing
 
Deep Learning Projects - Anomaly Detection Using Deep Learning
Deep Learning Projects - Anomaly Detection Using Deep LearningDeep Learning Projects - Anomaly Detection Using Deep Learning
Deep Learning Projects - Anomaly Detection Using Deep Learning
 
Driving Assistant Solutions with Android
Driving Assistant Solutions with AndroidDriving Assistant Solutions with Android
Driving Assistant Solutions with Android
 
“Accident Reconstruction” by Aleksis Liekna from Scope Technologies at Auto f...
“Accident Reconstruction” by Aleksis Liekna from Scope Technologies at Auto f...“Accident Reconstruction” by Aleksis Liekna from Scope Technologies at Auto f...
“Accident Reconstruction” by Aleksis Liekna from Scope Technologies at Auto f...
 
CNTK Object Detection
CNTK Object DetectionCNTK Object Detection
CNTK Object Detection
 
UI and UX for Mobile Developers
UI and UX for Mobile DevelopersUI and UX for Mobile Developers
UI and UX for Mobile Developers
 
Enhanced real time semantic segmentation
Enhanced real time semantic segmentationEnhanced real time semantic segmentation
Enhanced real time semantic segmentation
 
Mi 291 chapter 3 (reverse engineering)(1)
Mi 291 chapter 3 (reverse engineering)(1)Mi 291 chapter 3 (reverse engineering)(1)
Mi 291 chapter 3 (reverse engineering)(1)
 
Career portfolio
Career portfolioCareer portfolio
Career portfolio
 
detailed experience
detailed experiencedetailed experience
detailed experience
 
Desktop Softwares for Unmanned Aerial Systems(UAS))
Desktop Softwares for Unmanned Aerial Systems(UAS))Desktop Softwares for Unmanned Aerial Systems(UAS))
Desktop Softwares for Unmanned Aerial Systems(UAS))
 
Dataset creation for Deep Learning-based Geometric Computer Vision problems
Dataset creation for Deep Learning-based Geometric Computer Vision problemsDataset creation for Deep Learning-based Geometric Computer Vision problems
Dataset creation for Deep Learning-based Geometric Computer Vision problems
 

Dernier

VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...SUHANI PANDEY
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...tanu pandey
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
notes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptnotes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptMsecMca
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptDineshKumar4165
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdfankushspencer015
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VDineshKumar4165
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...Call Girls in Nagpur High Profile
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXssuser89054b
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfKamal Acharya
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . pptDineshKumar4165
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueBhangaleSonal
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdfSuman Jyoti
 

Dernier (20)

NFPA 5000 2024 standard .
NFPA 5000 2024 standard                                  .NFPA 5000 2024 standard                                  .
NFPA 5000 2024 standard .
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
notes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptnotes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.ppt
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
 

Building HD maps with dashcams

  • 1. #denatechcon #denatechcon Building HD Maps with Dashcams Kosuke Kuzuoka AI System Group DeNA Co., Ltd.
  • 2. Agenda • Who I am • Our Goal • Intro to DL and SfM • 3D Point Reconstruction • Recognizing Objects • Putting It All Together
  • 3. Who I am • Profile • Kosuke Kuzuoka • 22 years old • Experience • June 2018 - Present: AI Research Engineer at DeNA Co., Ltd. • March 2017 - June 2018: R&D manager at CONCORE’S, inc. • Interests • Self Driving Cars • Computer Vision Facebook: https://www.facebook.com/kousuke.kuzuoka.9 LinkedIn: https://www.linkedin.com/in/kousuke-kuzuoka-4101ba160/
  • 4. What I have done before ● Detecting objects from construction plans using deep learning algorithms ● Patent-pending algorithm that I developed for detecting pillars across multiple tiled images
  • 5. Our Goal ● To create high definition maps at a lower price ● 3D point reconstruction and object detection in dashcam images ● No use of expensive equipment, such as LiDAR https://medium.com/@surmenok/hd-maps-for-self-driving-cars-c41bc01e0d40
  • 6. Isn’t it like Google Maps? ● Google Maps is a map designed for humans ● It has useful information for humans ● An HD map is a map designed for machines ● It has useful information for cars, such as where traffic signs exist
  • 7. Is it for self-driving cars? ● HD maps are used extensively in self-driving cars, for example for localization and path planning ● Therefore, the location accuracy of HD maps needs to be within a few centimeters ● A self-driving car needs to know which direction the lane is leading, where the traffic signs are, etc. https://www.youtube.com/watch?time_continue=207&v=EUq5DlPQdhg
  • 8. Introduction to Deep Learning ● The idea behind deep learning has existed since the late 1950s, when it was invented by Frank Rosenblatt. ● It was originally called the Perceptron, and it was able to solve linearly separable problems. ● Later, it turned out that a simple Perceptron wasn’t able to solve non-linearly separable problems. https://becominghuman.ai/deep-learning-made-easy-with-deep-cognition-403fbe445351
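The limitation mentioned on this slide can be reproduced in a few lines. The following is a minimal sketch, not code from the talk: a single perceptron trained with the classic update rule learns AND (linearly separable) but cannot fit XOR (not linearly separable). The function names, learning rate, and epoch count are illustrative assumptions.

```python
# Minimal perceptron sketch (illustrative; names and hyperparameters are assumptions).

AND_SAMPLES = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
XOR_SAMPLES = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 0)]


def predict(w, b, x):
    """Step activation on a weighted sum of the two inputs."""
    return 1 if w[0] * x[0] + w[1] * x[1] + b > 0 else 0


def train_perceptron(samples, epochs=20, lr=1.0):
    """Train with the perceptron rule; stop early once an epoch has no errors."""
    w = [0.0, 0.0]
    b = 0.0
    for _ in range(epochs):
        errors = 0
        for x, target in samples:
            delta = target - predict(w, b, x)
            if delta != 0:
                # Move the decision boundary toward the misclassified sample.
                w[0] += lr * delta * x[0]
                w[1] += lr * delta * x[1]
                b += lr * delta
                errors += 1
        if errors == 0:
            break
    return w, b


def accuracy(samples, w, b):
    """Fraction of samples classified correctly."""
    correct = sum(1 for x, target in samples if predict(w, b, x) == target)
    return correct / len(samples)


if __name__ == "__main__":
    for name, samples in (("AND", AND_SAMPLES), ("XOR", XOR_SAMPLES)):
        w, b = train_perceptron(samples)
        print(name, "accuracy:", accuracy(samples, w, b))
```

No single straight line separates the XOR classes, so the second run can never reach full accuracy regardless of how long it trains.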
  • 9. Why is deep learning popular nowadays? ● Large-scale datasets such as ImageNet have been made public for research purposes ● High computational resources such as GPUs are more accessible than ever before https://en.wikipedia.org/wiki/Nvidia http://www.image-net.org/
  • 10. Okay, but what can you do with DL? ● Using deep learning, we can solve image classification and image localization problems ● Image classification identifies what is in the image, while image localization identifies both what is in the image and where it is ● Using deep learning, we can also solve object detection and instance segmentation problems ● Object detection detects multiple objects in the image, while instance segmentation segments object boundaries https://medium.com/comet-app/review-of-deep-learning-algorithms-for-object-detection-c1f3d437b852
  • 11. Okay, let’s sum that up • Deep learning is not new • Data is important for deep learning • High computational resources are necessary • You can do so many things with deep learning
  • 12. Introduction to SfM SfM stands for Structure from Motion, an algorithm that reconstructs 3D points (called structure) from images taken from different angles or positions (called motion). Large-scale applications include, for example, reconstructing all of Rome using only images found on the web. https://grail.cs.washington.edu/rome/rome_paper.pdf
  • 13. How does SfM work? https://www.mathworks.com/help/vision/ug/structure-from-motion.html ● Extracts features from images, e.g. corners or edges ● Matches the features across images taken from different positions ● Calculates the corresponding points in 3D coordinates using triangulation ● Calculates camera positions and optimizes the reconstructed 3D points
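The triangulation step above can be illustrated with a small sketch. It is not the pipeline used in the talk: it assumes two pinhole cameras with known positions, identical orientation, and hypothetical intrinsics (`f`, `cx`, `cy`), and it uses simple midpoint triangulation of two viewing rays. A real SfM system also estimates the camera poses and refines everything with bundle adjustment.

```python
# Midpoint triangulation sketch (illustrative; camera model and values are assumptions).


def project(point, cam_center, f, cx, cy):
    """Project a 3D point into a pinhole camera looking along +z (no rotation)."""
    x = point[0] - cam_center[0]
    y = point[1] - cam_center[1]
    z = point[2] - cam_center[2]
    return (f * x / z + cx, f * y / z + cy)


def pixel_to_ray(u, v, f, cx, cy):
    """Back-project a pixel to a viewing-ray direction in camera coordinates."""
    return ((u - cx) / f, (v - cy) / f, 1.0)


def dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def triangulate_midpoint(c1, d1, c2, d2):
    """Return the midpoint of the shortest segment between rays c1+t1*d1 and c2+t2*d2."""
    w = (c1[0] - c2[0], c1[1] - c2[1], c1[2] - c2[2])
    a, b, c = dot(d1, d1), dot(d1, d2), dot(d2, d2)
    d, e = dot(d1, w), dot(d2, w)
    denom = a * c - b * b
    if abs(denom) < 1e-12:
        raise ValueError("rays are parallel; the point cannot be triangulated")
    t1 = (b * e - c * d) / denom
    t2 = (a * e - b * d) / denom
    p1 = [c1[i] + t1 * d1[i] for i in range(3)]
    p2 = [c2[i] + t2 * d2[i] for i in range(3)]
    return tuple((p1[i] + p2[i]) / 2.0 for i in range(3))


if __name__ == "__main__":
    f, cx, cy = 1000.0, 640.0, 360.0          # hypothetical intrinsics
    cam1, cam2 = (0.0, 0.0, 0.0), (1.0, 0.0, 0.0)  # the car moved 1 m sideways
    true_point = (1.0, 2.0, 10.0)
    ray1 = pixel_to_ray(*project(true_point, cam1, f, cx, cy), f, cx, cy)
    ray2 = pixel_to_ray(*project(true_point, cam2, f, cx, cy), f, cx, cy)
    print(triangulate_midpoint(cam1, ray1, cam2, ray2))
```

With noise-free matches the two rays intersect and the midpoint coincides with the true point; with noisy matches the rays miss each other slightly, which is why the final optimization step is needed.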
  • 14. What can you do with SfM? https://grail.cs.washington.edu/rome/rome_paper.pdf The system described in the paper built a 3D representation of Rome within a day from images found on the web. It used 150k images, and the processing time was around 21 hours using 496 CPU cores.
  • 15. Let’s sum that up • SfM can reconstruct 3D shapes from 2D images • A 3D representation of Rome can be built in a day using images from the web
  • 16. So we have tools. What now? ● Dashcam images are used for reconstructing 3D points by SfM ● The same images are used for detecting objects in 2D space ● Both results are integrated to get 3D representations of each object
  • 17. 3D Point Reconstruction ● Images are taken by driving in the highlighted region in Minatomirai ● Dashcam images are used for SfM and object detection
  • 18. Overall shape looks good ● 3D modeling in a relatively small region in Minatomirai ● The reconstructed shape matches the highlighted region in the map
  • 19. Slightly larger region, still good ● Red arrows indicate the direction the car was driving ● The reconstructed shape matches the highlighted region in the map
  • 20. Hooray, the view from the top is good ● SfM was applied in a larger region in the Minatomirai area ● The overall shape still matches the map
  • 21. What about the closer view? ● Details of road markings and speed limit signs can be seen, though some of the reconstructed information is unnecessary ● Lanes are reconstructed well on the left side, but the center lane markings on the right are missing. This is caused by the divider
  • 22. Some findings with SfM are: • Reconstructed 3D points contain small details • GPU can reduce the processing time significantly • The more images, the better the result
  • 23. Recognizing Objects ● We chose Faster R-CNN for detecting traffic signs ● Faster R-CNN was a state-of-the-art detector in 2016 ● Faster R-CNN is more accurate than real-time detectors, but it is slower https://arxiv.org/abs/1506.01497
  • 24. Objects are detected correctly ● Most traffic signs are detected correctly, though one small traffic sign is missed by the detector ● The network predicts the category for each box, and there are more than 100 categories to choose from
  • 26. What now for lane detection? https://arxiv.org/abs/1802.05591 ● We chose LaneNet, published in 2018, as the lane detector ● LaneNet transforms an original image to a bird’s eye image with learned parameters ● It can detect multiple lane instances in real time with high accuracy
  • 27. Deep learning can detect lanes! ● Different colors indicate different instances ● You can see that the lanes are detected correctly ● It can detect curved lanes as well, though none appear in this image
  • 29. What about road markings? ● Bird’s eye transformation on original image ● Inverse transformation on bird’s eye image ● Faster R-CNN on bird’s eye image
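A bird's eye transformation and its inverse are commonly modeled as a 3x3 homography applied to pixel coordinates. The sketch below assumes that model and uses a made-up matrix `H`; the talk does not state how the transformation was obtained, so every numeric value here is hypothetical. It maps an image point into the bird's eye view and back again.

```python
# Homography round-trip sketch (illustrative; H is a made-up matrix, not from the talk).


def apply_homography(H, pt):
    """Map a 2D point through a 3x3 homography using homogeneous coordinates."""
    x, y = pt
    xs = H[0][0] * x + H[0][1] * y + H[0][2]
    ys = H[1][0] * x + H[1][1] * y + H[1][2]
    w = H[2][0] * x + H[2][1] * y + H[2][2]
    return (xs / w, ys / w)


def invert_3x3(M):
    """Invert a 3x3 matrix with the adjugate formula."""
    (a, b, c), (d, e, f), (g, h, i) = M
    det = a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)
    if abs(det) < 1e-12:
        raise ValueError("matrix is singular; the transformation has no inverse")
    return [
        [(e * i - f * h) / det, (c * h - b * i) / det, (b * f - c * e) / det],
        [(f * g - d * i) / det, (a * i - c * g) / det, (c * d - a * f) / det],
        [(d * h - e * g) / det, (b * g - a * h) / det, (a * e - b * d) / det],
    ]


# Hypothetical image-to-bird's-eye homography.
H = [
    [1.0, 0.5, 0.0],
    [0.0, 2.0, 0.0],
    [0.0, 0.002, 1.0],
]

if __name__ == "__main__":
    image_point = (320.0, 400.0)
    birds_eye_point = apply_homography(H, image_point)
    restored = apply_homography(invert_3x3(H), birds_eye_point)
    print(birds_eye_point, restored)
```

Under this assumption, a box detected on the bird's eye image could be mapped back to the original frame by passing its corners through the inverse homography.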
  • 30. Deep learning works for road markings! ● Road markings are detected correctly. ● It distinguishes the lane from the stop sign ● The detection fits objects, though not perfectly
  • 34. Let’s sum that up • Traffic sign recognition with more than 100 categories can be solved with deep learning • Deep learning works well on complicated tasks such as lane and road marking detection • The more data, the better the results
  • 35. Putting It All Together ● Green points indicate the region used for 3D reconstruction ● The detection has to be done in frames where the objects are highlighted in green
  • 36. Results are now integrated We can get a 3D representation of detected objects by integrating both results. The final result will look like the image above.
  • 37. Now, objects are represented in 3D ● Detected traffic signs and road markings are converted to 3D ● Each object has a 3D representation after integrating both SfM and object detection results
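The slides do not say how the two results are integrated. One plausible approach, sketched here purely as an assumption, is to project the SfM points into the frame where an object was detected, keep the points that fall inside the 2D bounding box, and use their centroid as the object's 3D position. The function names, intrinsics, and sample values are all hypothetical.

```python
# Integration sketch (an assumed approach, not the method described in the talk).


def project(point, f, cx, cy):
    """Project a 3D point given in camera coordinates to pixel coordinates."""
    x, y, z = point
    return (f * x / z + cx, f * y / z + cy)


def locate_detections(points_cam, detections, f, cx, cy):
    """Return (label, 3D centroid) for each detection box that contains SfM points.

    points_cam: 3D points in the camera frame of the detection image.
    detections: list of (label, (x_min, y_min, x_max, y_max)) in pixels.
    """
    located = []
    for label, (x0, y0, x1, y1) in detections:
        inside = []
        for p in points_cam:
            if p[2] <= 0:
                continue  # points behind the camera are not visible
            u, v = project(p, f, cx, cy)
            if x0 <= u <= x1 and y0 <= v <= y1:
                inside.append(p)
        if inside:
            n = len(inside)
            centroid = tuple(sum(p[i] for p in inside) / n for i in range(3))
            located.append((label, centroid))
    return located


if __name__ == "__main__":
    f, cx, cy = 1000.0, 640.0, 360.0  # hypothetical intrinsics
    points = [(0.0, 0.0, 10.0), (0.2, 0.0, 10.0), (5.0, 0.0, 10.0)]
    boxes = [("speed_limit", (600.0, 300.0, 700.0, 400.0))]
    print(locate_detections(points, boxes, f, cx, cy))
```

A centroid is sensitive to background points that happen to project into the box, so a median or a depth-based filter may be a more robust choice in practice.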
  • 38. We are done! ● Reconstructed 3D view, seen from the top ● You can see the detected lanes and road markings now have a 3D representation
  • 39. Using this technique, we could: • Automate the map creation process • Create HD maps for other services • Detect changes automatically