SlideShare une entreprise Scribd logo
1  sur  28
3D Pose Estimation for
Transparent Objects
Presenter: 賴柏任
Advisor:羅仁權教授
05.20.2015
Motivation
• Transparent objects are everywhere
• If we know he pose, we can grasp it!
2
Problems
3
Color of
transparent
object changes
Hard to locate
transparent
objects
Edge of
transparent objects
are blur
Hard to estimate
pose of
transparent objects
Effective cure
4
Color of
transparent
object changes
Edge of
transparent
objects are blur
Kinect v.s. Color changes
• Transparent objects produce NaN in
depth map
5
Ref: I. Lysenkov and V. Rabaud, "Pose estimation of rigid transparent objects in
transparent clutter," in Robotics and Automation (ICRA), 2013 IEEE International
Conference on, 2013, pp. 162-169.
Graphcut v.s. Blur edge
• Given foreground & background clue
6
Ref: C. Rother, V. Kolmogorov, and A. Blake, "Grabcut: Interactive foreground
extraction using iterated graph cuts," ACM Transactions on Graphics (TOG), vol.
23, pp. 309-314, 2004.
Graphcut v.s. Blur edge
• Generate the prob. distribution
7
Ref: C. Rother, V. Kolmogorov, and A. Blake, "Grabcut: Interactive foreground
extraction using iterated graph cuts," ACM Transactions on Graphics (TOG), vol.
23, pp. 309-314, 2004.
Graphcut v.s. Blur edge
• Use distance to compensate
8
Ref: C. Rother, V. Kolmogorov, and A. Blake, "Grabcut: Interactive foreground
extraction using iterated graph cuts," ACM Transactions on Graphics (TOG), vol.
23, pp. 309-314, 2004.
Graphcut v.s. Blur edge
• OpenCV implementation
9
A coarse pipeline
10
Detect NaN
area in
depth map
Feed the
area to
Graphcut
Segment
the edge
How to determine pose?
• Model-based matching
• Rotate in x & y axis and store the edge
11
Z-axis Y-axis
The problem becomes a 2D-
2D matching problem
Where is the model?
• Kinect Fusion
12
Where is the model?
Wrap your
object with
paper
Use Kinect
Fusion to
construct the
model
Store the model
13
What if there are some other NaN
objects?
• Some non-transparent objects also
produce NaN in depth map
14
What if there are some other NaN
objects?
• Use characteristics of transparent object
to rule out non-transparent objects
15
Transparent
objects produce
highlights
Color of transparent
object is similar to
peripheral area
What if there are some other NaN
objects?
• Transparent objects produce highlights
16
Ref: K. McHenry, J. Ponce, and D. Forsyth, "Finding glass," in Computer Vision
and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference
on, 2005, pp. 973-979.
What if there are some other NaN
objects?
• Transparent objects produce highlights
17
Ref: K. McHenry, J. Ponce, and D. Forsyth, "Finding glass," in Computer Vision
and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference
on, 2005, pp. 973-979.
Threshold the image from 0-255
Compute the perimeter in each image
Compute the threshold by line fitting (from
255 to 0)
What if there are some other NaN
objects?
• Color of transparent object is similar to
peripheral area
18
What if there are some other NaN
objects?
• Color of transparent object is similar to
peripheral area
19
Hue histogram
A fine pipeline
20
Some results
• Pose Matching
21
Some results
• Total retrieved candidates are over 200
22
Method Recall Precision
Only NaN 86.11% 38.24%
Characteristics 86.11% 93.93%
Recall = (2/2)*100% =100%
Precision=(2/5)*100% =40%
Some other problems
• How to let robot grasp?
• Is there any choice other from Kinect?
23
How to let robot grasp?
• Teach and Play
24
Grasp
points
Is there any choice other from
Kinect?
• Extract the visual word of transparent
objects
25
Is there any choice other from
Kinect?
26
Ref: M. Fritz, G. Bradski, S. Karayev, T. Darrell, and M. J. Black, "An additive latent
feature model for transparent object recognition," in Advances in Neural
Information Processing Systems, 2009, pp. 558-566.
Is there any choice other from
Kinect?
• The result can be the input of Graphcut
27
Ref: M. Fritz, G. Bradski, S. Karayev, T. Darrell, and M. J. Black, "An additive latent
feature model for transparent object recognition," in Advances in Neural
Information Processing Systems, 2009, pp. 558-566.
28
Thank you!

Contenu connexe

Similaire à Seminar報告_20150520

NIPS2009: Understand Visual Scenes - Part 2
NIPS2009: Understand Visual Scenes - Part 2NIPS2009: Understand Visual Scenes - Part 2
NIPS2009: Understand Visual Scenes - Part 2
zukun
 
ICCES 2017 - Crowd Density Estimation Method using Regression Analysis
ICCES 2017 - Crowd Density Estimation Method using Regression AnalysisICCES 2017 - Crowd Density Estimation Method using Regression Analysis
ICCES 2017 - Crowd Density Estimation Method using Regression Analysis
Ahmed Gad
 

Similaire à Seminar報告_20150520 (20)

Point Cloud Processing: Estimating Normal Vectors and Curvature Indicators us...
Point Cloud Processing: Estimating Normal Vectors and Curvature Indicators us...Point Cloud Processing: Estimating Normal Vectors and Curvature Indicators us...
Point Cloud Processing: Estimating Normal Vectors and Curvature Indicators us...
 
NIPS2009: Understand Visual Scenes - Part 2
NIPS2009: Understand Visual Scenes - Part 2NIPS2009: Understand Visual Scenes - Part 2
NIPS2009: Understand Visual Scenes - Part 2
 
Introduction to 3D Computer Vision and Differentiable Rendering
Introduction to 3D Computer Vision and Differentiable RenderingIntroduction to 3D Computer Vision and Differentiable Rendering
Introduction to 3D Computer Vision and Differentiable Rendering
 
Visual geometry with deep learning
Visual geometry with deep learningVisual geometry with deep learning
Visual geometry with deep learning
 
Video Analysis with Convolutional Neural Networks (Master Computer Vision Bar...
Video Analysis with Convolutional Neural Networks (Master Computer Vision Bar...Video Analysis with Convolutional Neural Networks (Master Computer Vision Bar...
Video Analysis with Convolutional Neural Networks (Master Computer Vision Bar...
 
Faster R-CNN: Towards real-time object detection with region proposal network...
Faster R-CNN: Towards real-time object detection with region proposal network...Faster R-CNN: Towards real-time object detection with region proposal network...
Faster R-CNN: Towards real-time object detection with region proposal network...
 
Deep Video Object Tracking - Xavier Giro - UPC Barcelona 2019
Deep Video Object Tracking - Xavier Giro - UPC Barcelona 2019Deep Video Object Tracking - Xavier Giro - UPC Barcelona 2019
Deep Video Object Tracking - Xavier Giro - UPC Barcelona 2019
 
Learning with Unpaired Data
Learning with Unpaired DataLearning with Unpaired Data
Learning with Unpaired Data
 
AR/SLAM and IoT
AR/SLAM and IoTAR/SLAM and IoT
AR/SLAM and IoT
 
lecture_16_jiajun.pdf
lecture_16_jiajun.pdflecture_16_jiajun.pdf
lecture_16_jiajun.pdf
 
ICCES 2017 - Crowd Density Estimation Method using Regression Analysis
ICCES 2017 - Crowd Density Estimation Method using Regression AnalysisICCES 2017 - Crowd Density Estimation Method using Regression Analysis
ICCES 2017 - Crowd Density Estimation Method using Regression Analysis
 
Deep learning for object detection
Deep learning for object detectionDeep learning for object detection
Deep learning for object detection
 
Visual Transformers
Visual TransformersVisual Transformers
Visual Transformers
 
Government GraphSummit: Leveraging Graphs for AI and ML
Government GraphSummit: Leveraging Graphs for AI and MLGovernment GraphSummit: Leveraging Graphs for AI and ML
Government GraphSummit: Leveraging Graphs for AI and ML
 
Deep Learning for Computer Vision (3/4): Video Analytics @ laSalle 2016
Deep Learning for Computer Vision (3/4): Video Analytics @ laSalle 2016Deep Learning for Computer Vision (3/4): Video Analytics @ laSalle 2016
Deep Learning for Computer Vision (3/4): Video Analytics @ laSalle 2016
 
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
 
Interpretability of Convolutional Neural Networks - Xavier Giro - UPC Barcelo...
Interpretability of Convolutional Neural Networks - Xavier Giro - UPC Barcelo...Interpretability of Convolutional Neural Networks - Xavier Giro - UPC Barcelo...
Interpretability of Convolutional Neural Networks - Xavier Giro - UPC Barcelo...
 
Transformer in Vision
Transformer in VisionTransformer in Vision
Transformer in Vision
 
Practical computer vision-- A problem-driven approach towards learning CV/ML/DL
Practical computer vision-- A problem-driven approach towards learning CV/ML/DLPractical computer vision-- A problem-driven approach towards learning CV/ML/DL
Practical computer vision-- A problem-driven approach towards learning CV/ML/DL
 
最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に - 最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に -
 

Dernier

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Dernier (20)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 

Seminar報告_20150520

  • 1. 3D Pose Estimation for Transparent Objects Presenter: 賴柏任 Advisor:羅仁權教授 05.20.2015
  • 2. Motivation • Transparent objects are everywhere • If we know he pose, we can grasp it! 2
  • 3. Problems 3 Color of transparent object changes Hard to locate transparent objects Edge of transparent objects are blur Hard to estimate pose of transparent objects
  • 4. Effective cure 4 Color of transparent object changes Edge of transparent objects are blur
  • 5. Kinect v.s. Color changes • Transparent objects produce NaN in depth map 5 Ref: I. Lysenkov and V. Rabaud, "Pose estimation of rigid transparent objects in transparent clutter," in Robotics and Automation (ICRA), 2013 IEEE International Conference on, 2013, pp. 162-169.
  • 6. Graphcut v.s. Blur edge • Given foreground & background clue 6 Ref: C. Rother, V. Kolmogorov, and A. Blake, "Grabcut: Interactive foreground extraction using iterated graph cuts," ACM Transactions on Graphics (TOG), vol. 23, pp. 309-314, 2004.
  • 7. Graphcut v.s. Blur edge • Generate the prob. distribution 7 Ref: C. Rother, V. Kolmogorov, and A. Blake, "Grabcut: Interactive foreground extraction using iterated graph cuts," ACM Transactions on Graphics (TOG), vol. 23, pp. 309-314, 2004.
  • 8. Graphcut v.s. Blur edge • Use distance to compensate 8 Ref: C. Rother, V. Kolmogorov, and A. Blake, "Grabcut: Interactive foreground extraction using iterated graph cuts," ACM Transactions on Graphics (TOG), vol. 23, pp. 309-314, 2004.
  • 9. Graphcut v.s. Blur edge • OpenCV implementation 9
  • 10. A coarse pipeline 10 Detect NaN area in depth map Feed the area to Graphcut Segment the edge
  • 11. How to determine pose? • Model-based matching • Rotate in x & y axis and store the edge 11 Z-axis Y-axis The problem becomes a 2D- 2D matching problem
  • 12. Where is the model? • Kinect Fusion 12
  • 13. Where is the model? Wrap your object with paper Use Kinect Fusion to construct the model Store the model 13
  • 14. What if there are some other NaN objects? • Some non-transparent objects also produce NaN in depth map 14
  • 15. What if there are some other NaN objects? • Use characteristics of transparent object to rule out non-transparent objects 15 Transparent objects produce highlights Color of transparent object is similar to peripheral area
  • 16. What if there are some other NaN objects? • Transparent objects produce highlights 16 Ref: K. McHenry, J. Ponce, and D. Forsyth, "Finding glass," in Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, 2005, pp. 973-979.
  • 17. What if there are some other NaN objects? • Transparent objects produce highlights 17 Ref: K. McHenry, J. Ponce, and D. Forsyth, "Finding glass," in Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, 2005, pp. 973-979. Threshold the image from 0-255 Compute the perimeter in each image Compute the threshold by line fitting (from 255 to 0)
  • 18. What if there are some other NaN objects? • Color of transparent object is similar to peripheral area 18
  • 19. What if there are some other NaN objects? • Color of transparent object is similar to peripheral area 19 Hue histogram
  • 21. Some results • Pose Matching 21
  • 22. Some results • Total retrieved candidates are over 200 22 Method Recall Precision Only NaN 86.11% 38.24% Characteristics 86.11% 93.93% Recall = (2/2)*100% =100% Precision=(2/5)*100% =40%
  • 23. Some other problems • How to let robot grasp? • Is there any choice other from Kinect? 23
  • 24. How to let robot grasp? • Teach and Play 24 Grasp points
  • 25. Is there any choice other from Kinect? • Extract the visual word of transparent objects 25
  • 26. Is there any choice other from Kinect? 26 Ref: M. Fritz, G. Bradski, S. Karayev, T. Darrell, and M. J. Black, "An additive latent feature model for transparent object recognition," in Advances in Neural Information Processing Systems, 2009, pp. 558-566.
  • 27. Is there any choice other from Kinect? • The result can be the input of Graphcut 27 Ref: M. Fritz, G. Bradski, S. Karayev, T. Darrell, and M. J. Black, "An additive latent feature model for transparent object recognition," in Advances in Neural Information Processing Systems, 2009, pp. 558-566.