Seminar報告_20150520

•Télécharger en tant que PPTX, PDF•

2 j'aime•1,234 vues

Po-Jen Lai

2015.05.20的seminar報告，就當作提早把碩論要講的內容大致整理一次。

Technologie

3D Pose Estimation for
Transparent Objects
Presenter: 賴柏任
Advisor:羅仁權教授
05.20.2015

Motivation
• Transparent objects are everywhere
• If we know he pose, we can grasp it!
2

Problems
3
Color of
transparent
object changes
Hard to locate
transparent
objects
Edge of
transparent objects
are blur
Hard to estimate
pose of
transparent objects

Effective cure
4
Color of
transparent
object changes
Edge of
transparent
objects are blur

Kinect v.s. Color changes
• Transparent objects produce NaN in
depth map
5
Ref: I. Lysenkov and V. Rabaud, "Pose estimation of rigid transparent objects in
transparent clutter," in Robotics and Automation (ICRA), 2013 IEEE International
Conference on, 2013, pp. 162-169.

Graphcut v.s. Blur edge
• Given foreground & background clue
6
Ref: C. Rother, V. Kolmogorov, and A. Blake, "Grabcut: Interactive foreground
extraction using iterated graph cuts," ACM Transactions on Graphics (TOG), vol.
23, pp. 309-314, 2004.

Graphcut v.s. Blur edge
• Generate the prob. distribution
7
Ref: C. Rother, V. Kolmogorov, and A. Blake, "Grabcut: Interactive foreground
extraction using iterated graph cuts," ACM Transactions on Graphics (TOG), vol.
23, pp. 309-314, 2004.

Graphcut v.s. Blur edge
• Use distance to compensate
8
Ref: C. Rother, V. Kolmogorov, and A. Blake, "Grabcut: Interactive foreground
extraction using iterated graph cuts," ACM Transactions on Graphics (TOG), vol.
23, pp. 309-314, 2004.

Graphcut v.s. Blur edge
• OpenCV implementation
9

A coarse pipeline
10
Detect NaN
area in
depth map
Feed the
area to
Graphcut
Segment
the edge

How to determine pose?
• Model-based matching
• Rotate in x & y axis and store the edge
11
Z-axis Y-axis
The problem becomes a 2D-
2D matching problem

Where is the model?
Wrap your
object with
paper
Use Kinect
Fusion to
construct the
model
Store the model
13

What if there are some other NaN
objects?
• Some non-transparent objects also
produce NaN in depth map
14

What if there are some other NaN
objects?
• Use characteristics of transparent object
to rule out non-transparent objects
15
Transparent
objects produce
highlights
Color of transparent
object is similar to
peripheral area

What if there are some other NaN
objects?
• Transparent objects produce highlights
16
Ref: K. McHenry, J. Ponce, and D. Forsyth, "Finding glass," in Computer Vision
and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference
on, 2005, pp. 973-979.

What if there are some other NaN
objects?
• Transparent objects produce highlights
17
Ref: K. McHenry, J. Ponce, and D. Forsyth, "Finding glass," in Computer Vision
and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference
on, 2005, pp. 973-979.
Threshold the image from 0-255
Compute the perimeter in each image
Compute the threshold by line fitting (from
255 to 0)

What if there are some other NaN
objects?
• Color of transparent object is similar to
peripheral area
18

What if there are some other NaN
objects?
• Color of transparent object is similar to
peripheral area
19
Hue histogram

Some results
• Total retrieved candidates are over 200
22
Method Recall Precision
Only NaN 86.11% 38.24%
Characteristics 86.11% 93.93%
Recall = (2/2)*100% =100%
Precision=(2/5)*100% =40%

Some other problems
• How to let robot grasp?
• Is there any choice other from Kinect?
23

How to let robot grasp?
• Teach and Play
24
Grasp
points

Is there any choice other from
Kinect?
• Extract the visual word of transparent
objects
25

Is there any choice other from
Kinect?
26
Ref: M. Fritz, G. Bradski, S. Karayev, T. Darrell, and M. J. Black, "An additive latent
feature model for transparent object recognition," in Advances in Neural
Information Processing Systems, 2009, pp. 558-566.

Is there any choice other from
Kinect?
• The result can be the input of Graphcut
27
Ref: M. Fritz, G. Bradski, S. Karayev, T. Darrell, and M. J. Black, "An additive latent
feature model for transparent object recognition," in Advances in Neural
Information Processing Systems, 2009, pp. 558-566.

Contenu connexe

Similaire à Seminar報告_20150520

Point Cloud Processing: Estimating Normal Vectors and Curvature Indicators us...

Pirouz Nourian

NIPS2009: Understand Visual Scenes - Part 2

zukun

Introduction to 3D Computer Vision and Differentiable Rendering

Preferred Networks

Visual geometry with deep learning

NAVER Engineering

Video Analysis with Convolutional Neural Networks (Master Computer Vision Bar...

Universitat Politècnica de Catalunya

Slides by Amaia Salvador at the UPC Computer Vision Reading Group. Source document on GDocs with clickable links: https://docs.google.com/presentation/d/1jDTyKTNfZBfMl8OHANZJaYxsXTqGCHMVeMeBe5o1EL0/edit?usp=sharing Based on the original work: Ren, Shaoqing, Kaiming He, Ross Girshick, and Jian Sun. "Faster R-CNN: Towards real-time object detection with region proposal networks." In Advances in Neural Information Processing Systems, pp. 91-99. 2015.

Faster R-CNN: Towards real-time object detection with region proposal network...

Universitat Politècnica de Catalunya

Deep Video Object Tracking - Xavier Giro - UPC Barcelona 2019

Universitat Politècnica de Catalunya

Many learning tasks can be summarized as learning a mapping from a structured input to a structured output, such as machine translation, image captioning, image style transfer, and image dehazing. Such mappings are usually learned on paired training data, where an input sample and its corresponding output are both provided. Collecting paired training data often involves expensive human annotation, and the scale of paired training data is therefore often limited. As a result, the generalization ability of models trained on paired data is also limited. One way to mitigate this issue is learning with unpaired data, which is far less expensive to collect. Taking machine translation as an example, the unpaired training data can be collected separately from newspapers in the source language and target language without any annotation. The challenge of unpaired learning turns into how to align the unpaired data. With carefully designed objectives, unpaired learning has achieved remarkable progress on several tasks. This talk will cover the data collection and training methods of several unpaired learning tasks to illustrate the power of learning with unpaired data.

Learning with Unpaired Data

Goergen Institute for Data Science

AR/SLAM and IoT

Rakuten Group, Inc.

lecture_16_jiajun.pdf

Kuan-Tsae Huang

The oral presentation of the paper titled "Crowd Density Estimation Method using Multiple Feature Categories and Multiple Regression Models". This paper was accepted for publication and oral presentation in the 12th IEEE International Conference on Computer Engineering and Systems (ICCES 2017) held from 19 to 20 December 2017 in Cairo, Egypt. The paper proposed a new method to estimate the number of people within crowded scenes using regression analysis. The two challenges in crowd density estimation using regression analysis are perspective distortion and non-linearity. This paper solves the perspective distortion using perspective normalization which is the best way to deal with that problem based on recent works. The second challenge is solved by creating a new combination of features collected from multiple already existing categories including segmented region, texture, edge, and keypoints. This paper created a feature vector of length 164. Five regression models are used which are GPR, RF, RPF, LASSO, and KNN. Based on the experimental results, our proposed method gives better results than previous works. ---------------------------------- أحمد فوزي جاد Ahmed Fawzy Gad قسم تكنولوجيا المعلومات Information Technology (IT) Department كلية الحاسبات والمعلومات Faculty of Computers and Information (FCI) جامعة المنوفية, مصر Menoufia University, Egypt Teaching Assistant/Demonstrator ahmed.fawzy@ci.menofia.edu.eg --------------------------------- Find me on: Blog (Arabic) https://aiage-ar.blogspot.com.eg/ (English) https://aiage.blogspot.com.eg/ YouTube https://www.youtube.com/AhmedGadFCIT Google Plus https://plus.google.com/u/0/+AhmedGadIT SlideShare https://www.slideshare.net/AhmedGadFCIT LinkedIn https://www.linkedin.com/in/ahmedfgad reddit https://www.reddit.com/user/AhmedGadFCIT ResearchGate https://www.researchgate.net/profile/Ahmed_Gad13 Academia https://menofia.academia.edu/Gad Google Scholar https://scholar.google.com.eg/citations?user=r07tjocAAAAJ&hl=en Mendelay https://www.mendeley.com/profiles/ahmed-gad12 ORCID https://orcid.org/0000-0003-1978-8574 StackOverFlow http://stackoverflow.com/users/5426539/ahmed-gad Twitter https://twitter.com/ahmedfgad Facebook https://www.facebook.com/ahmed.f.gadd Pinterest https://www.pinterest.com/ahmedfgad

ICCES 2017 - Crowd Density Estimation Method using Regression Analysis

Ahmed Gad

Deep learning for object detection

Wenjing Chen

Visual Transformers

Kwanghee Choi

Phani Dathar, Ph.D., Data Science Solution Architect, Neo4j Relationships are highly predictive of behavior. Graph technology abstracts connections in our data so businesses can apply relationships and network structures to make better predictions. Hear about the journey from graph analytics and machine learning to graph-enhanced AI. We’ll also cover how enterprises are using graph data science in areas such as fraud, targeted marketing, healthcare, and recommendations.

Government GraphSummit: Leveraging Graphs for AI and ML

Neo4j

Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. The convergence of big annotated data and affordable GPU hardware has allowed the training of neural networks for data analysis tasks which had been addressed until now with hand-crafted features. Architectures such as convolutional neural networks, recurrent neural networks and Q-nets for reinforcement learning have shaped a brand new scenario in signal processing. This course will cover the basic principles and applications of deep learning to computer vision problems, such as image classification, object detection or text captioning.

Deep Learning for Computer Vision (3/4): Video Analytics @ laSalle 2016

Universitat Politècnica de Catalunya

This is the 3rd part of the tutorial on commonsense knowledge (CSK) at ACM WSDM 2021 by Simon Razniewski, Niket Tandon and Aparna Varde. It focuses on evaluation of the acquired knowledge, both intrinsic & extrinsic, as well as highlights, outlook with a brief perspective on COVID and open issues for further research. Abstract: Commonsense knowledge is a foundational cornerstone of artificial intelligence applications. Whereas information extraction and knowledge base construction for instance-oriented assertions, such as Brad Pitt’s birth date, or Angelina Jolie’s movie awards, has received much attention, commonsense knowledge on general concepts (politicians, bicycles, printers) and activities (eating pizza, fixing printers) has only been tackled recently. In this tutorial we present state-of-the-art methodologies towards the compilation and consolidation of such commonsense knowledge (CSK). We cover text-extraction-based, multi-modal and Transformer-based techniques, with special focus on the issues of web search and ranking, as of relevance to the WSDM community.

Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3

Dr. Aparna Varde

Interpretability of Convolutional Neural Networks - Xavier Giro - UPC Barcelo...

Universitat Politècnica de Catalunya

Transformer in Vision

Sangmin Woo

Practical computer vision-- A problem-driven approach towards learning CV/ML/DL

Albert Y. C. Chen

最近の研究情勢についていくために - Deep Learningを中心に -

Hiroshi Fukui

Similaire à Seminar報告_20150520 (20)

Point Cloud Processing: Estimating Normal Vectors and Curvature Indicators us...

NIPS2009: Understand Visual Scenes - Part 2

Introduction to 3D Computer Vision and Differentiable Rendering

Visual geometry with deep learning

Video Analysis with Convolutional Neural Networks (Master Computer Vision Bar...

Faster R-CNN: Towards real-time object detection with region proposal network...

Deep Video Object Tracking - Xavier Giro - UPC Barcelona 2019

Learning with Unpaired Data

AR/SLAM and IoT

lecture_16_jiajun.pdf

ICCES 2017 - Crowd Density Estimation Method using Regression Analysis

Deep learning for object detection

Visual Transformers

Government GraphSummit: Leveraging Graphs for AI and ML

Deep Learning for Computer Vision (3/4): Video Analytics @ laSalle 2016

Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3

Interpretability of Convolutional Neural Networks - Xavier Giro - UPC Barcelo...

Transformer in Vision

Practical computer vision-- A problem-driven approach towards learning CV/ML/DL

最近の研究情勢についていくために - Deep Learningを中心に -

Dernier

Imagine a world where information flows as swiftly as thought itself, making decision-making as fluid as the data driving it. Every moment is critical, and the right tools can significantly boost your organization’s performance. The power of real-time data automation through FME can turn this vision into reality. Aimed at professionals eager to leverage real-time data for enhanced decision-making and efficiency, this webinar will cover the essentials of real-time data and its significance. We’ll explore: FME’s role in real-time event processing, from data intake and analysis to transformation and reporting An overview of leveraging streams vs. automations FME’s impact across various industries highlighted by real-life case studies Live demonstrations on setting up FME workflows for real-time data Practical advice on getting started, best practices, and tips for effective implementation Join us to enhance your skills in real-time data automation with FME, and take your operational capabilities to the next level.

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

Safe Software

Scaling API-first – The story of a global engineering organization

Radu Cotescu

Enterprise Knowledge’s Urmi Majumder, Principal Data Architecture Consultant, and Fernando Aguilar Islas, Senior Data Science Consultant, presented "Driving Behavioral Change for Information Management through Data-Driven Green Strategy" on March 27, 2024 at Enterprise Data World (EDW) in Orlando, Florida. In this presentation, Urmi and Fernando discussed a case study describing how the information management division in a large supply chain organization drove user behavior change through awareness of the carbon footprint of their duplicated and near-duplicated content, identified via advanced data analytics. Check out their presentation to gain valuable perspectives on utilizing data-driven strategies to influence positive behavioral shifts and support sustainability initiatives within your organization. In this session, participants gained answers to the following questions: - What is a Green Information Management (IM) Strategy, and why should you have one? - How can Artificial Intelligence (AI) and Machine Learning (ML) support your Green IM Strategy through content deduplication? - How can an organization use insights into their data to influence employee behavior for IM? - How can you reap additional benefits from content reduction that go beyond Green IM?

Driving Behavioral Change for Information Management through Data-Driven Gree...

Enterprise Knowledge

In this session, we will delve into strategic approaches for optimizing knowledge management within Microsoft 365, amidst the evolving landscape of Copilot. From leveraging automatic metadata classification and permission governance with SharePoint Premium, to unlocking Viva Engage for the cultivation of knowledge and communities, you will gain actionable insights to bolster your organization's knowledge-sharing initiatives. In this session, we will also explore how to facilitate solutions to enable your employees to find answers and expertise within Microsoft 365. You will leave equipped with practical techniques and a deeper understanding of how there is more to effective knowledge management than just enabling Copilot, but building actual solutions to prepare the knowledge that Copilot and your employees can use.

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...

Drew Madelung

2024: Domino Containers - The Next Step. News from the Domino Container commu...

Martijn de Jong

This presentations targets students or working professionals. You may know Google for search, YouTube, Android, Chrome, and Gmail, but did you know Google has many developer tools, platforms & APIs? This comprehensive yet still high-level overview outlines the most impactful tools for where to run your code, store & analyze your data. It will also inspire you as to what's possible. This talk is 50 minutes in length.

Powerful Google developer tools for immediate impact! (2023-24 C)

wesley chun

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024

The Digital Insurer

BooK Now Call us at +918448380779 to hire a gorgeous and seductive call girl for sex. Take a Delhi Escort Service. The help of our escort agency is mostly meant for men who want sexual Indian Escorts In Delhi NCR. It should be noted that any impersonator will get 100 attention from our Young Girls Escorts in Delhi. They will assume the position of reliable allies. VIP Call Girl With Original Photos Book Tonight +918448380779 Our Cheap Price 1 Hour not available 2 Hours 5000 Full Night 8000 TAG: Call Girls in Delhi, Noida, Gurgaon, Ghaziabad, Connaught Place, Greater Kailash Delhi, Lajpat Nagar Delhi, Mayur Vihar Delhi, Chanakyapuri Delhi, New Friends Colony Delhi, Majnu Ka Tilla, Karol Bagh, Malviya Nagar, Saket, Khan Market, Noida Sector 18, Noida Sector 76, Noida Sector 51, Gurgaon Mg Road, Iffco Chowk Gurgaon, Rajiv Chowk Gurgaon All Delhi Ncr Free Home Deliver

08448380779 Call Girls In Friends Colony Women Seeking Men

Delhi Call girls

Building Digital Trust in a Digital Economy Veronica Tan, Director - Cyber Security Agency of Singapore Apidays Singapore 2024: Connecting Customers, Business and Technology (April 17 & 18, 2024) ------ Check out our conferences at https://www.apidays.global/ Do you want to sponsor or talk at one of our conferences? https://apidays.typeform.com/to/ILJeAaV8 Learn more on APIscene, the global media made by the community for the community: https://www.apiscene.io Explore the API ecosystem with the API Landscape: https://apilandscape.apiscene.io/

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

apidays

In an era where artificial intelligence (AI) stands at the forefront of business innovation, Information Architecture (IA) is at the core of functionality. See “There’s No AI Without IA” – (from 2016 but even more relevant today) Understanding and leveraging how Information Architecture (IA) supports AI synergies between knowledge engineering and prompt engineering is critical for senior leaders looking to successfully deploy AI for internal and externally facing knowledge processes. This webinar be a high-level overview of the methodologies that can elevate AI-driven knowledge processes supporting both employees and customers. Core Insights Include: Strategic Knowledge Engineering: Delve into how structuring AI's knowledge base is required to prevent hallucinations, enable contextual retrieval of accurate information. This will include discussion of gold standard libraries of use cases support testing various LLMs and structures and configurations of knowledge base. Precision in Prompt Engineering: Learn the art of crafting prompts that direct AI to deliver targeted, relevant responses, thereby optimizing customer experiences and business outcomes. Unified Approach for Enhanced AI Performance: Explore the intersection of knowledge and prompt engineering to develop AI systems that are not only more responsive but also aligned with overarching business strategies. Guiding Principles for Implementation: Equip yourself with best practices, ethical guidelines, and strategic considerations for embedding these technologies into your business ecosystem effectively. This webinar is designed to empower business and technology leaders with the knowledge to harness the full potential of AI, ensuring their organizations not only keep pace with digital transformation but lead the charge. Join us to map a roadmap to fully leverage Information Architecture (IA) and AI chart a course towards a future where AI is a key pillar of strategic innovation and business success.

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx

Earley Information Science

GenCyber Cyber Security Day Presentation

Michael W. Hawkins

Advantages of Hiring UIUX Design Service Providers for Your Business

Pixlogix Infotech

Heather Hedden, Senior Consultant at Enterprise Knowledge, presented “The Role of Taxonomy and Ontology in Semantic Layers” at a webinar hosted by Progress Semaphore on April 16, 2024. Taxonomies at their core enable effective tagging and retrieval of content, and combined with ontologies they extend to the management and understanding of related data. There are even greater benefits of taxonomies and ontologies to enhance your enterprise information architecture when applying them to a semantic layer. A survey by DBP-Institute found that enterprises using a semantic layer see their business outcomes improve by four times, while reducing their data and analytics costs. Extending taxonomies to a semantic layer can be a game-changing solution, allowing you to connect information silos, alleviate knowledge gaps, and derive new insights. Hedden, who specializes in taxonomy design and implementation, presented how the value of taxonomies shouldn’t reside in silos but be integrated with ontologies into a semantic layer. Learn about: - The essence and purpose of taxonomies and ontologies in information and knowledge management; - Advantages of semantic layers leveraging organizational taxonomies; and - Components and approaches to creating a semantic layer, including the integration of taxonomies and ontologies

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf

Enterprise Knowledge

Boost Fertility New Invention Ups Success Rates.pdf

sudhanshuwaghmare1

[2024]Digital Global Overview Report 2024 Meltwater.pdf

hans926745

Explore 'The Codex of Business: Writing Software for Real-World Solutions,' a compelling SlideShare presentation that delves into digital transformation in healthcare. Discover through a detailed case study how Agile methodologies empower healthcare providers to develop, iterate, and refine digital solutions that address real-world challenges. Learn how strategic planning, user feedback, and continuous improvement drive success in deploying technologies that enhance patient care and operational efficiency. Ideal for healthcare professionals, IT specialists, and digital transformation advocates seeking actionable insights and practical examples of technology making a real difference.

The Codex of Business Writing Software for Real-World Solutions 2.pptx

Malak Abu Hammad

What are drone anti-jamming systems? The drone anti-jamming systems and anti-spoof technology protect against interference, jamming, and spoofing of the UAVs. To protect their security, countries are beginning to research drone anti-jamming systems, also known as drone strike weapons. The anti-jam and anti-spoof technology protects against interference, jamming and spoofing. A drone strike weapon is a drone attack weapon that can attack and destroy enemy drones. So what is so unique about this amazing system?

What Are The Drone Anti-jamming Systems Technology?

Antenna Manufacturer Coco

Microsoft's Threat Matrix for Kubernetes helps organizations understand the attack surface a Kubernetes deployment introduces to their environments. This ensures that adequate detections and mitigations are in place. By covering over 40 different attacker techniques, defenders can learn about Kubernetes-specific mitigations and controls to deploy to their environments. In this session, we will explore the MS-TA9013 Host Path Mount technique, which is commonly used by attackers to perform privilege escalation in a Kubernetes cluster. Attendees will learn how attackers and defenders can: * Escape the container's host volume mount to gain persistence on an underlying node * Move laterally from the underlying node into the customer's cloud environment * Analyze Kubernetes audit logs to detect pods deployed with a hostPath mount * Deploy an admission controller that prevents new pods from using a hostPath mount

Breaking the Kubernetes Kill Chain: Host Path Mount

Puma Security, LLC

Sara Mae O’Brien Scott and Tatiana Baquero Cakici, Senior Consultants at Enterprise Knowledge (EK), presented “AI Fast Track to Search-Focused AI Solutions” at the Information Architecture Conference (IAC24) that took place on April 11, 2024 in Seattle, WA. In their presentation, O’Brien-Scott and Cakici focused on what Enterprise AI is, why it is important, and what it takes to empower organizations to get started on a search-based AI journey and stay on track. The presentation explored the complexities of enterprise search challenges and how IA principles can be leveraged to provide AI solutions through the use of a semantic layer. O’Brien-Scott and Cakici showcased a case study where a taxonomy, an ontology, and a knowledge graph were used to structure content at a healthcare workforce solutions organization, providing personalized content recommendations and increasing content findability. In this session, participants gained insights about the following: Most common types of AI categories and use cases; Recommended steps to design and implement taxonomies and ontologies, ensuring they evolve effectively and support the organization’s search objectives; Taxonomy and ontology design considerations and best practices; Real-world AI applications that illustrated the value of taxonomies, ontologies, and knowledge graphs; and Tools, roles, and skills to design and implement AI-powered search solutions.

IAC 2024 - IA Fast Track to Search Focused AI Solutions

Enterprise Knowledge

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...

Neo4j

Dernier (20)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

Scaling API-first – The story of a global engineering organization

Driving Behavioral Change for Information Management through Data-Driven Gree...

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...

2024: Domino Containers - The Next Step. News from the Domino Container commu...

Powerful Google developer tools for immediate impact! (2023-24 C)

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024

08448380779 Call Girls In Friends Colony Women Seeking Men

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx

GenCyber Cyber Security Day Presentation

Advantages of Hiring UIUX Design Service Providers for Your Business

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf

Boost Fertility New Invention Ups Success Rates.pdf

[2024]Digital Global Overview Report 2024 Meltwater.pdf

The Codex of Business Writing Software for Real-World Solutions 2.pptx

What Are The Drone Anti-jamming Systems Technology?

Breaking the Kubernetes Kill Chain: Host Path Mount

IAC 2024 - IA Fast Track to Search Focused AI Solutions

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...

Seminar報告_20150520

1. 3D Pose Estimation for Transparent Objects Presenter: 賴柏任 Advisor:羅仁權教授 05.20.2015

2. Motivation • Transparent objects are everywhere • If we know he pose, we can grasp it! 2

3. Problems 3 Color of transparent object changes Hard to locate transparent objects Edge of transparent objects are blur Hard to estimate pose of transparent objects

4. Effective cure 4 Color of transparent object changes Edge of transparent objects are blur

5. Kinect v.s. Color changes • Transparent objects produce NaN in depth map 5 Ref: I. Lysenkov and V. Rabaud, "Pose estimation of rigid transparent objects in transparent clutter," in Robotics and Automation (ICRA), 2013 IEEE International Conference on, 2013, pp. 162-169.

6. Graphcut v.s. Blur edge • Given foreground & background clue 6 Ref: C. Rother, V. Kolmogorov, and A. Blake, "Grabcut: Interactive foreground extraction using iterated graph cuts," ACM Transactions on Graphics (TOG), vol. 23, pp. 309-314, 2004.

7. Graphcut v.s. Blur edge • Generate the prob. distribution 7 Ref: C. Rother, V. Kolmogorov, and A. Blake, "Grabcut: Interactive foreground extraction using iterated graph cuts," ACM Transactions on Graphics (TOG), vol. 23, pp. 309-314, 2004.

8. Graphcut v.s. Blur edge • Use distance to compensate 8 Ref: C. Rother, V. Kolmogorov, and A. Blake, "Grabcut: Interactive foreground extraction using iterated graph cuts," ACM Transactions on Graphics (TOG), vol. 23, pp. 309-314, 2004.

9. Graphcut v.s. Blur edge • OpenCV implementation 9

10. A coarse pipeline 10 Detect NaN area in depth map Feed the area to Graphcut Segment the edge

11. How to determine pose? • Model-based matching • Rotate in x & y axis and store the edge 11 Z-axis Y-axis The problem becomes a 2D- 2D matching problem

12. Where is the model? • Kinect Fusion 12

13. Where is the model? Wrap your object with paper Use Kinect Fusion to construct the model Store the model 13

14. What if there are some other NaN objects? • Some non-transparent objects also produce NaN in depth map 14

15. What if there are some other NaN objects? • Use characteristics of transparent object to rule out non-transparent objects 15 Transparent objects produce highlights Color of transparent object is similar to peripheral area

16. What if there are some other NaN objects? • Transparent objects produce highlights 16 Ref: K. McHenry, J. Ponce, and D. Forsyth, "Finding glass," in Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, 2005, pp. 973-979.

17. What if there are some other NaN objects? • Transparent objects produce highlights 17 Ref: K. McHenry, J. Ponce, and D. Forsyth, "Finding glass," in Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, 2005, pp. 973-979. Threshold the image from 0-255 Compute the perimeter in each image Compute the threshold by line fitting (from 255 to 0)

18. What if there are some other NaN objects? • Color of transparent object is similar to peripheral area 18

19. What if there are some other NaN objects? • Color of transparent object is similar to peripheral area 19 Hue histogram

20. A fine pipeline 20

21. Some results • Pose Matching 21

22. Some results • Total retrieved candidates are over 200 22 Method Recall Precision Only NaN 86.11% 38.24% Characteristics 86.11% 93.93% Recall = (2/2)*100% =100% Precision=(2/5)*100% =40%

23. Some other problems • How to let robot grasp? • Is there any choice other from Kinect? 23

24. How to let robot grasp? • Teach and Play 24 Grasp points

25. Is there any choice other from Kinect? • Extract the visual word of transparent objects 25

26. Is there any choice other from Kinect? 26 Ref: M. Fritz, G. Bradski, S. Karayev, T. Darrell, and M. J. Black, "An additive latent feature model for transparent object recognition," in Advances in Neural Information Processing Systems, 2009, pp. 558-566.

27. Is there any choice other from Kinect? • The result can be the input of Graphcut 27 Ref: M. Fritz, G. Bradski, S. Karayev, T. Darrell, and M. J. Black, "An additive latent feature model for transparent object recognition," in Advances in Neural Information Processing Systems, 2009, pp. 558-566.

28. 28 Thank you!

Seminar報告_20150520

Recommandé

Recommandé

Contenu connexe

Similaire à Seminar報告_20150520

Similaire à Seminar報告_20150520 (20)

Dernier

Dernier (20)

Seminar報告_20150520