CVPR2009: Object Detection Using a Max-Margin Hough Transform


zukun


Object Detection Using a Max-Margin Hough Transform. Subhransu Maji and Jitendra Malik, University of California at Berkeley, Berkeley, CA-94720. CVPR 2009, Miami, Florida.

Overview

Our Approach: Hough Transform

Generalized to Object Detection: Learning (spatial occurrence distributions over x, y, s)

Detection Pipeline: Interest Points (e.g. SIFT, GB, local patches), Matched Codebook Entries (KD tree), Probabilistic Voting. From B. Leibe, A. Leonardis, and B. Schiele, "Combined object categorization and segmentation with an implicit shape model", 2004.

Probabilistic Hough Transform: Detection Score, Position Posterior, Codeword Match, Codeword Likelihood

Learning Feature Weights

Learning Feature Weights: First Try

Learning Feature Weights: Second Try (Position Posterior, Codeword Match, Codeword Likelihood; Activations, Feature Weights)

Max-Margin Training: Standard ISM model (Leibe et al. '04); Our Contribution; class label {+1, -1}; activations; non-negative

Experimental Results

Learned Weights (ETHZ shape) Max-Margin Important Parts Naïve Bayes blue (low) , dark red (high) Influenced by clutter (rare structures)

Learned Weights (UIUC cars) blue (low) , dark red (high) Naïve Bayes Max-Margin Important Parts

Learned Weights (INRIA horses) blue (low) , dark red (high) Naïve Bayes Max-Margin Important Parts

Detection Results (ETHZ dataset) Recall @ 1.0 False Positives Per Window

Detection Results (INRIA Horses) Our Work

Detection Results (UIUC Cars) INRIA horses Our Work

Hough Voting + Verification Classifier (ETHZ Shape Dataset): Recall @ 0.3 False Positives Per Image. IKSVM was run on top 30 windows + local search. Compared with KAS (Ferrari et al., PAMI'08) and TPS-RPM (Ferrari et al., CVPR'07). Better-fitting bounding box; implicit sampling over aspect ratio.

Hough Voting + Verification Classifier IKSVM was run on top 30 windows + local search Our Work

Hough Voting + Verification Classifier UIUC Single Scale Car Dataset IKSVM was run on top 10 windows + local search 1.7% improvement

Summary

Acknowledgements. Thank You. Questions?

Backup Slide : Toy Example Rare but poor localization Rare and good localization


Experiment Datasets (slide 11)
ETHZ Shape Dataset (Ferrari et al., ECCV 2006): 255 images over 5 classes (Apple logo, Bottle, Giraffe, Mug, Swan)
UIUC Single Scale Cars Dataset (Agarwal & Roth, ECCV 2002): 1050 training images, 170 test images
INRIA Horse Dataset (Jurie & Ferrari): 170 positive + 170 negative images (50 + 50 for training)


Editor's Notes

Thank you. Good morning. I am going to present a learning framework for Hough transform based object detection.
We are interested in object detection, that is, localizing an instance of an object in an image. We use an approach based on the Hough transform. Before going into the details, I will give an overview of the Hough transform, followed by our learning framework. I will then present experimental results and conclude.
Yet another way of doing this is a Hough transform based approach. This is of course an old idea, proposed by Hough for detecting lines more than 50 years ago. Since then it has been generalized to detect parametric shapes such as ellipses and circles. Local parts cast votes for the object pose, and the complexity scales linearly with the number of parts times the number of votes.
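To make the voting idea concrete, here is a minimal sketch of the classic Hough transform for lines. It is an illustration only, not code from the talk: the function name `hough_lines`, the (theta, rho) parameterisation and the bin sizes are assumptions chosen for the example.

```python
import math
from collections import Counter

def hough_lines(points, n_theta=180, rho_step=1.0):
    """Accumulate votes for lines rho = x*cos(theta) + y*sin(theta).

    Hypothetical helper for illustration; bin sizes are arbitrary choices.
    """
    votes = Counter()
    for x, y in points:                    # each local part (edge point) ...
        for t in range(n_theta):           # ... casts one vote per angle bin
            theta = math.pi * t / n_theta
            rho = x * math.cos(theta) + y * math.sin(theta)
            votes[(t, round(rho / rho_step))] += 1
    return votes

# Ten points on the line y = x, i.e. theta = 135 degrees, rho = 0.
points = [(i, i) for i in range(10)]
votes = hough_lines(points)
votes_on_true_line = votes[(135, 0)]       # every point votes for this bin
```

The two nested loops show the cost stated above: the work is proportional to the number of parts times the number of votes each part casts.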
Recently Leibe and Schiele have extended this framework to object detection. A slide from their Implicit Shape Model framework illustrates the technique. Local parts are patches represented using a dictionary (codebook) learned from training examples. The position of each codeword is recorded on the training examples to form a distribution of the codeword's location with respect to the object center. For example, the patch corresponding to the head of the person is typically at a fixed vertical offset with respect to the torso, as seen in the bottom-left distribution. At test time, interest points are detected and matched to codebook entries, which vote for the object center. The peaks of the voting space correspond to object locations. It is quite a simple but powerful framework.
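The test-time voting described above can be sketched as follows. This is a simplified illustration under stated assumptions, not the Implicit Shape Model implementation: the codebook contents, the offsets, the bin size and the function name `vote_for_centers` are all hypothetical, and scale is ignored.

```python
from collections import defaultdict

# Hypothetical codebook: codeword -> (dx, dy) offsets from the patch to the
# object centre, as recorded on training examples.
codebook_offsets = {
    "head": [(0, 40)],
    "foot": [(0, -45), (5, -45)],
}

def vote_for_centers(matches, offsets, bin_size=10):
    """Each matched codeword votes for candidate object centres."""
    acc = defaultdict(float)
    for (x, y), codeword in matches:
        displacements = offsets[codeword]
        for dx, dy in displacements:
            cx, cy = x + dx, y + dy
            # Spread one unit of vote mass uniformly over the stored offsets.
            acc[(cx // bin_size, cy // bin_size)] += 1.0 / len(displacements)
    return acc

# Interest points already matched to their nearest codeword (assumed input).
matches = [((100, 60), "head"), ((100, 145), "foot"), ((300, 20), "head")]
acc = vote_for_centers(matches, codebook_offsets)
peak_bin = max(acc, key=acc.get)   # peak of the voting space
```

Here the head and foot patches of the same object agree on one bin, while the isolated head match at (300, 20) produces a weaker, separate peak.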
Let me introduce some notation for the next set of slides. Let C be the learned codebook, let f denote the features and l the locations of the features. The overall detection score is the sum of contributions from each feature f_j observed at a location l_j. Each feature is matched to the codebook as given by p(C_i|f_j); this could simply be 1 for the nearest neighbour and 0 for the other codewords. p(x|O,C_i,l_j) is the distribution of the centroid given the codeword C_i observed at location l_j. The last term, p(O|C_i,l_j), is the confidence (or weight) of the codeword C_i.
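Written out with the notation above, the detection score for object O at position x can be reconstructed as the following sum. This is assembled from the three terms named in the notes and holds up to a normalising constant; the exact form shown on the slide is not preserved in this transcript.

```latex
S(O, x) \;=\; \sum_{i,j}
  \underbrace{p(x \mid O, C_i, l_j)}_{\text{position posterior}}\;
  \underbrace{p(C_i \mid f_j)}_{\text{codeword match}}\;
  \underbrace{p(O \mid C_i, l_j)}_{\text{codeword likelihood (weight)}}
```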
Learning codeword weights in the context of the Hough transform has not been addressed well in the literature. In an earlier talk today we saw a way of learning discriminative dictionaries for the Hough transform. However, in situations where the codebook is fixed, we would like to learn the importance of each codeword. That is, we are given a codebook and the posterior distribution of the object center for each codeword, and we would like to learn weights so that the Hough transform detector has the best detection rates. What we show is that these weights can be learned optimally using convex optimization, and that they lead to better detection rates than uniform weights and even a simple learning scheme.
Assign each codeword a weight proportional to the relative frequency of the object. We call these the naïve Bayes weights. (Read from slides)
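One plausible reading of this first try, sketched below, is to weight each codeword by how often it occurs on the object relative to how often it occurs at all. The counts and the function name `naive_bayes_weights` are hypothetical; the slide's exact normalisation is not preserved here.

```python
def naive_bayes_weights(object_counts, total_counts):
    """w_i = (occurrences of codeword i on the object) / (occurrences anywhere).

    Illustrative sketch only; weights are defined up to a common scale.
    """
    return {
        codeword: object_counts.get(codeword, 0) / total
        for codeword, total in total_counts.items()
        if total > 0
    }

# Made-up counts: how often each codeword fired on objects vs. anywhere.
weights = naive_bayes_weights(
    object_counts={"corner": 8, "edge": 5},
    total_counts={"corner": 10, "edge": 50, "blob": 20},
)
```

Note that this scheme looks only at frequency: a codeword that is rare on the background gets a high weight even if it localizes the object center poorly.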
If you look at the equation of the Hough transform, you realize that the overall score is linear in the codeword weights. This assumes location invariance of the object (i.e. the object can appear anywhere in the image). Thus the score is a dot product of the weight vector and an activation vector. The activations are independent of the weights given the features and their locations. This suggests a learning scheme that learns weights which increase the score at the positive locations over the negative ones. We formalize this in the next slide.
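The following sketch illustrates the idea of learning weights from activations with a max-margin objective. It assumes a soft-margin hinge loss with non-negative weights, solved by projected subgradient descent; the solver, the hyperparameters and the toy activations are assumptions made for illustration and are not taken from the talk.

```python
def score(w, a, b=0.0):
    """Hough score at one location: a dot product, linear in the weights."""
    return sum(wi * ai for wi, ai in zip(w, a)) + b

def train_max_margin(A, y, C=1.0, lr=0.01, epochs=500):
    """Minimise 0.5*||w||^2 + C*sum(hinge) subject to w >= 0.

    Projected subgradient descent; a simple stand-in for a proper QP solver.
    """
    w, b = [0.0] * len(A[0]), 0.0
    for _ in range(epochs):
        grad_w, grad_b = list(w), 0.0          # gradient of the regulariser
        for a, yi in zip(A, y):
            if yi * score(w, a, b) < 1.0:      # margin violated: hinge active
                grad_w = [g - C * yi * ai for g, ai in zip(grad_w, a)]
                grad_b -= C * yi
        # Gradient step, then projection onto the constraint w >= 0.
        w = [max(0.0, wi - lr * g) for wi, g in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b

# Toy activation vectors (one entry per codeword) with labels +1 / -1.
A = [[1.0, 0.2], [0.9, 0.1], [0.2, 0.3], [0.1, 0.2]]
y = [+1, +1, -1, -1]
w, b = train_max_margin(A, y)
pos_scores = [score(w, a, b) for a, yi in zip(A, y) if yi > 0]
neg_scores = [score(w, a, b) for a, yi in zip(A, y) if yi < 0]
```

Because the activations do not depend on the weights, they can be computed once from the codebook and the training locations, and only the weight vector is optimised.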
We perform experiments on 3 datasets (ETHZ, UIUC cars and INRIA horses)
Our Hough transform (HT) detector is based on geometric blur (GB) descriptors (read from slide), and correct detections are counted using the PASCAL criterion, i.e. an overlap of greater than 0.5.
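As a reminder of how the PASCAL criterion is usually computed, the sketch below measures overlap as intersection over union of the predicted and ground-truth boxes. The box format and function names are assumptions made for this example.

```python
def iou(box_a, box_b):
    """Intersection over union of two boxes (x_min, y_min, x_max, y_max)."""
    inter_w = max(0.0, min(box_a[2], box_b[2]) - max(box_a[0], box_b[0]))
    inter_h = max(0.0, min(box_a[3], box_b[3]) - max(box_a[1], box_b[1]))
    inter = inter_w * inter_h
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def is_correct_detection(predicted, ground_truth, threshold=0.5):
    """PASCAL-style check: overlap must be strictly greater than the threshold."""
    return iou(predicted, ground_truth) > threshold

good = is_correct_detection((2, 0, 12, 10), (0, 0, 10, 10))   # IoU = 80 / 120
bad = is_correct_detection((5, 0, 15, 10), (0, 0, 10, 10))    # IoU = 50 / 150
```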
To illustrate the idea, consider a toy example. We are trying to detect squares, where the negative examples are parallel lines as shown. We have four kinds of codewords: tips, vertical edges, horizontal edges and corners. Both corners and horizontal edges occur on the positive example only; however, let's assume that corners are easy to localize while a horizontal edge can appear anywhere. The naïve Bayes (NB) scheme assigns equal weights to both, whereas our framework distinguishes them correctly, as seen in the table of weights. The final scores on the positive and negative examples are shown for all the schemes, and one can see that M2HT achieves the maximum separation.
