SlideShare une entreprise Scribd logo
1  sur  56
Classical Methods for Object Recognition  Rob Fergus (NYU)
Classical Methods Bag of words approaches Parts and structure approaches  Discriminative methods Condensed version of sections from  2007 edition of  tutorial
Bag of Words Models
Object Bag of ‘words’
Bag of Words Independent features  Histogram representation
1.Feature detectionand representation Compute descriptor  e.g. SIFT [Lowe’99] Normalize patch Detect patches [Mikojaczyk and Schmid ’02] [Mata, Chum, Urban & Pajdla, ’02]  [Sivic & Zisserman, ’03] Local interest operator or Regular grid Slide credit: Josef Sivic
… 1.Feature detectionand representation
… 2. Codewords dictionary formation 128-D SIFT space
… 2. Codewords dictionary formation Codewords + + + Vector quantization 128-D SIFT space Slide credit: Josef Sivic
Image patch examples of codewords Sivic et al. 2005
….. Image representation Histogram of features assigned to each cluster  frequency codewords
Uses of BoW representation Treat as feature vector for standard classifier e.g SVM Cluster BoW vectors over image collection Discover visual themes Hierarchical models  Decompose scene/object Scene
BoW as input to classifier SVM for object classification Csurka, Bray, Dance & Fan, 2004 Naïve Bayes See 2007 edition of this course
Clustering BoW vectors  Use models from text document literature Probabilistic latent semantic analysis (pLSA) Latent Dirichlet allocation (LDA) See 2007 edition for explanation/code d = image,   w = visual word,    z = topic (cluster)
Clustering BoW vectors Scene classification (supervised) Vogel & Schiele, 2004 Fei-Fei & Perona, 2005 Bosch, Zisserman & Munoz, 2006 Object discovery (unsupervised) Each cluster corresponds to visual theme Sivic, Russell, Efros, Freeman & Zisserman, 2005
Related work Early “bag of words” models: mostly texture recognition Cula & Dana, 2001; Leung & Malik 2001; Mori, Belongie & Malik, 2001; Schmid 2001; Varma & Zisserman, 2002, 2003; Lazebnik, Schmid & Ponce, 2003 Hierarchical Bayesian models for documents (pLSA, LDA, etc.) Hoffman 1999; Blei, Ng & Jordan, 2004; Teh, Jordan, Beal & Blei, 2004 Object categorization Csurka, Bray, Dance & Fan, 2004; Sivic, Russell, Efros, Freeman & Zisserman, 2005; Sudderth, Torralba, Freeman & Willsky, 2005; Natural scene categorization Vogel & Schiele, 2004; Fei-Fei & Perona, 2005; Bosch, Zisserman & Munoz, 2006
What about spatial info? ?
Adding spatial info. to BoW Feature level Spatial influence through correlogram features: Savarese, Winn and Criminisi, CVPR 2006
Adding spatial info. to BoW Feature level Generative models Sudderth, Torralba, Freeman & Willsky, 2005, 2006 Hierarchical model of scene/objects/parts
P1 P2 P3 P4 w Image Bg Adding spatial info. to BoW Feature level Generative models Sudderth, Torralba, Freeman & Willsky, 2005, 2006 Niebles & Fei-Fei, CVPR 2007
Adding spatial info. to BoW Feature level Generative models Discriminative methods Lazebnik, Schmid & Ponce, 2006
Part-based Models
Problem with bag-of-words All have equal probability for bag-of-words methods Location information is important BoW + location still doesn’t give correspondence
Model: Parts and Structure
Representation Object as set of parts Generative representation Model: Relative locations between parts Appearance of part Issues: How to model location How to represent appearance How to handle occlusion/clutter Figure from [Fischler & Elschlager 73]
History of Parts and Structure approaches ,[object Object]
Yuille ‘91
Brunelli & Poggio ‘93
Lades, v.d. Malsburg et al. ‘93
Cootes, Lanitis, Taylor et al. ‘95
Amit & Geman ‘95, ‘99
Perona et al. ‘95, ‘96, ’98, ’00, ’03, ‘04, ‘05
Felzenszwalb & Huttenlocher ’00, ’04
Crandall & Huttenlocher ’05, ’06
Leibe & Schiele ’03, ’04
Many papers since 2000,[object Object]
The correspondence problem Model with P parts Image with N possible assignments for each part Consider mapping to be 1-1 ,[object Object],[object Object]
Efficient methods ,[object Object]
Felzenszwalb and Huttenlocher ‘00 and ‘05
 O(N2P)  O(NP) for tree structured   models
 Removes need for region detectors,[object Object]
Appearance representation ,[object Object],Decision trees [Lepetit and Fua CVPR 2005] ,[object Object],Figure from Winn & Shotton, CVPR ‘06
Learn Appearance Generative models of appearance Can learn with little supervision E.g. Fergus et al’ 03 Discriminative training of part appearance model SVM part detectors Felzenszwalb, Mcallester, Ramanan, CVPR 2008 Much better performance
Felzenszwalb, Mcallester, Ramanan, CVPR 2008 2-scale model Whole object Parts HOG representation +SVM training to obtainrobust part detectors Distancetransforms allowexamination of every location in the image
Hierarchical Representations  Pixels  Pixel groupings  Parts  Object ,[object Object]
Amit and Geman’98
Ullman et al.
Bouchard & Triggs’05
Zhu and Mumford
Jin & Geman‘06
Zhu & Yuille ’07
Fidler & Leonardis ‘07Images from [Amit98]
Stochastic Grammar of ImagesS.C. Zhu et al. and D. Mumford
Context and Hierarchy in a Probabilistic Image ModelJin & Geman (2006) animal head instantiated by bear head e.g. animals, trees, rocks e.g. contours, intermediate objects e.g. linelets, curvelets, T-junctions e.g. discontinuities, gradient  animal head instantiated by tiger head
A Hierarchical Compositional System for Rapid Object DetectionLong Zhu, Alan L. Yuille, 2007. Able to learn #parts at each level
Learning a Compositional Hierarchy of Object Structure Fidler & Leonardis, CVPR’07; Fidler, Boben & Leonardis, CVPR 2008 Parts model The architecture Learned parts

Contenu connexe

Tendances

Lecture 21 - Image Categorization - Computer Vision Spring2015
Lecture 21 - Image Categorization -  Computer Vision Spring2015Lecture 21 - Image Categorization -  Computer Vision Spring2015
Lecture 21 - Image Categorization - Computer Vision Spring2015
Jia-Bin Huang
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer vision
zukun
 
No Slide Title
No Slide TitleNo Slide Title
No Slide Title
butest
 

Tendances (6)

Blurclassification
BlurclassificationBlurclassification
Blurclassification
 
SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用
SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用
SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用
 
Lecture 21 - Image Categorization - Computer Vision Spring2015
Lecture 21 - Image Categorization -  Computer Vision Spring2015Lecture 21 - Image Categorization -  Computer Vision Spring2015
Lecture 21 - Image Categorization - Computer Vision Spring2015
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer vision
 
No Slide Title
No Slide TitleNo Slide Title
No Slide Title
 
Information Visualisation (Multimedia 2009 course)
Information Visualisation (Multimedia 2009 course)Information Visualisation (Multimedia 2009 course)
Information Visualisation (Multimedia 2009 course)
 

En vedette (6)

美团技术沙龙02 - 滴滴订单分配策略
美团技术沙龙02 - 滴滴订单分配策略美团技术沙龙02 - 滴滴订单分配策略
美团技术沙龙02 - 滴滴订单分配策略
 
机器学习概述
机器学习概述机器学习概述
机器学习概述
 
第一讲 机器学习概述
第一讲 机器学习概述第一讲 机器学习概述
第一讲 机器学习概述
 
An Introduction to Computer Vision
An Introduction to Computer VisionAn Introduction to Computer Vision
An Introduction to Computer Vision
 
Computer Vision
Computer VisionComputer Vision
Computer Vision
 
Computer Vision Basics
Computer Vision BasicsComputer Vision Basics
Computer Vision Basics
 

Similaire à Iccv2009 recognition and learning object categories p1 c01 - classical methods

Cvpr2007 object category recognition p2 - part based models
Cvpr2007 object category recognition   p2 - part based modelsCvpr2007 object category recognition   p2 - part based models
Cvpr2007 object category recognition p2 - part based models
zukun
 
Lec18 bag of_features
Lec18 bag of_featuresLec18 bag of_features
Lec18 bag of_features
Bo Li
 
Cvpr2007 object category recognition p4 - combined segmentation and recogni...
Cvpr2007 object category recognition   p4 - combined segmentation and recogni...Cvpr2007 object category recognition   p4 - combined segmentation and recogni...
Cvpr2007 object category recognition p4 - combined segmentation and recogni...
zukun
 
Contextless Object Recognition with Shape-enriched SIFT and Bags of Features
Contextless Object Recognition with Shape-enriched SIFT and Bags of FeaturesContextless Object Recognition with Shape-enriched SIFT and Bags of Features
Contextless Object Recognition with Shape-enriched SIFT and Bags of Features
Universitat Politècnica de Catalunya
 
Iccv2009 recognition and learning object categories p1 c03 - 3d object models
Iccv2009 recognition and learning object categories   p1 c03 - 3d object modelsIccv2009 recognition and learning object categories   p1 c03 - 3d object models
Iccv2009 recognition and learning object categories p1 c03 - 3d object models
zukun
 
Mit6870 orsu lecture2
Mit6870 orsu lecture2Mit6870 orsu lecture2
Mit6870 orsu lecture2
zukun
 
Histogram of oriented gradients for human detection
Histogram of oriented gradients for human detectionHistogram of oriented gradients for human detection
Histogram of oriented gradients for human detection
zukun
 

Similaire à Iccv2009 recognition and learning object categories p1 c01 - classical methods (20)

Cvpr2007 object category recognition p2 - part based models
Cvpr2007 object category recognition   p2 - part based modelsCvpr2007 object category recognition   p2 - part based models
Cvpr2007 object category recognition p2 - part based models
 
Lec18 bag of_features
Lec18 bag of_featuresLec18 bag of_features
Lec18 bag of_features
 
Cvpr2007 object category recognition p4 - combined segmentation and recogni...
Cvpr2007 object category recognition   p4 - combined segmentation and recogni...Cvpr2007 object category recognition   p4 - combined segmentation and recogni...
Cvpr2007 object category recognition p4 - combined segmentation and recogni...
 
Verifiable visualization for isosurface extraction vis 2009
Verifiable visualization for isosurface extraction   vis 2009Verifiable visualization for isosurface extraction   vis 2009
Verifiable visualization for isosurface extraction vis 2009
 
Salient KeypointSelection for Object Representation
Salient KeypointSelection for Object RepresentationSalient KeypointSelection for Object Representation
Salient KeypointSelection for Object Representation
 
Soundarya m.sc
Soundarya m.scSoundarya m.sc
Soundarya m.sc
 
06_features_slides.pdf
06_features_slides.pdf06_features_slides.pdf
06_features_slides.pdf
 
Low level vision - A tuturial
Low level vision - A tuturialLow level vision - A tuturial
Low level vision - A tuturial
 
Contextless Object Recognition with Shape-enriched SIFT and Bags of Features
Contextless Object Recognition with Shape-enriched SIFT and Bags of FeaturesContextless Object Recognition with Shape-enriched SIFT and Bags of Features
Contextless Object Recognition with Shape-enriched SIFT and Bags of Features
 
Iciap 2
Iciap 2Iciap 2
Iciap 2
 
lecture_16_jiajun.pdf
lecture_16_jiajun.pdflecture_16_jiajun.pdf
lecture_16_jiajun.pdf
 
Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...
Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...
Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...
 
Iccv2009 recognition and learning object categories p1 c03 - 3d object models
Iccv2009 recognition and learning object categories   p1 c03 - 3d object modelsIccv2009 recognition and learning object categories   p1 c03 - 3d object models
Iccv2009 recognition and learning object categories p1 c03 - 3d object models
 
Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...
Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...
Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Bagwords
BagwordsBagwords
Bagwords
 
Mit6870 orsu lecture2
Mit6870 orsu lecture2Mit6870 orsu lecture2
Mit6870 orsu lecture2
 
Histogram of oriented gradients for human detection
Histogram of oriented gradients for human detectionHistogram of oriented gradients for human detection
Histogram of oriented gradients for human detection
 
Cmap presentation
Cmap presentationCmap presentation
Cmap presentation
 
Colour Object Recognition using Biologically Inspired Model
Colour Object Recognition using Biologically Inspired ModelColour Object Recognition using Biologically Inspired Model
Colour Object Recognition using Biologically Inspired Model
 

Plus de zukun

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009
zukun
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCV
zukun
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Information
zukun
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statistics
zukun
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibration
zukun
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluation
zukun
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-software
zukun
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptors
zukun
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectors
zukun
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-intro
zukun
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video search
zukun
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video search
zukun
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video search
zukun
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learning
zukun
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer vision
zukun
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick start
zukun
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysis
zukun
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structures
zukun
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities
zukun
 
Icml2012 learning hierarchies of invariant features
Icml2012 learning hierarchies of invariant featuresIcml2012 learning hierarchies of invariant features
Icml2012 learning hierarchies of invariant features
zukun
 

Plus de zukun (20)

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCV
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Information
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statistics
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibration
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluation
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-software
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptors
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectors
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-intro
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video search
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video search
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video search
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learning
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer vision
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick start
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysis
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structures
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities
 
Icml2012 learning hierarchies of invariant features
Icml2012 learning hierarchies of invariant featuresIcml2012 learning hierarchies of invariant features
Icml2012 learning hierarchies of invariant features
 

Dernier

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Dernier (20)

Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural ResourcesEnergy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 

Iccv2009 recognition and learning object categories p1 c01 - classical methods

  • 1. Classical Methods for Object Recognition Rob Fergus (NYU)
  • 2. Classical Methods Bag of words approaches Parts and structure approaches Discriminative methods Condensed version of sections from 2007 edition of tutorial
  • 3. Bag of Words Models
  • 4. Object Bag of ‘words’
  • 5. Bag of Words Independent features Histogram representation
  • 6. 1.Feature detectionand representation Compute descriptor e.g. SIFT [Lowe’99] Normalize patch Detect patches [Mikojaczyk and Schmid ’02] [Mata, Chum, Urban & Pajdla, ’02] [Sivic & Zisserman, ’03] Local interest operator or Regular grid Slide credit: Josef Sivic
  • 7. … 1.Feature detectionand representation
  • 8. … 2. Codewords dictionary formation 128-D SIFT space
  • 9. … 2. Codewords dictionary formation Codewords + + + Vector quantization 128-D SIFT space Slide credit: Josef Sivic
  • 10. Image patch examples of codewords Sivic et al. 2005
  • 11. ….. Image representation Histogram of features assigned to each cluster frequency codewords
  • 12. Uses of BoW representation Treat as feature vector for standard classifier e.g SVM Cluster BoW vectors over image collection Discover visual themes Hierarchical models Decompose scene/object Scene
  • 13. BoW as input to classifier SVM for object classification Csurka, Bray, Dance & Fan, 2004 Naïve Bayes See 2007 edition of this course
  • 14. Clustering BoW vectors Use models from text document literature Probabilistic latent semantic analysis (pLSA) Latent Dirichlet allocation (LDA) See 2007 edition for explanation/code d = image, w = visual word, z = topic (cluster)
  • 15. Clustering BoW vectors Scene classification (supervised) Vogel & Schiele, 2004 Fei-Fei & Perona, 2005 Bosch, Zisserman & Munoz, 2006 Object discovery (unsupervised) Each cluster corresponds to visual theme Sivic, Russell, Efros, Freeman & Zisserman, 2005
  • 16. Related work Early “bag of words” models: mostly texture recognition Cula & Dana, 2001; Leung & Malik 2001; Mori, Belongie & Malik, 2001; Schmid 2001; Varma & Zisserman, 2002, 2003; Lazebnik, Schmid & Ponce, 2003 Hierarchical Bayesian models for documents (pLSA, LDA, etc.) Hoffman 1999; Blei, Ng & Jordan, 2004; Teh, Jordan, Beal & Blei, 2004 Object categorization Csurka, Bray, Dance & Fan, 2004; Sivic, Russell, Efros, Freeman & Zisserman, 2005; Sudderth, Torralba, Freeman & Willsky, 2005; Natural scene categorization Vogel & Schiele, 2004; Fei-Fei & Perona, 2005; Bosch, Zisserman & Munoz, 2006
  • 18. Adding spatial info. to BoW Feature level Spatial influence through correlogram features: Savarese, Winn and Criminisi, CVPR 2006
  • 19. Adding spatial info. to BoW Feature level Generative models Sudderth, Torralba, Freeman & Willsky, 2005, 2006 Hierarchical model of scene/objects/parts
  • 20. P1 P2 P3 P4 w Image Bg Adding spatial info. to BoW Feature level Generative models Sudderth, Torralba, Freeman & Willsky, 2005, 2006 Niebles & Fei-Fei, CVPR 2007
  • 21. Adding spatial info. to BoW Feature level Generative models Discriminative methods Lazebnik, Schmid & Ponce, 2006
  • 23. Problem with bag-of-words All have equal probability for bag-of-words methods Location information is important BoW + location still doesn’t give correspondence
  • 24. Model: Parts and Structure
  • 25. Representation Object as set of parts Generative representation Model: Relative locations between parts Appearance of part Issues: How to model location How to represent appearance How to handle occlusion/clutter Figure from [Fischler & Elschlager 73]
  • 26.
  • 29. Lades, v.d. Malsburg et al. ‘93
  • 30. Cootes, Lanitis, Taylor et al. ‘95
  • 31. Amit & Geman ‘95, ‘99
  • 32. Perona et al. ‘95, ‘96, ’98, ’00, ’03, ‘04, ‘05
  • 34. Crandall & Huttenlocher ’05, ’06
  • 35. Leibe & Schiele ’03, ’04
  • 36.
  • 37.
  • 38.
  • 40. O(N2P)  O(NP) for tree structured models
  • 41.
  • 42.
  • 43. Learn Appearance Generative models of appearance Can learn with little supervision E.g. Fergus et al’ 03 Discriminative training of part appearance model SVM part detectors Felzenszwalb, Mcallester, Ramanan, CVPR 2008 Much better performance
  • 44. Felzenszwalb, Mcallester, Ramanan, CVPR 2008 2-scale model Whole object Parts HOG representation +SVM training to obtainrobust part detectors Distancetransforms allowexamination of every location in the image
  • 45.
  • 51. Zhu & Yuille ’07
  • 52. Fidler & Leonardis ‘07Images from [Amit98]
  • 53. Stochastic Grammar of ImagesS.C. Zhu et al. and D. Mumford
  • 54. Context and Hierarchy in a Probabilistic Image ModelJin & Geman (2006) animal head instantiated by bear head e.g. animals, trees, rocks e.g. contours, intermediate objects e.g. linelets, curvelets, T-junctions e.g. discontinuities, gradient animal head instantiated by tiger head
  • 55. A Hierarchical Compositional System for Rapid Object DetectionLong Zhu, Alan L. Yuille, 2007. Able to learn #parts at each level
  • 56. Learning a Compositional Hierarchy of Object Structure Fidler & Leonardis, CVPR’07; Fidler, Boben & Leonardis, CVPR 2008 Parts model The architecture Learned parts
  • 57. Parts and Structure modelsSummary Explicit notion of correspondence between image and model Efficient methods for large # parts and # positions in image With powerful part detectors, can get state-of-the-art performance Hierarchical models allow for more parts
  • 59. Classifier based methods Decision boundary Background Computer screen Bag of image patches In some feature space Object detection and recognition is formulated as a classification problem. The image is partitioned into a set of overlapping windows … and a decision is taken at each window about if it contains a target object or not. Where are the screens?
  • 60.
  • 61.
  • 62.
  • 63. Face recognition using eigenfaces M. Turk and A. Pentland (1991).
  • 64. Human Face Detection in Visual Scenes - Rowley, Baluja, Kanade (1995)
  • 65. Graded Learning for Object Detection - Fleuret, Geman (1999)
  • 66. Robust Real-time Object Detection - Viola, Jones (2001)
  • 67. Feature Reduction and Hierarchy of Classifiers for Fast Object Detection in Video Images - Heisele, Serre, Mukherjee, Poggio (2001)
  • 68.
  • 69. Features: Edges and chamfer distance Gavrila, Philomin, ICCV 1999
  • 70. Features: Edge fragments Opelt, Pinz, Zisserman, ECCV 2006 Weak detector = k edge fragments and threshold. Chamfer distance uses 8 orientation planes
  • 71.
  • 72.
  • 73. Classifier: Neural Networks Fukushima’s Neocognitron, 1980 Rowley, Baluja, Kanade 1998 LeCun, Bottou, Bengio, Haffner 1998 Serre et al. 2005 Riesenhuber, M. and Poggio, T. 1999 LeNetconvolutional architecture (LeCun 1998)
  • 74. Classifier: Support Vector Machine Guyon, Vapnik Heisele, Serre, Poggio, 2001 …….. Dalal & Triggs , CVPR 2005 HOG – Histogram of Oriented gradients Learn weighting of descriptor with linear SVM Image HOG descriptor HOG descriptor weighted by +ve SVM -ve SVM weights
  • 75. Classifier: Boosting Viola & Jones 2001 Haar features via Integral Image Cascade Real-time performance ……. Torralbaet al., 2004 Part-based Boosting Each weak classifier is a part Part location modeled by offset mask
  • 76. Summary of classifier-based methods Many techniques for training discriminative models are used Many not mentioned here Conditional random fields Kernels for object recognition Learning object similarities .....
  • 77.
  • 78. Dalal & Triggs HOG detector HOG – Histogram of Oriented gradients Careful selection of spatial bin size/# orientation bins/normalization Learn weighting of descriptor with learn SVM Image HOG descriptor HOG descriptor weighted by +ve SVM -ve SVM weights