SlideShare une entreprise Scribd logo
1  sur  59
Multiclass object detection
Multiclass object detection
Context: objects appear in configurations
Generalization: objects share parts
How many categories?
“Muchas” Slide by Aude Oliva
How many object categories are there? ~10,000 to 30,000 Biederman 1987
How many categories? Probably this question is not even specific enough to have an answer
? Which level of categorization is the right one? Car is an object composed of: 	a few doors, four wheels (not all visible at all times), a roof,  	front lights, windshield  If you are thinking in buying a car, you might want to be a bit more specific about your categorization level.
A bird An ostrich Entry-level categories(Jolicoeur, Gluck, Kosslyn 1984) Typical member of a basic-level category are categorized at the expected level Atypical members tend to be classified at a subordinate level.
We do not need to recognize the exact category A new class can borrow information from similar categories
So, where is computer vision? Well…
Multiclass object detectionthe not so early days
[object Object],- two cascades for each view  (a) One detector for each class Multiclass object detectionthe not so early days Using a set of independent binary classifiers was a common strategy: ,[object Object],There is nothing wrong with this approach if you have access to lots of training data and you do not care about efficiency.
Generalizing Across Categories Can we transfer knowledge from one object category to another? Slide by Erik Sudderth
Shared features Is learning the object class 1000 easier than learning the first? Can we transfer knowledge from one object to another? Are the shared properties interesting by themselves?  …
Multitask learning R. Caruana. Multitask Learning. ML 1997 “MTL improves generalization by leveraging the domain-specific information contained in the training signals of related tasks. It does this by training tasks in parallel while using a shared representation”. vs. Sejnowski & Rosenberg 1986; Hinton 1986; Le Cun et al. 1989; Suddarth & Kergosien 1990; Pratt et al. 1991; Sharkey & Sharkey 1992; …
Multitask learning R. Caruana. Multitask Learning. ML 1997 Primary task: detect door knobs Tasks used: ,[object Object]
width of left door jamb
width of right door jamb
horizontal location of left edge of door
horizontal location of right edge of door
horizontal location of doorknob
single or double door
horizontal location of doorway center
width of doorway
horizontal location of left door jamb,[object Object]
Convolutional Neural Network Le Cun et al, 98 Translation invariance is already built into the network The output neurons share all the intermediate levels
Sharing transformations Miller, E., Matsakis, N., and Viola, P. (2000). Learning from one example through shared densities on transforms. In IEEE Computer Vision and Pattern Recognition. Transformations are shared and can be learnt from other tasks.
Sharing in constellation models Pictorial StructuresFischler & Elschlager, IEEE Trans. Comp. 1973 SVM DetectorsHeisele, Poggio, et. al., NIPS 2001 Constellation ModelFei-Fei, Fergus, Perona, ICCV 2003  Model-Guided SegmentationMori, Ren, Efros, & Malik, CVPR 2004
Reusable Parts Krempp, Geman, & Amit “Sequential Learning of Reusable Parts for Object Detection”. TR 2002 Goal: Look for a vocabulary of edges that reduces the number of features. Examples of reused parts Number of features Number of classes
Specific feature pedestrian chair Traffic light sign face Background class Non-shared feature: this feature is too specific to faces.
Shared feature shared feature
[object Object],Screen detector Car detector Face detector ,[object Object],Screen detector Car detector Face detector Additive models and boosting Torralba, Murphy, Freeman. CVPR 2004. PAMI 2007
50 training samples/class 29 object classes 2000 entries in the dictionary Results averaged on 20 runs Error bars = 80% interval Class-specific features Shared features Torralba, Murphy, Freeman. CVPR 2004. PAMI 2007
12 viewpoints 12 unrelated object classes Generalization as a function of object similarities K = 2.1 K = 4.8 Area under ROC Area under ROC Number of training samples per class Number of training samples per class Torralba, Murphy, Freeman. CVPR 2004. PAMI 2007
J. Shotton, A. Blake, R. Cipolla.Multi-Scale Categorical Object Recognition Using Contour Fragments. In IEEETrans. on PAMI, 30(7):1270-1281, July 2008. Efficiency Generalization Opelt, Pinz, Zisserman, CVPR 2006
Sharing patches Bart and Ullman, 2004 For a new class, use only features similar to features that where good for other classes: Proposed Dog  features
Some more references Baxter 1996 Caruana 1997 Schapire, Singer, 2000 Thrun, Pratt 1997 Krempp, Geman, Amit, 2002 E.L.Miller, Matsakis, Viola, 2000 Mahamud, Hebert, Lafferty, 2001 Fink et al. 2003, 2004 LeCun, Huang, Bottou, 2004 Holub, Welling, Perona, 2005 …
Modeling object relationships
The “guess what I am trying to detect” challenge The detector challenge: by looking at the output of a detector on a random setof images, can you guess which object is it trying to detect?
What object is detector trying to detect? The detector challenge: by looking at the output of a detector on a random setof images, can you guess which object is it trying to detect?
1. chair, 2. table, 3. road, 4. road, 5. table, 6. car, 7. keyboard.
The context challenge How far can you go without using an object detector?
1 2 What are the hidden objects?
What are the hidden objects? Chance ~ 1/30000
objects image p(O | I) ap(I|O) p(O) Object model Context model
p(O | I) ap(I|O) p(O) Object model Context model Full joint Scene model Aprox. joint
p(O | I) ap(I|O) p(O) Object model Context model Full joint Approx. joint Scene model
p(O | I) ap(I|O) p(O) Object model Context model Full joint Scene model Approx. joint p(O) = S Pp(Oi|S=s) p(S=s) i s office street
p(O | I) ap(I|O) p(O) Object model Context model Full joint Scene model Approx. joint
Pixel labeling using MRFs Enforce consistency between neighboring labels, and between labels and pixels Oi Carbonetto, de Freitas & Barnard, ECCV’04
Beyond nearest-neighbor grids Most MRF/CRF models assume nearest-neighbor graph topology This cannot capture long-distance correlations
Object-Object Relationships Use latent variables to induce long distance correlations between labels in a Conditional Random Field (CRF) He, Zemel & Carreira-Perpinan (04)
Object-Object Relationships [Kumar Hebert 2005]
Fink & Perona (NIPS 03) Use output of boosting from other objects at previous iterations as input into boosting for this iteration Object-Object Relationships
Objects in Context Building Building, boat, person Road Road Building, boat, motorbike Boat Water, sky Water Most consistent labeling according to object co-occurrences& locallabel probabilities. A. Rabinovich, A. Vedaldi, C. Galleguillos, E. Wiewiora and S. Belongie. Objects in Context. ICCV 2007

Contenu connexe

Similaire à Iccv2009 recognition and learning object categories p2 c02 - recognizing muliple objects in an image - sharing and context

Cvpr2007 object category recognition p3 - discriminative models
Cvpr2007 object category recognition   p3 - discriminative modelsCvpr2007 object category recognition   p3 - discriminative models
Cvpr2007 object category recognition p3 - discriminative modelszukun
 
NIPS2009: Understand Visual Scenes - Part 2
NIPS2009: Understand Visual Scenes - Part 2NIPS2009: Understand Visual Scenes - Part 2
NIPS2009: Understand Visual Scenes - Part 2zukun
 
Mit6870 orsu lecture2
Mit6870 orsu lecture2Mit6870 orsu lecture2
Mit6870 orsu lecture2zukun
 
Mit6870 orsu lecture11
Mit6870 orsu lecture11Mit6870 orsu lecture11
Mit6870 orsu lecture11zukun
 
How to Ground A Language for Legal Discourse In a Prototypical Perceptual Sem...
How to Ground A Language for Legal Discourse In a Prototypical Perceptual Sem...How to Ground A Language for Legal Discourse In a Prototypical Perceptual Sem...
How to Ground A Language for Legal Discourse In a Prototypical Perceptual Sem...L. Thorne McCarty
 
NIPS2009: Understand Visual Scenes - Part 1
NIPS2009: Understand Visual Scenes - Part 1NIPS2009: Understand Visual Scenes - Part 1
NIPS2009: Understand Visual Scenes - Part 1zukun
 
artificial intelligence
artificial intelligenceartificial intelligence
artificial intelligenceMayank Saxena
 
Xin Yao: "What can evolutionary computation do for you?"
Xin Yao: "What can evolutionary computation do for you?"Xin Yao: "What can evolutionary computation do for you?"
Xin Yao: "What can evolutionary computation do for you?"ieee_cis_cyprus
 
A Featherweight Approach to FOOL
A Featherweight Approach to FOOLA Featherweight Approach to FOOL
A Featherweight Approach to FOOLgreenwop
 
Iccv2009 recognition and learning object categories p0 c00 - introduction
Iccv2009 recognition and learning object categories   p0 c00 - introductionIccv2009 recognition and learning object categories   p0 c00 - introduction
Iccv2009 recognition and learning object categories p0 c00 - introductionzukun
 
Cvpr2007 object category recognition p2 - part based models
Cvpr2007 object category recognition   p2 - part based modelsCvpr2007 object category recognition   p2 - part based models
Cvpr2007 object category recognition p2 - part based modelszukun
 
The Role Of Ontology In Modern Expert Systems Dallas 2008
The Role Of Ontology In Modern Expert Systems   Dallas   2008The Role Of Ontology In Modern Expert Systems   Dallas   2008
The Role Of Ontology In Modern Expert Systems Dallas 2008Jason Morris
 
Using binary classifiers
Using binary classifiersUsing binary classifiers
Using binary classifiersbutest
 
David Barber - Deep Nets, Bayes and the story of AI
David Barber - Deep Nets, Bayes and the story of AIDavid Barber - Deep Nets, Bayes and the story of AI
David Barber - Deep Nets, Bayes and the story of AIBayes Nets meetup London
 
Natural Interaction for Augmented Reality Applications
Natural Interaction for Augmented Reality ApplicationsNatural Interaction for Augmented Reality Applications
Natural Interaction for Augmented Reality ApplicationsMark Billinghurst
 

Similaire à Iccv2009 recognition and learning object categories p2 c02 - recognizing muliple objects in an image - sharing and context (20)

Cvpr2007 object category recognition p3 - discriminative models
Cvpr2007 object category recognition   p3 - discriminative modelsCvpr2007 object category recognition   p3 - discriminative models
Cvpr2007 object category recognition p3 - discriminative models
 
NIPS2009: Understand Visual Scenes - Part 2
NIPS2009: Understand Visual Scenes - Part 2NIPS2009: Understand Visual Scenes - Part 2
NIPS2009: Understand Visual Scenes - Part 2
 
Tangible Interfaces
Tangible InterfacesTangible Interfaces
Tangible Interfaces
 
Mit6870 orsu lecture2
Mit6870 orsu lecture2Mit6870 orsu lecture2
Mit6870 orsu lecture2
 
Mit6870 orsu lecture11
Mit6870 orsu lecture11Mit6870 orsu lecture11
Mit6870 orsu lecture11
 
How to Ground A Language for Legal Discourse In a Prototypical Perceptual Sem...
How to Ground A Language for Legal Discourse In a Prototypical Perceptual Sem...How to Ground A Language for Legal Discourse In a Prototypical Perceptual Sem...
How to Ground A Language for Legal Discourse In a Prototypical Perceptual Sem...
 
NIPS2009: Understand Visual Scenes - Part 1
NIPS2009: Understand Visual Scenes - Part 1NIPS2009: Understand Visual Scenes - Part 1
NIPS2009: Understand Visual Scenes - Part 1
 
artificial intelligence
artificial intelligenceartificial intelligence
artificial intelligence
 
Xin Yao: "What can evolutionary computation do for you?"
Xin Yao: "What can evolutionary computation do for you?"Xin Yao: "What can evolutionary computation do for you?"
Xin Yao: "What can evolutionary computation do for you?"
 
nnml.ppt
nnml.pptnnml.ppt
nnml.ppt
 
Fame cvpr
Fame cvprFame cvpr
Fame cvpr
 
A Featherweight Approach to FOOL
A Featherweight Approach to FOOLA Featherweight Approach to FOOL
A Featherweight Approach to FOOL
 
The Tower of Knowledge A Generic System Architecture
The Tower of Knowledge A Generic System ArchitectureThe Tower of Knowledge A Generic System Architecture
The Tower of Knowledge A Generic System Architecture
 
Iccv2009 recognition and learning object categories p0 c00 - introduction
Iccv2009 recognition and learning object categories   p0 c00 - introductionIccv2009 recognition and learning object categories   p0 c00 - introduction
Iccv2009 recognition and learning object categories p0 c00 - introduction
 
Cvpr2007 object category recognition p2 - part based models
Cvpr2007 object category recognition   p2 - part based modelsCvpr2007 object category recognition   p2 - part based models
Cvpr2007 object category recognition p2 - part based models
 
The Role Of Ontology In Modern Expert Systems Dallas 2008
The Role Of Ontology In Modern Expert Systems   Dallas   2008The Role Of Ontology In Modern Expert Systems   Dallas   2008
The Role Of Ontology In Modern Expert Systems Dallas 2008
 
Using binary classifiers
Using binary classifiersUsing binary classifiers
Using binary classifiers
 
lecun-01.ppt
lecun-01.pptlecun-01.ppt
lecun-01.ppt
 
David Barber - Deep Nets, Bayes and the story of AI
David Barber - Deep Nets, Bayes and the story of AIDavid Barber - Deep Nets, Bayes and the story of AI
David Barber - Deep Nets, Bayes and the story of AI
 
Natural Interaction for Augmented Reality Applications
Natural Interaction for Augmented Reality ApplicationsNatural Interaction for Augmented Reality Applications
Natural Interaction for Augmented Reality Applications
 

Plus de zukun

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009zukun
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVzukun
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Informationzukun
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statisticszukun
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibrationzukun
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionzukun
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluationzukun
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-softwarezukun
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptorszukun
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectorszukun
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-introzukun
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video searchzukun
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video searchzukun
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video searchzukun
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learningzukun
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionzukun
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick startzukun
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysiszukun
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structureszukun
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities zukun
 

Plus de zukun (20)

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCV
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Information
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statistics
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibration
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer vision
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluation
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-software
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptors
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectors
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-intro
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video search
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video search
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video search
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learning
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer vision
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick start
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysis
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structures
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities
 

Dernier

Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024Janet Corral
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingTeacherCyreneCayanan
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 

Dernier (20)

Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 

Iccv2009 recognition and learning object categories p2 c02 - recognizing muliple objects in an image - sharing and context

  • 3.
  • 4.
  • 5. Context: objects appear in configurations
  • 9. How many object categories are there? ~10,000 to 30,000 Biederman 1987
  • 10. How many categories? Probably this question is not even specific enough to have an answer
  • 11. ? Which level of categorization is the right one? Car is an object composed of: a few doors, four wheels (not all visible at all times), a roof, front lights, windshield If you are thinking in buying a car, you might want to be a bit more specific about your categorization level.
  • 12. A bird An ostrich Entry-level categories(Jolicoeur, Gluck, Kosslyn 1984) Typical member of a basic-level category are categorized at the expected level Atypical members tend to be classified at a subordinate level.
  • 13. We do not need to recognize the exact category A new class can borrow information from similar categories
  • 14. So, where is computer vision? Well…
  • 15. Multiclass object detectionthe not so early days
  • 16.
  • 17. Generalizing Across Categories Can we transfer knowledge from one object category to another? Slide by Erik Sudderth
  • 18. Shared features Is learning the object class 1000 easier than learning the first? Can we transfer knowledge from one object to another? Are the shared properties interesting by themselves? …
  • 19. Multitask learning R. Caruana. Multitask Learning. ML 1997 “MTL improves generalization by leveraging the domain-specific information contained in the training signals of related tasks. It does this by training tasks in parallel while using a shared representation”. vs. Sejnowski & Rosenberg 1986; Hinton 1986; Le Cun et al. 1989; Suddarth & Kergosien 1990; Pratt et al. 1991; Sharkey & Sharkey 1992; …
  • 20.
  • 21. width of left door jamb
  • 22. width of right door jamb
  • 23. horizontal location of left edge of door
  • 24. horizontal location of right edge of door
  • 27. horizontal location of doorway center
  • 29.
  • 30. Convolutional Neural Network Le Cun et al, 98 Translation invariance is already built into the network The output neurons share all the intermediate levels
  • 31. Sharing transformations Miller, E., Matsakis, N., and Viola, P. (2000). Learning from one example through shared densities on transforms. In IEEE Computer Vision and Pattern Recognition. Transformations are shared and can be learnt from other tasks.
  • 32. Sharing in constellation models Pictorial StructuresFischler & Elschlager, IEEE Trans. Comp. 1973 SVM DetectorsHeisele, Poggio, et. al., NIPS 2001 Constellation ModelFei-Fei, Fergus, Perona, ICCV 2003 Model-Guided SegmentationMori, Ren, Efros, & Malik, CVPR 2004
  • 33. Reusable Parts Krempp, Geman, & Amit “Sequential Learning of Reusable Parts for Object Detection”. TR 2002 Goal: Look for a vocabulary of edges that reduces the number of features. Examples of reused parts Number of features Number of classes
  • 34. Specific feature pedestrian chair Traffic light sign face Background class Non-shared feature: this feature is too specific to faces.
  • 36.
  • 37. 50 training samples/class 29 object classes 2000 entries in the dictionary Results averaged on 20 runs Error bars = 80% interval Class-specific features Shared features Torralba, Murphy, Freeman. CVPR 2004. PAMI 2007
  • 38. 12 viewpoints 12 unrelated object classes Generalization as a function of object similarities K = 2.1 K = 4.8 Area under ROC Area under ROC Number of training samples per class Number of training samples per class Torralba, Murphy, Freeman. CVPR 2004. PAMI 2007
  • 39. J. Shotton, A. Blake, R. Cipolla.Multi-Scale Categorical Object Recognition Using Contour Fragments. In IEEETrans. on PAMI, 30(7):1270-1281, July 2008. Efficiency Generalization Opelt, Pinz, Zisserman, CVPR 2006
  • 40. Sharing patches Bart and Ullman, 2004 For a new class, use only features similar to features that where good for other classes: Proposed Dog features
  • 41. Some more references Baxter 1996 Caruana 1997 Schapire, Singer, 2000 Thrun, Pratt 1997 Krempp, Geman, Amit, 2002 E.L.Miller, Matsakis, Viola, 2000 Mahamud, Hebert, Lafferty, 2001 Fink et al. 2003, 2004 LeCun, Huang, Bottou, 2004 Holub, Welling, Perona, 2005 …
  • 43. The “guess what I am trying to detect” challenge The detector challenge: by looking at the output of a detector on a random setof images, can you guess which object is it trying to detect?
  • 44. What object is detector trying to detect? The detector challenge: by looking at the output of a detector on a random setof images, can you guess which object is it trying to detect?
  • 45. 1. chair, 2. table, 3. road, 4. road, 5. table, 6. car, 7. keyboard.
  • 46. The context challenge How far can you go without using an object detector?
  • 47. 1 2 What are the hidden objects?
  • 48. What are the hidden objects? Chance ~ 1/30000
  • 49. objects image p(O | I) ap(I|O) p(O) Object model Context model
  • 50. p(O | I) ap(I|O) p(O) Object model Context model Full joint Scene model Aprox. joint
  • 51. p(O | I) ap(I|O) p(O) Object model Context model Full joint Approx. joint Scene model
  • 52. p(O | I) ap(I|O) p(O) Object model Context model Full joint Scene model Approx. joint p(O) = S Pp(Oi|S=s) p(S=s) i s office street
  • 53. p(O | I) ap(I|O) p(O) Object model Context model Full joint Scene model Approx. joint
  • 54. Pixel labeling using MRFs Enforce consistency between neighboring labels, and between labels and pixels Oi Carbonetto, de Freitas & Barnard, ECCV’04
  • 55. Beyond nearest-neighbor grids Most MRF/CRF models assume nearest-neighbor graph topology This cannot capture long-distance correlations
  • 56. Object-Object Relationships Use latent variables to induce long distance correlations between labels in a Conditional Random Field (CRF) He, Zemel & Carreira-Perpinan (04)
  • 58. Fink & Perona (NIPS 03) Use output of boosting from other objects at previous iterations as input into boosting for this iteration Object-Object Relationships
  • 59. Objects in Context Building Building, boat, person Road Road Building, boat, motorbike Boat Water, sky Water Most consistent labeling according to object co-occurrences& locallabel probabilities. A. Rabinovich, A. Vedaldi, C. Galleguillos, E. Wiewiora and S. Belongie. Objects in Context. ICCV 2007
  • 60. 52 Objects in Context:Contextual Refinement Building Road Boat Independent segment classification Water Mean interaction of all label pairs Φ(i,j) is basically the observed label co-occurrences in training set. Contextual model based on co-occurrences Try to find the most consistent labeling with high posterior probability and high mean pairwise interaction. Use CRF for this purpose. Slide by GokberkCinbis
  • 61. Detecting difficult objects Maybe there is a mouse Office Start recognizing the scene Torralba, Murphy, Freeman. NIPS 2004.
  • 62. Detecting difficult objects Detect first simple objects (reliable detectors) that provide strong contextual constraints to the target (screen -> keyboard -> mouse) Torralba, Murphy, Freeman. NIPS 2004.
  • 63. Detecting difficult objects Detect first simple objects (reliable detectors) that provide strong contextual constraints to the target (screen -> keyboard -> mouse) Torralba, Murphy, Freeman. NIPS 2004.
  • 64. BRF for car detection: topology Torralba Murphy Freeman (2004)
  • 65. BRF for car detection: results Torralba Murphy Freeman (2004)
  • 66. A “car” out of context is less of a car From image Thresholded beliefs From detectors Road Car Building b F G b F G b F G
  • 67. Contextual object relationships Carbonetto, de Freitas & Barnard (2004) Kumar, Hebert (2005) Torralba Murphy Freeman (2004) E. Sudderth et al (2005) Fink & Perona (2003)