SlideShare une entreprise Scribd logo
1  sur  22
What is Object Recognition?
DescriptionsSensory data
Color, texture, shape, “toy”, “stuffed Pooh”, “a
motion, size, weight, frontal, close-up shot of
smell, touch, sound, … stuffed Pooh”, “ToysRus
#345812”, …
One of the fundamental problems of computer vision:
+
Apple
8
Applications of Object Recognition
Some images from http://www.cs.utexas.edu/~grauman/research/research.html
 What is Object Recognition?
› Segmentation/Figure-Ground Separation: prerequisite or
consequence
› Labeling an object [The focus of most studies]
› Extracting a parametric description as well
 Object Recognition versus Scene Analysis
› An object may be part of a scene or
› Itself be recognized as a “scene”
 What is Object Recognition for?
› As a context for recognizing something else (locating a house
by the tree in the garden)
› As a target for action (climb that tree)
 We bind features of objects into objects (feature binding)
 We bind objects in space into some arrangement (space
binding)
 We perceive the scene.
 Feature binding = what/how stream
 Space binding = where stream
 Double role of spatial relationships:
› To relate different portions of an object or scene as a guide to
recognition
› Augmented by other “how” parameters, to guide our behavior
with respect to the observed scene.
 The binding problem: binding different features
(color, orientation, etc) to yield a unitary percept.
(see next slide)
 Bottom-up vs. top-down processing: how
 much is assumed top-down vs. extracted
 from the image?
 Perception vs. recognition vs. categorization: seeing an
object vs. seeing is as something. Matching views of
known objects to memory vs. matching a novel object to
object categories in memory.
 Viewpoint invariance: a major issue is to recognize objects
irrespective of the viewpoint from which we see them.
 1) pixel-based (light intensity)
 2) primal sketch (discontinuities in intensity)
 3) 2 ½ D sketch (oriented surfaces, relative
depth between surfaces)
 4) 3D model (shapes, spatial relationships,
volumes)
 Major problem for recognition.
Biederman & Gerhardstein, 1994:
 We can recognize two views of an unfamiliar object
as being the same object.
 Thus, viewpoint invariance cannot only rely on
matching views to memory.
 Direct Template Matching:
 Processing hierarchy yields activation of
view-tuned units.
 A collection of view-tuned units is
associated with one object.
 View tuned units are built from V4-like
units,
 using sets of weights which differ for
each object.
 Hierarchical Template Matching:
 Image passed through layers of units with progressively more
complex features at progressively less specific locations.
 Hierarchical in that features at one stage are built from
features at earlier stages.
 e.g., Fukushima & Miyake (1982)’s Neocognitron:
 Several processing layers, comprising simple (S) and complex
(C) cells.
 S-cells in one layer respond to conjunctions of C-cells in previous
layer.
 C-cells in one layer are excited by small neighborhoods of S-
cells.
 Transform & Match:
 First take care of rotation, translation, scale, etc.
invariances.
 Then recognize based on standardized pixel
representation of objects.
 e.g., Olshausen et al, 1993,
dynamic routing model
 Template match: e.g., with
an associative memory based on
a Hopfield network.
 GEONS: geometric elements of which all objects are
composed (cylinders, cones, etc). On the order of 30
different shapes.
 Skips 2 ½ D sketch: Geons are directly recognized
from edges, based on their nonaccidental properties
(i.e., 3D features that are usually preserved by the
projective imaging process).
 They are sufficiently different from each other
to be easily discriminated
 They are view-invariant (look identical from
most viewpoints)
 They are robust to noise (can be identified
even with parts of image missing)
Edelman, 1997
A. Structural description not
enough, also need metric info
B. Difficult to extract geons
from real images
C. Ambiguity in the structu-
ral description: most often
we have several candidates
D. For some objects,
deriving a structural repre-
sentation can be difficult
 Structural approach to object recognition:
 Biederman, 1987:
 Complex objects are composed so simpler pieces
 We can recognize a novel/unfamiliar object by parsing it
in terms of its component pieces, then comparing the
assemblage of pieces to those of known objects.
Biederman et al. (1991 – )
“geons”: units of
3D geometric structure
37
Object Recognition in Practice
Commercial object recognition
Currently a $4 billion/year industry for inspection and
assembly
Almost entirely based on template matching
Upcoming applications
Mobile robots, toys, user interfaces
Location recognition
Digital camera panoramas, 3D scene modeling
courtesy of David Lowe,
website and CVPR 2003 Tutorial

Contenu connexe

Tendances

FACE RECOGNITION TECHNOLOGY
FACE RECOGNITION TECHNOLOGYFACE RECOGNITION TECHNOLOGY
FACE RECOGNITION TECHNOLOGYJASHU JASWANTH
 
Retina recognition biometrics drishtysharma
Retina recognition biometrics drishtysharmaRetina recognition biometrics drishtysharma
Retina recognition biometrics drishtysharmaDrishty Sharma
 
Introduction to Object recognition
Introduction to Object recognitionIntroduction to Object recognition
Introduction to Object recognitionAshiq Ullah
 
Facial recognition technology by vaibhav
Facial recognition technology by vaibhavFacial recognition technology by vaibhav
Facial recognition technology by vaibhavVaibhav P
 
Review Paper on Image Processing Techniques
Review Paper on Image Processing TechniquesReview Paper on Image Processing Techniques
Review Paper on Image Processing TechniquesIJSRD
 
Image Acquisition
Image AcquisitionImage Acquisition
Image Acquisitionshail288
 
Fundamentals steps in Digital Image processing
Fundamentals steps in Digital Image processingFundamentals steps in Digital Image processing
Fundamentals steps in Digital Image processingKarthicaMarasamy
 
Image feature extraction
Image feature extractionImage feature extraction
Image feature extractionRushin Shah
 
Computer vision introduction
Computer vision  introduction Computer vision  introduction
Computer vision introduction Wael Badawy
 
Hough Transform By Md.Nazmul Islam
Hough Transform By Md.Nazmul IslamHough Transform By Md.Nazmul Islam
Hough Transform By Md.Nazmul IslamNazmul Islam
 
Content Based Image Retrieval
Content Based Image RetrievalContent Based Image Retrieval
Content Based Image RetrievalSOURAV KAR
 

Tendances (20)

FACE RECOGNITION TECHNOLOGY
FACE RECOGNITION TECHNOLOGYFACE RECOGNITION TECHNOLOGY
FACE RECOGNITION TECHNOLOGY
 
Computer Vision
Computer VisionComputer Vision
Computer Vision
 
Retina recognition biometrics drishtysharma
Retina recognition biometrics drishtysharmaRetina recognition biometrics drishtysharma
Retina recognition biometrics drishtysharma
 
Introduction to Object recognition
Introduction to Object recognitionIntroduction to Object recognition
Introduction to Object recognition
 
Facial recognition technology by vaibhav
Facial recognition technology by vaibhavFacial recognition technology by vaibhav
Facial recognition technology by vaibhav
 
Object detection.pptx
Object detection.pptxObject detection.pptx
Object detection.pptx
 
Review Paper on Image Processing Techniques
Review Paper on Image Processing TechniquesReview Paper on Image Processing Techniques
Review Paper on Image Processing Techniques
 
face detection
face detectionface detection
face detection
 
Histogram Equalization
Histogram EqualizationHistogram Equalization
Histogram Equalization
 
Digital Image Processing
Digital Image ProcessingDigital Image Processing
Digital Image Processing
 
Computer Vision
Computer VisionComputer Vision
Computer Vision
 
Hog
HogHog
Hog
 
Image Acquisition
Image AcquisitionImage Acquisition
Image Acquisition
 
Fundamentals steps in Digital Image processing
Fundamentals steps in Digital Image processingFundamentals steps in Digital Image processing
Fundamentals steps in Digital Image processing
 
Image feature extraction
Image feature extractionImage feature extraction
Image feature extraction
 
Computer vision introduction
Computer vision  introduction Computer vision  introduction
Computer vision introduction
 
Image recognition
Image recognitionImage recognition
Image recognition
 
Hough Transform By Md.Nazmul Islam
Hough Transform By Md.Nazmul IslamHough Transform By Md.Nazmul Islam
Hough Transform By Md.Nazmul Islam
 
Content Based Image Retrieval
Content Based Image RetrievalContent Based Image Retrieval
Content Based Image Retrieval
 
Moving object detection
Moving object detectionMoving object detection
Moving object detection
 

En vedette

Advanced Computer Architecture chapter 5 problem solutions
Advanced Computer  Architecture  chapter 5 problem solutionsAdvanced Computer  Architecture  chapter 5 problem solutions
Advanced Computer Architecture chapter 5 problem solutionsJoe Christensen
 
Object detection
Object detectionObject detection
Object detectionSomesh Vyas
 
Object Detection and Recognition
Object Detection and Recognition Object Detection and Recognition
Object Detection and Recognition Intel Nervana
 
Lecture 3
Lecture 3Lecture 3
Lecture 3Mr SMAK
 
Computer architecture kai hwang
Computer architecture   kai hwangComputer architecture   kai hwang
Computer architecture kai hwangSumedha
 

En vedette (6)

Advanced Computer Architecture chapter 5 problem solutions
Advanced Computer  Architecture  chapter 5 problem solutionsAdvanced Computer  Architecture  chapter 5 problem solutions
Advanced Computer Architecture chapter 5 problem solutions
 
Object detection
Object detectionObject detection
Object detection
 
Object Detection and Recognition
Object Detection and Recognition Object Detection and Recognition
Object Detection and Recognition
 
Lecture 3
Lecture 3Lecture 3
Lecture 3
 
Computer architecture kai hwang
Computer architecture   kai hwangComputer architecture   kai hwang
Computer architecture kai hwang
 
Introduction to pattern recognition
Introduction to pattern recognitionIntroduction to pattern recognition
Introduction to pattern recognition
 

Similaire à Object recognition

MIT6.870 Grounding Object Recognition and Scene Understanding: lecture 1
MIT6.870 Grounding Object Recognition and Scene Understanding: lecture 1MIT6.870 Grounding Object Recognition and Scene Understanding: lecture 1
MIT6.870 Grounding Object Recognition and Scene Understanding: lecture 1zukun
 
Iccv2009 recognition and learning object categories p0 c00 - introduction
Iccv2009 recognition and learning object categories   p0 c00 - introductionIccv2009 recognition and learning object categories   p0 c00 - introduction
Iccv2009 recognition and learning object categories p0 c00 - introductionzukun
 
Mit6870 orsu lecture2
Mit6870 orsu lecture2Mit6870 orsu lecture2
Mit6870 orsu lecture2zukun
 
NIPS2009: Understand Visual Scenes - Part 1
NIPS2009: Understand Visual Scenes - Part 1NIPS2009: Understand Visual Scenes - Part 1
NIPS2009: Understand Visual Scenes - Part 1zukun
 
Integrated Contextual Representation For Objects’
Integrated Contextual Representation For Objects’Integrated Contextual Representation For Objects’
Integrated Contextual Representation For Objects’HAMlab
 
Object Detection and tracking in Video Sequences
Object Detection and tracking in Video SequencesObject Detection and tracking in Video Sequences
Object Detection and tracking in Video SequencesIDES Editor
 
Mit6870 orsu lecture11
Mit6870 orsu lecture11Mit6870 orsu lecture11
Mit6870 orsu lecture11zukun
 
Shadow Detection and Removal in Still Images by using Hue Properties of Color...
Shadow Detection and Removal in Still Images by using Hue Properties of Color...Shadow Detection and Removal in Still Images by using Hue Properties of Color...
Shadow Detection and Removal in Still Images by using Hue Properties of Color...ijsrd.com
 
Introduction vision
Introduction visionIntroduction vision
Introduction visionAshish Kumar
 
Morphology in graphics and image processing
Morphology in graphics and image processingMorphology in graphics and image processing
Morphology in graphics and image processingDheeban Smart
 

Similaire à Object recognition (20)

MIT6.870 Grounding Object Recognition and Scene Understanding: lecture 1
MIT6.870 Grounding Object Recognition and Scene Understanding: lecture 1MIT6.870 Grounding Object Recognition and Scene Understanding: lecture 1
MIT6.870 Grounding Object Recognition and Scene Understanding: lecture 1
 
Iccv2009 recognition and learning object categories p0 c00 - introduction
Iccv2009 recognition and learning object categories   p0 c00 - introductionIccv2009 recognition and learning object categories   p0 c00 - introduction
Iccv2009 recognition and learning object categories p0 c00 - introduction
 
Mit6870 orsu lecture2
Mit6870 orsu lecture2Mit6870 orsu lecture2
Mit6870 orsu lecture2
 
NIPS2009: Understand Visual Scenes - Part 1
NIPS2009: Understand Visual Scenes - Part 1NIPS2009: Understand Visual Scenes - Part 1
NIPS2009: Understand Visual Scenes - Part 1
 
perception
perceptionperception
perception
 
Perception
PerceptionPerception
Perception
 
Integrated Contextual Representation For Objects’
Integrated Contextual Representation For Objects’Integrated Contextual Representation For Objects’
Integrated Contextual Representation For Objects’
 
Promising avenues for interdisciplinary research in vision
Promising avenues for interdisciplinary research in visionPromising avenues for interdisciplinary research in vision
Promising avenues for interdisciplinary research in vision
 
Perception
PerceptionPerception
Perception
 
Chapter4
Chapter4Chapter4
Chapter4
 
Material design topic 2
Material design topic 2Material design topic 2
Material design topic 2
 
Object Detection and tracking in Video Sequences
Object Detection and tracking in Video SequencesObject Detection and tracking in Video Sequences
Object Detection and tracking in Video Sequences
 
Handling Uncertainty under Spatial Feature Extraction through Probabilistic S...
Handling Uncertainty under Spatial Feature Extraction through Probabilistic S...Handling Uncertainty under Spatial Feature Extraction through Probabilistic S...
Handling Uncertainty under Spatial Feature Extraction through Probabilistic S...
 
Mit6870 orsu lecture11
Mit6870 orsu lecture11Mit6870 orsu lecture11
Mit6870 orsu lecture11
 
160713
160713160713
160713
 
Shadow Detection and Removal in Still Images by using Hue Properties of Color...
Shadow Detection and Removal in Still Images by using Hue Properties of Color...Shadow Detection and Removal in Still Images by using Hue Properties of Color...
Shadow Detection and Removal in Still Images by using Hue Properties of Color...
 
Graphics
GraphicsGraphics
Graphics
 
Introduction vision
Introduction visionIntroduction vision
Introduction vision
 
Morphology in graphics and image processing
Morphology in graphics and image processingMorphology in graphics and image processing
Morphology in graphics and image processing
 
Pc Seminar Jordi
Pc Seminar JordiPc Seminar Jordi
Pc Seminar Jordi
 

Dernier

Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSCeline George
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseAnaAcapella
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxDr. Sarita Anand
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsKarakKing
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxVishalSingh1417
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdfssuserdda66b
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and ModificationsMJDuyan
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...pradhanghanshyam7136
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibitjbellavia9
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 

Dernier (20)

Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 

Object recognition

  • 1.
  • 2. What is Object Recognition? DescriptionsSensory data Color, texture, shape, “toy”, “stuffed Pooh”, “a motion, size, weight, frontal, close-up shot of smell, touch, sound, … stuffed Pooh”, “ToysRus #345812”, … One of the fundamental problems of computer vision: + Apple
  • 3. 8 Applications of Object Recognition Some images from http://www.cs.utexas.edu/~grauman/research/research.html
  • 4.  What is Object Recognition? › Segmentation/Figure-Ground Separation: prerequisite or consequence › Labeling an object [The focus of most studies] › Extracting a parametric description as well  Object Recognition versus Scene Analysis › An object may be part of a scene or › Itself be recognized as a “scene”  What is Object Recognition for? › As a context for recognizing something else (locating a house by the tree in the garden) › As a target for action (climb that tree)
  • 5.  We bind features of objects into objects (feature binding)  We bind objects in space into some arrangement (space binding)  We perceive the scene.  Feature binding = what/how stream  Space binding = where stream  Double role of spatial relationships: › To relate different portions of an object or scene as a guide to recognition › Augmented by other “how” parameters, to guide our behavior with respect to the observed scene.
  • 6.  The binding problem: binding different features (color, orientation, etc) to yield a unitary percept. (see next slide)  Bottom-up vs. top-down processing: how  much is assumed top-down vs. extracted  from the image?  Perception vs. recognition vs. categorization: seeing an object vs. seeing is as something. Matching views of known objects to memory vs. matching a novel object to object categories in memory.  Viewpoint invariance: a major issue is to recognize objects irrespective of the viewpoint from which we see them.
  • 7.  1) pixel-based (light intensity)  2) primal sketch (discontinuities in intensity)  3) 2 ½ D sketch (oriented surfaces, relative depth between surfaces)  4) 3D model (shapes, spatial relationships, volumes)
  • 8.
  • 9.  Major problem for recognition. Biederman & Gerhardstein, 1994:  We can recognize two views of an unfamiliar object as being the same object.  Thus, viewpoint invariance cannot only rely on matching views to memory.
  • 10.  Direct Template Matching:  Processing hierarchy yields activation of view-tuned units.  A collection of view-tuned units is associated with one object.  View tuned units are built from V4-like units,  using sets of weights which differ for each object.
  • 11.
  • 12.  Hierarchical Template Matching:  Image passed through layers of units with progressively more complex features at progressively less specific locations.  Hierarchical in that features at one stage are built from features at earlier stages.  e.g., Fukushima & Miyake (1982)’s Neocognitron:  Several processing layers, comprising simple (S) and complex (C) cells.  S-cells in one layer respond to conjunctions of C-cells in previous layer.  C-cells in one layer are excited by small neighborhoods of S- cells.
  • 13.
  • 14.  Transform & Match:  First take care of rotation, translation, scale, etc. invariances.  Then recognize based on standardized pixel representation of objects.  e.g., Olshausen et al, 1993, dynamic routing model  Template match: e.g., with an associative memory based on a Hopfield network.
  • 15.
  • 16.  GEONS: geometric elements of which all objects are composed (cylinders, cones, etc). On the order of 30 different shapes.  Skips 2 ½ D sketch: Geons are directly recognized from edges, based on their nonaccidental properties (i.e., 3D features that are usually preserved by the projective imaging process).
  • 17.  They are sufficiently different from each other to be easily discriminated  They are view-invariant (look identical from most viewpoints)  They are robust to noise (can be identified even with parts of image missing)
  • 18. Edelman, 1997 A. Structural description not enough, also need metric info B. Difficult to extract geons from real images C. Ambiguity in the structu- ral description: most often we have several candidates D. For some objects, deriving a structural repre- sentation can be difficult
  • 19.  Structural approach to object recognition:  Biederman, 1987:  Complex objects are composed so simpler pieces  We can recognize a novel/unfamiliar object by parsing it in terms of its component pieces, then comparing the assemblage of pieces to those of known objects.
  • 20. Biederman et al. (1991 – ) “geons”: units of 3D geometric structure
  • 21.
  • 22. 37 Object Recognition in Practice Commercial object recognition Currently a $4 billion/year industry for inspection and assembly Almost entirely based on template matching Upcoming applications Mobile robots, toys, user interfaces Location recognition Digital camera panoramas, 3D scene modeling courtesy of David Lowe, website and CVPR 2003 Tutorial