SlideShare une entreprise Scribd logo
1  sur  72
Computer vision:
models, learning and inference
             Chapter 17
           Models for shape



      Please send errata to s.prince@cs.ucl.ac.uk
Structure
• Snakes
• Template models
• Statistical shape models
   • 3D shape models
   • Models for shape and appearance
   • Non-linear models
• Articulated models
• Applications

      Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   2
Motivation: fitting shape model




    Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   3
What is shape?
• Kendall (1984) – Shape “is all the geometrical
  information that remains when location scale
  and rotational effects are filtered out from an
  object”
• In other words, it is whatever is invariant to a
  similarity transformation



         Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   4
Representing Shape
• Algebraic modelling
  – Line:

  – Conic:



  – More complex objects? Not practical for spine.



            Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   5
Landmark Points




• Landmark points can be thought of as discrete samples
  from underlying contour
   – Ordered (single continuous contour)
   – Ordered with wrapping (closed contour)
   – More complex organisation (collection of closed and open)
           Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   6
Snakes
• Provide only weak information: contour is smooth
• Represent contour as N 2D landmark points

• We will construct terms for
   – The likelihood            of observing an image x given
     landmark points W. Encourages landmark points to lie on
     border in the image
   – The prior           of the landmark point. Encourages the
     contours to be smooth.


           Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   7
Snakes




Initialise contour and let it evolve until it grabs onto an object
Crawls across the image – hence called snake or active contour
           Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   8
Snake likelihood




Has correct properties (probability high at edges), but flat in
regions distant from the contour. Not good for optimisation.
          Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   9
Snake likelihood (2)




Compute edges (here using Canny) and then compute distance
  image – this varies smoothly with distance from the image



          Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   10
Prior
• Encourages smoothness



  – Encourages equal spacing



  – Encourages low curvature


        Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   11
Inference
• Maximise posterior probability




• No closed form solution
• Must use non-linear optimisation method
• Number of unknowns = 2N

         Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   12
Snakes




        Notice failure at nose – falls between points.
A better model would sample image between landmark points
         Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   13
Inference
• Maximise posterior probability



• Very slow. Can potentially speed it up by changing
  spacing element of prior:



• Take advantage of limited connectivity of associated
  graphical model

           Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   14
Relationships between models




   Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   15
Structure
• Snakes
• Template models
• Statistical shape models
   • 3D shape models
   • Models for shape and appearance
   • Non-linear models
• Articulated models
• Applications

      Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   16
Shape template model
•   Shape based on landmark points
•   These points are assumed known
•   Mapped into the image by transformation
•   What is left is to find parameters of transformation
•   Likelihood is based on distance transform:



• No prior on parameters (but could do)




             Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   17
Shape template model




Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   18
Inference
• Use maximum likelihood approach




• No closed form solution
• Must use non-linear optimization
• Use chain rule to compute derivatives



          Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   19
Iterative closest points
                                                 • Find nearest edge point to
                                                   each landmark point

                                                 • Compute transformation
                                                   in closed form

                                                 • Repeat




Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   20
Structure
• Snakes
• Template models
• Statistical shape models
   • 3D shape models
   • Models for shape and appearance
   • Non-linear models
• Articulated models
• Applications

      Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   21
Statistical shape models
• Also called
  – Point distribution models
  – Active shape models (as they adapt to the image)
• Likelihood:



• Prior:

           Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   22
Learning
• Usually, we are given the examples after they
  have been transformed


• Before we can learn the normal distribution we
  must compute the inverse transformation


• Procedure is called generalized Procrustes
  analysis

         Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   23
Generalized Procrustes analysis




Training data                Before alignment                              After alignment



          Computer vision: models, learning and inference. ©2011 Simon J.D. Prince       24
Generalized Procrustes analysis
Alternately
  – Update all transformations to map landmark
    points to current mean



  – Update mean to be average of transformed values



Then learn mean and variance parameters.
         Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   25
Inference
• Map inference:



• No closed form solution
• Use non-linear optimisation
• Or use ICP approach
• However, many parameters, and not clear they
  are all needed
• more efficient to use subspace model
         Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   26
Face model




Three samples from learnt model for faces




   Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   27
Subspace shape model
• Generate data from model:

  – is the mean shape
  – the matrix                         contains K
    basis functions in it columns
  –    is normal noise with covariance
• Can alternatively write


         Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   28
Approximating with subspace

Subspace model




Can approximate an vector w with a weighted sum of
the basis functions



Surprising how well this works even with a small
number of basis functions
        Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   29
Subspace shape model




Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   30
Probabilistic PCA
Generative eq:

Probabilistic version:



Add prior:

Density:




             Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   31
Learning PPCA
Learn parameters                    and            from data                        ,
where                                              .

Learn mean:

Then set                                                                            and
compute eigen-decomposition

Choose parameters




         Computer vision: models, learning and inference. ©2011 Simon J.D. Prince         32
Properties of basis functions
Learning of parameters based on eigen-decomposition:



Parameters




Notice that:
   • Basis functions in                   are orthogonal
   • Basis functions in                   are ordered

         Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   33
Learnt hand model




Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   34
Learnt spine model




Mean          Manipulating first                                  Manipulating second
             principal component                                  principal component

       Computer vision: models, learning and inference. ©2011 Simon J.D. Prince     35
Inference
To fit model to an image:




                                             likelihood                                prior

ICP Approach:

    •   Find closest points to current prediction
    •   Update weightings h
    •   Find closest points to current prediction
    •   Update transformation parameters y

            Computer vision: models, learning and inference. ©2011 Simon J.D. Prince           36
Inference
1. Update weightings h




If transformation parameters can be represented as a matrix A




2. Update transformation parameters y
    • Using one of closed form solutions
           Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   37
Fitting model




Much better to use statistical classifier instead of just
distance from edges


           Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   38
Structure
• Snakes
• Template models
• Statistical shape models
   • 3D shape models
   • Models for shape and appearance
   • Non-linear models
• Articulated models
• Applications

      Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   39
3D shape models




Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   40
Structure
• Snakes
• Template models
• Statistical shape models
   • 3D shape models
   • Models for shape and appearance
   • Non-linear models
• Articulated models
• Applications

      Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   41
Statistical models for shape and
            appearance




   Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   42
Statistical models for shape and
                appearance




1. We draw a hidden variable from a prior




          Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   43
Statistical models for shape and
                appearance




1. We draw a hidden variable h from a prior
2. We draw landmark points w from a subspace model
3. We draw image intensities x.
   • Generate image intensities in standard template shape
   • Transform the landmark points (parameters y)
   • Transform the image to landmark points
   • Add noise
         Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   44
Shape and appearance model




Shape model                    Intensity model                                     Shape and
                                                                                    intensity
        Computer vision: models, learning and inference. ©2011 Simon J.D. Prince                45
Warping images
                                                    Piecewise affine
                                                    transformation

                                                    Triangulate image points
                                                    using Delaunay
                                                    triangulation.

                                                    Image in each triangle is
                                                    warped by an affine
                                                    transformation.




Computer vision: models, learning and inference. ©2011 Simon J.D. Prince    46
Learning



Goal is to learn parameters :

Problem
   • We are given the transformed landmark points
   • We are given the warped and transformed images

Solution
    • Use Procrustes analysis to un-transform landmark points
    • Warp observed images to template shape
            Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   47
Learning
Now have aligned landmark points w, and aligned images
x, we can learn the simpler model:




Can write generative equation as:




Has the form of a factor analyzer
          Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   48
Inference
Likelihood of observed intensities



To fit the model use maximum likelihood




This has the least squares form



              Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   49
Inference
This has the least squares form



Use Gauss-Newton method or similar



Where the Jacobian J is a matrix with elements




             Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   50
Statistical models for shape and
            appearance




   Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   51
Structure
• Snakes
• Template models
• Statistical shape models
   • 3D shape models
   • Models for shape and appearance
   • Non-linear models
• Articulated models
• Applications

      Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   52
Non-linear models
• The shape and appearance models that we have
  studied so far are based on the normal
  distribution
• But more complex shapes might need more
  complex distributions
  – Could use mixture of PPCAs or similar
  – Or use a non-linear subspace model
• We will investigate the Gaussian process latent
  variable model (GPLVM)
• To understand the GPLVM, first think about PPCA
  in terms of regression.
         Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   53
PPCA as regression
PPCA model:




•   First term in last equation looks like regression
•   Predicts w for a given h
•   Considering each dimension separately, get linear regression


            Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   54
PPCA as regression




Joint probability                  Regress 1st                                       Regress 2nd
  distribution                  dimension against                                 dimension against
                                 hidden variable                                   hidden variable
            Computer vision: models, learning and inference. ©2011 Simon J.D. Prince            55
Gaussian process latent variable model
• Idea: replace the linear regression model with
  a non-linear regression model
• As name suggests, use Gaussian process
  regression
• Implications
  – Can now marginalize over parameters m and F
  – Can no longer marginalize over variable h


         Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   56
GPLVM as regression




Joint probability                  Regress 1st                                       Regress 2nd
  distribution                  dimension against                                 dimension against
                                 hidden variable                                   hidden variable
            Computer vision: models, learning and inference. ©2011 Simon J.D. Prince            57
Learning
• In learning the Gaussian process regression
  model , we optimized the marginal likelihood
  of the data with respect to the parameter s2.




         Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   58
Learning
• In learning the GPLVM, we still optimized the
  marginal likelihood of the data with respect to the
  parameter s2, but must also find the values of the
  hidden variables that we regress against .




• Use non-linear optimization technique
          Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   59
Inference
• To predict a new value of the data using a hidden variable




• To compute density



• Cannot be computed in closed from
            Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   60
GPLVM Shape models




Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   61
Structure
• Snakes
• Template models
• Statistical shape models
   • 3D shape models
   • Models for shape and appearance
   • Non-linear models
• Articulated models
• Applications

      Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   62
Articulated Models
• Transformations of parts applied one after each other

• Known as a kinematic chain

• e.g. Foot transform is relative to lower leg, which is
  relative to upper leg etc.

• One root transformation that describes the position of
  model relative to camera




           Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   63
Articulated Models




Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   64
Articulated Models
                       • One possible model for an object part
                         is a quadric
                       • Represents
                         spheres, ellipsoids, cylinders, pairs of
                         planes and others
                       • Make truncated cylinders by clipping
                         with cylinder with pair of planes
                       • Projects to conic in the image




Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   65
Structure
• Snakes
• Template models
• Statistical shape models
   • 3D shape models
   • Models for shape and appearance
   • Non-linear models
• Articulated models
• Applications

      Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   66
3D morphable models




Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   67
3D morphable models




Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   68
3D morphable models




Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   69
3D body model




Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   70
3D body model applications




  Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   71
Conclusions
• Introduced a series of models for shape

• Assume different forms of prior knowledge
   • Contour is smooth (snakes)
   • Shape is known, but not position (template)
   • Shape class is known (statistical models)
   • Structure of shape known (articulated model)

• Relates to other models
   • Based on subspace models
   • Tracked using temporal models
      Computer vision: models, learning and inference. ©2011 Simon J.D. Prince   72

Contenu connexe

Similaire à 17 cv mil_models_for_shape

09 cv mil_classification
09 cv mil_classification09 cv mil_classification
09 cv mil_classificationzukun
 
11 cv mil_models_for_chains_and_trees
11 cv mil_models_for_chains_and_trees11 cv mil_models_for_chains_and_trees
11 cv mil_models_for_chains_and_treeszukun
 
13 cv mil_preprocessing
13 cv mil_preprocessing13 cv mil_preprocessing
13 cv mil_preprocessingzukun
 
20 cv mil_models_for_words
20 cv mil_models_for_words20 cv mil_models_for_words
20 cv mil_models_for_wordszukun
 
18 cv mil_style_and_identity
18 cv mil_style_and_identity18 cv mil_style_and_identity
18 cv mil_style_and_identityzukun
 
07 cv mil_modeling_complex_densities
07 cv mil_modeling_complex_densities07 cv mil_modeling_complex_densities
07 cv mil_modeling_complex_densitieszukun
 
04 cv mil_fitting_probability_models
04 cv mil_fitting_probability_models04 cv mil_fitting_probability_models
04 cv mil_fitting_probability_modelszukun
 
03 cv mil_probability_distributions
03 cv mil_probability_distributions03 cv mil_probability_distributions
03 cv mil_probability_distributionszukun
 
12 cv mil_models_for_grids
12 cv mil_models_for_grids12 cv mil_models_for_grids
12 cv mil_models_for_gridszukun
 
10 cv mil_graphical_models
10 cv mil_graphical_models10 cv mil_graphical_models
10 cv mil_graphical_modelszukun
 
Common Probability Distibution
Common Probability DistibutionCommon Probability Distibution
Common Probability DistibutionLukas Tencer
 
02 cv mil_intro_to_probability
02 cv mil_intro_to_probability02 cv mil_intro_to_probability
02 cv mil_intro_to_probabilityzukun
 
05 cv mil_normal_distribution
05 cv mil_normal_distribution05 cv mil_normal_distribution
05 cv mil_normal_distributionzukun
 
Introduction to Probability
Introduction to ProbabilityIntroduction to Probability
Introduction to ProbabilityLukas Tencer
 
06 cv mil_learning_and_inference
06 cv mil_learning_and_inference06 cv mil_learning_and_inference
06 cv mil_learning_and_inferencezukun
 
19 cv mil_temporal_models
19 cv mil_temporal_models19 cv mil_temporal_models
19 cv mil_temporal_modelszukun
 
Mathematical Modeling for Practical Problems
Mathematical Modeling for Practical ProblemsMathematical Modeling for Practical Problems
Mathematical Modeling for Practical ProblemsLiwei Ren任力偉
 
Statistical models of shape and appearance
Statistical models of shape and appearanceStatistical models of shape and appearance
Statistical models of shape and appearancepotaters
 
Geometric Modelling in Computer Aided Design.pptx
Geometric Modelling in Computer Aided Design.pptxGeometric Modelling in Computer Aided Design.pptx
Geometric Modelling in Computer Aided Design.pptxsuyogpatil49
 

Similaire à 17 cv mil_models_for_shape (20)

09 cv mil_classification
09 cv mil_classification09 cv mil_classification
09 cv mil_classification
 
11 cv mil_models_for_chains_and_trees
11 cv mil_models_for_chains_and_trees11 cv mil_models_for_chains_and_trees
11 cv mil_models_for_chains_and_trees
 
13 cv mil_preprocessing
13 cv mil_preprocessing13 cv mil_preprocessing
13 cv mil_preprocessing
 
20 cv mil_models_for_words
20 cv mil_models_for_words20 cv mil_models_for_words
20 cv mil_models_for_words
 
18 cv mil_style_and_identity
18 cv mil_style_and_identity18 cv mil_style_and_identity
18 cv mil_style_and_identity
 
07 cv mil_modeling_complex_densities
07 cv mil_modeling_complex_densities07 cv mil_modeling_complex_densities
07 cv mil_modeling_complex_densities
 
04 cv mil_fitting_probability_models
04 cv mil_fitting_probability_models04 cv mil_fitting_probability_models
04 cv mil_fitting_probability_models
 
03 cv mil_probability_distributions
03 cv mil_probability_distributions03 cv mil_probability_distributions
03 cv mil_probability_distributions
 
12 cv mil_models_for_grids
12 cv mil_models_for_grids12 cv mil_models_for_grids
12 cv mil_models_for_grids
 
10 cv mil_graphical_models
10 cv mil_graphical_models10 cv mil_graphical_models
10 cv mil_graphical_models
 
Common Probability Distibution
Common Probability DistibutionCommon Probability Distibution
Common Probability Distibution
 
02 cv mil_intro_to_probability
02 cv mil_intro_to_probability02 cv mil_intro_to_probability
02 cv mil_intro_to_probability
 
05 cv mil_normal_distribution
05 cv mil_normal_distribution05 cv mil_normal_distribution
05 cv mil_normal_distribution
 
Introduction to Probability
Introduction to ProbabilityIntroduction to Probability
Introduction to Probability
 
06 cv mil_learning_and_inference
06 cv mil_learning_and_inference06 cv mil_learning_and_inference
06 cv mil_learning_and_inference
 
19 cv mil_temporal_models
19 cv mil_temporal_models19 cv mil_temporal_models
19 cv mil_temporal_models
 
Deep Learning Opening Workshop - Improving Generative Models - Junier Oliva, ...
Deep Learning Opening Workshop - Improving Generative Models - Junier Oliva, ...Deep Learning Opening Workshop - Improving Generative Models - Junier Oliva, ...
Deep Learning Opening Workshop - Improving Generative Models - Junier Oliva, ...
 
Mathematical Modeling for Practical Problems
Mathematical Modeling for Practical ProblemsMathematical Modeling for Practical Problems
Mathematical Modeling for Practical Problems
 
Statistical models of shape and appearance
Statistical models of shape and appearanceStatistical models of shape and appearance
Statistical models of shape and appearance
 
Geometric Modelling in Computer Aided Design.pptx
Geometric Modelling in Computer Aided Design.pptxGeometric Modelling in Computer Aided Design.pptx
Geometric Modelling in Computer Aided Design.pptx
 

Plus de zukun

ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVzukun
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibrationzukun
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionzukun
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluationzukun
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-softwarezukun
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptorszukun
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectorszukun
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-introzukun
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video searchzukun
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video searchzukun
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video searchzukun
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learningzukun
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionzukun
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick startzukun
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysiszukun
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structureszukun
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities zukun
 
Icml2012 learning hierarchies of invariant features
Icml2012 learning hierarchies of invariant featuresIcml2012 learning hierarchies of invariant features
Icml2012 learning hierarchies of invariant featureszukun
 
ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...
ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...
ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...zukun
 
Quoc le tera-scale deep learning
Quoc le   tera-scale deep learningQuoc le   tera-scale deep learning
Quoc le tera-scale deep learningzukun
 

Plus de zukun (20)

ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCV
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibration
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer vision
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluation
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-software
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptors
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectors
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-intro
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video search
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video search
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video search
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learning
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer vision
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick start
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysis
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structures
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities
 
Icml2012 learning hierarchies of invariant features
Icml2012 learning hierarchies of invariant featuresIcml2012 learning hierarchies of invariant features
Icml2012 learning hierarchies of invariant features
 
ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...
ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...
ECCV2010: Modeling Temporal Structure of Decomposable Motion Segments for Act...
 
Quoc le tera-scale deep learning
Quoc le   tera-scale deep learningQuoc le   tera-scale deep learning
Quoc le tera-scale deep learning
 

Dernier

Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 

Dernier (20)

Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 

17 cv mil_models_for_shape

  • 1. Computer vision: models, learning and inference Chapter 17 Models for shape Please send errata to s.prince@cs.ucl.ac.uk
  • 2. Structure • Snakes • Template models • Statistical shape models • 3D shape models • Models for shape and appearance • Non-linear models • Articulated models • Applications Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 2
  • 3. Motivation: fitting shape model Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 3
  • 4. What is shape? • Kendall (1984) – Shape “is all the geometrical information that remains when location scale and rotational effects are filtered out from an object” • In other words, it is whatever is invariant to a similarity transformation Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 4
  • 5. Representing Shape • Algebraic modelling – Line: – Conic: – More complex objects? Not practical for spine. Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 5
  • 6. Landmark Points • Landmark points can be thought of as discrete samples from underlying contour – Ordered (single continuous contour) – Ordered with wrapping (closed contour) – More complex organisation (collection of closed and open) Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 6
  • 7. Snakes • Provide only weak information: contour is smooth • Represent contour as N 2D landmark points • We will construct terms for – The likelihood of observing an image x given landmark points W. Encourages landmark points to lie on border in the image – The prior of the landmark point. Encourages the contours to be smooth. Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 7
  • 8. Snakes Initialise contour and let it evolve until it grabs onto an object Crawls across the image – hence called snake or active contour Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 8
  • 9. Snake likelihood Has correct properties (probability high at edges), but flat in regions distant from the contour. Not good for optimisation. Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 9
  • 10. Snake likelihood (2) Compute edges (here using Canny) and then compute distance image – this varies smoothly with distance from the image Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 10
  • 11. Prior • Encourages smoothness – Encourages equal spacing – Encourages low curvature Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 11
  • 12. Inference • Maximise posterior probability • No closed form solution • Must use non-linear optimisation method • Number of unknowns = 2N Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 12
  • 13. Snakes Notice failure at nose – falls between points. A better model would sample image between landmark points Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 13
  • 14. Inference • Maximise posterior probability • Very slow. Can potentially speed it up by changing spacing element of prior: • Take advantage of limited connectivity of associated graphical model Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 14
  • 15. Relationships between models Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 15
  • 16. Structure • Snakes • Template models • Statistical shape models • 3D shape models • Models for shape and appearance • Non-linear models • Articulated models • Applications Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 16
  • 17. Shape template model • Shape based on landmark points • These points are assumed known • Mapped into the image by transformation • What is left is to find parameters of transformation • Likelihood is based on distance transform: • No prior on parameters (but could do) Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 17
  • 18. Shape template model Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 18
  • 19. Inference • Use maximum likelihood approach • No closed form solution • Must use non-linear optimization • Use chain rule to compute derivatives Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 19
  • 20. Iterative closest points • Find nearest edge point to each landmark point • Compute transformation in closed form • Repeat Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 20
  • 21. Structure • Snakes • Template models • Statistical shape models • 3D shape models • Models for shape and appearance • Non-linear models • Articulated models • Applications Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 21
  • 22. Statistical shape models • Also called – Point distribution models – Active shape models (as they adapt to the image) • Likelihood: • Prior: Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 22
  • 23. Learning • Usually, we are given the examples after they have been transformed • Before we can learn the normal distribution we must compute the inverse transformation • Procedure is called generalized Procrustes analysis Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 23
  • 24. Generalized Procrustes analysis Training data Before alignment After alignment Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 24
  • 25. Generalized Procrustes analysis Alternately – Update all transformations to map landmark points to current mean – Update mean to be average of transformed values Then learn mean and variance parameters. Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 25
  • 26. Inference • Map inference: • No closed form solution • Use non-linear optimisation • Or use ICP approach • However, many parameters, and not clear they are all needed • more efficient to use subspace model Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 26
  • 27. Face model Three samples from learnt model for faces Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 27
  • 28. Subspace shape model • Generate data from model: – is the mean shape – the matrix contains K basis functions in it columns – is normal noise with covariance • Can alternatively write Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 28
  • 29. Approximating with subspace Subspace model Can approximate an vector w with a weighted sum of the basis functions Surprising how well this works even with a small number of basis functions Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 29
  • 30. Subspace shape model Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 30
  • 31. Probabilistic PCA Generative eq: Probabilistic version: Add prior: Density: Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 31
  • 32. Learning PPCA Learn parameters and from data , where . Learn mean: Then set and compute eigen-decomposition Choose parameters Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 32
  • 33. Properties of basis functions Learning of parameters based on eigen-decomposition: Parameters Notice that: • Basis functions in are orthogonal • Basis functions in are ordered Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 33
  • 34. Learnt hand model Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 34
  • 35. Learnt spine model Mean Manipulating first Manipulating second principal component principal component Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 35
  • 36. Inference To fit model to an image: likelihood prior ICP Approach: • Find closest points to current prediction • Update weightings h • Find closest points to current prediction • Update transformation parameters y Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 36
  • 37. Inference 1. Update weightings h If transformation parameters can be represented as a matrix A 2. Update transformation parameters y • Using one of closed form solutions Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 37
  • 38. Fitting model Much better to use statistical classifier instead of just distance from edges Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 38
  • 39. Structure • Snakes • Template models • Statistical shape models • 3D shape models • Models for shape and appearance • Non-linear models • Articulated models • Applications Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 39
  • 40. 3D shape models Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 40
  • 41. Structure • Snakes • Template models • Statistical shape models • 3D shape models • Models for shape and appearance • Non-linear models • Articulated models • Applications Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 41
  • 42. Statistical models for shape and appearance Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 42
  • 43. Statistical models for shape and appearance 1. We draw a hidden variable from a prior Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 43
  • 44. Statistical models for shape and appearance 1. We draw a hidden variable h from a prior 2. We draw landmark points w from a subspace model 3. We draw image intensities x. • Generate image intensities in standard template shape • Transform the landmark points (parameters y) • Transform the image to landmark points • Add noise Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 44
  • 45. Shape and appearance model Shape model Intensity model Shape and intensity Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 45
  • 46. Warping images Piecewise affine transformation Triangulate image points using Delaunay triangulation. Image in each triangle is warped by an affine transformation. Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 46
  • 47. Learning Goal is to learn parameters : Problem • We are given the transformed landmark points • We are given the warped and transformed images Solution • Use Procrustes analysis to un-transform landmark points • Warp observed images to template shape Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 47
  • 48. Learning Now have aligned landmark points w, and aligned images x, we can learn the simpler model: Can write generative equation as: Has the form of a factor analyzer Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 48
  • 49. Inference Likelihood of observed intensities To fit the model use maximum likelihood This has the least squares form Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 49
  • 50. Inference This has the least squares form Use Gauss-Newton method or similar Where the Jacobian J is a matrix with elements Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 50
  • 51. Statistical models for shape and appearance Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 51
  • 52. Structure • Snakes • Template models • Statistical shape models • 3D shape models • Models for shape and appearance • Non-linear models • Articulated models • Applications Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 52
  • 53. Non-linear models • The shape and appearance models that we have studied so far are based on the normal distribution • But more complex shapes might need more complex distributions – Could use mixture of PPCAs or similar – Or use a non-linear subspace model • We will investigate the Gaussian process latent variable model (GPLVM) • To understand the GPLVM, first think about PPCA in terms of regression. Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 53
  • 54. PPCA as regression PPCA model: • First term in last equation looks like regression • Predicts w for a given h • Considering each dimension separately, get linear regression Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 54
  • 55. PPCA as regression Joint probability Regress 1st Regress 2nd distribution dimension against dimension against hidden variable hidden variable Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 55
  • 56. Gaussian process latent variable model • Idea: replace the linear regression model with a non-linear regression model • As name suggests, use Gaussian process regression • Implications – Can now marginalize over parameters m and F – Can no longer marginalize over variable h Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 56
  • 57. GPLVM as regression Joint probability Regress 1st Regress 2nd distribution dimension against dimension against hidden variable hidden variable Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 57
  • 58. Learning • In learning the Gaussian process regression model , we optimized the marginal likelihood of the data with respect to the parameter s2. Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 58
  • 59. Learning • In learning the GPLVM, we still optimized the marginal likelihood of the data with respect to the parameter s2, but must also find the values of the hidden variables that we regress against . • Use non-linear optimization technique Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 59
  • 60. Inference • To predict a new value of the data using a hidden variable • To compute density • Cannot be computed in closed from Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 60
  • 61. GPLVM Shape models Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 61
  • 62. Structure • Snakes • Template models • Statistical shape models • 3D shape models • Models for shape and appearance • Non-linear models • Articulated models • Applications Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 62
  • 63. Articulated Models • Transformations of parts applied one after each other • Known as a kinematic chain • e.g. Foot transform is relative to lower leg, which is relative to upper leg etc. • One root transformation that describes the position of model relative to camera Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 63
  • 64. Articulated Models Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 64
  • 65. Articulated Models • One possible model for an object part is a quadric • Represents spheres, ellipsoids, cylinders, pairs of planes and others • Make truncated cylinders by clipping with cylinder with pair of planes • Projects to conic in the image Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 65
  • 66. Structure • Snakes • Template models • Statistical shape models • 3D shape models • Models for shape and appearance • Non-linear models • Articulated models • Applications Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 66
  • 67. 3D morphable models Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 67
  • 68. 3D morphable models Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 68
  • 69. 3D morphable models Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 69
  • 70. 3D body model Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 70
  • 71. 3D body model applications Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 71
  • 72. Conclusions • Introduced a series of models for shape • Assume different forms of prior knowledge • Contour is smooth (snakes) • Shape is known, but not position (template) • Shape class is known (statistical models) • Structure of shape known (articulated model) • Relates to other models • Based on subspace models • Tracked using temporal models Computer vision: models, learning and inference. ©2011 Simon J.D. Prince 72