SlideShare a Scribd company logo
1 of 33
Download to read offline
Multi-class Classification on Riemannian
Manifolds for Video Surveillance
D. Tosato, M. Farenzena, M. Cristani, M. Spera and V. Murino
Dipartimento di Informatica, University of Verona, Italy
Istituto Italiano di Tecnologia (IIT), Genova, Italy
The Problem
In video surveillance, classification of visual data
can be very hard
Small obj., << 50x50
< 60x40
Low resolution
Occlusions
Bad light conditions
What kind of situations we want to tackle?
The goal of this work is …
• finding a feature able to describe visual
objects at prohibitive low resolutions.
• building a robust multi-class learning
framework which marries the selected object
description.
Related works:
• O. Tuzel, F. Porikli, P. Meer. Pedestrian detection via classification on Riemannian manifolds.
IEEE PAMI, 2008.
• J. Orozco, S. Gong, T. Xiang. Head pose classification in crowded scenes. BMVC 2009.
• N. Dalal, B. Triggs. Histograms of oriented gradients for human detection. CVPR 2005.
3
Outline
• Problem overview
• Feature layout: ARCO
• Multi-class classification framework on
Riemannian Manifolds
• Experiments
• Computational considerations
• Conclusions and future work
4
Overview
Assume:
• Data are typically at low resolution, often in crowded
scenarios.
5
• According to a surveillance task, only a coarse
categorization can be achieved.
• Data must be roughly aligned for training purposes.
Overview
The method in a nutshell …
1. Features and their integral representations are
calculated.
2. For a set of overlapping patches on a regular grid
covariance matrices as object descriptors are built.
3. Covariance matrices are managed introducing the
sectional curvature analysis (SCA)
4. For each patch a multi-class LogitBoost classifier is
instantiated.
5. A majority voting is used to label an image.
6
Outline
• Problem overview
• Feature layout: ARCO
• Multi-class classification framework on
Riemannian Manifolds
• Experiments
• Computational considerations
• Conclusions and future work
7
ARCO (ARray of COvariances)
• An image is organized into a grid of uniformly spaced and
overlapping patches.
• This layout does not needs to find region/point of interest and
is efficiently computed.
• The patches of pixels, on a fixed grid of pixels steps.
• We achieve the best classification performances both for
pedestrians and heads using where is the image
dimension. 8
ARCO (ARray of COvariances)
• Each patch is described by a covariance matrix of image
features.
• These have been exploited as powerful descriptors of
pedestrians [Tuzel et al. PAMI2008].
• Their effectiveness have been explicitly investigated in a
comparative study [Paisitkriangkrai et al. IET-CV2008].
• Their versatility has been shown in [Tuzel et al. ECCV2006].
• Feature set is task-dependent:
9
Head Pose Classification/Detection
Pedestrian Detection
Working with covariances
• An object is described with a set of covariance
matrices.
• Covariance matrices live on a Riemannian manifold
and typical machine learning techniques are not
usable.
• Covariances have to be projected on local manifold
views (vectorial spaces) for learning purposes.
• At the state-of-the art, classifiers are learned on the
local views and combined with boosting.
10
Interesting issues in using covariances
• On the Riemannian Manifold
of covariances , the
distance between points is
• On a tangent space the
distance is the usual
Euclidean distance
11
So, the question is:
- is a good approximation of ? ?
- if so, which tangent space must be chosen?
Working efficiently with covariances
• Covariances are very powerful descriptors with some
characteristics:
– covariances’ calculation is fast thanks to integral tensor
representation [Tuzel et al., ECCV2006]
– the computational burden necessary to utilize them is a
drawback
• The nonlinear manifold of covariances can be turned
into a flat one using the Log-Euclidean metric, but the
goodness of the approximation have to be estimated.
Sectional Curvature Analysis - SCA
12
I. Chavel, Riemannian Geometry - A modern introduction. Cambridge Univ. Press, 2006.
Sectional Curvature Analysis - SCA
• The space of covariance matrices can be
equipped with a Riemannian metric.
• SCA is a way to describe the curvature on a Riemannian
Manifold which naturally generalize the classical
Gaussian curvature for surfaces.
• is a homogeneous symmetric space, therefore
its negative sectional curvature can be computed at .
• If the SCA coefficient is close to 0 implies that the
Riemannian Manifold is almost flat (= Euclidean space).
13
Idea
SCA (2)
Let and their
logarithm mapping at ( e.g. ), and
an approximated (geodesic) distance
14
It is a non-negative function that depends
on the sectional curvature
SCA (3)
• Experimentally mean value is -10-3 that is
far from the standard negative curvature -1
• In this conditions, one can choose any point
on which to map the dataset.
15
Sectional Curvature
Outline
• Problem overview
• Feature layout: ArCO
• Multi-class classification framework on
Riemannian Manifolds
• Experiments
• Computational considerations
• Conclusions and future work
16
Multi-class Boosting Framework
• For each patch a multi-class boosting classifier is
learned.
• To reach a computationally feasible solution we
exploit the result of SCA in a multi-class
LogitBoost learning framework:
• Given , where , the
Riemannian Manifold of covariance matrices and
a set of labels, data is mapped to
by:
17
LogitBoost [J. Friedman, T. Hastie, and R. Tibshirani, Ann Statist. 2000]
18
• LB is a real (not 1-vs-All) multi-class boosting
framework which fits iteratively an additive
symmetric logistic model to get the posterior
over the classes
• The update step combine the weak
classification response coming from each class.
Multi-class weak classifier
Binary weak classifier
• At each iteration, LB combine binary weak
classification response fitting its
own linear/non-linear regressor .
• Each multi-class weak learner
focuses on a sub-window on an overlapped
regular grid of Np patches.
• We assign a class label with a estimating
where
19
LogitBoost (2)
Multi-class Boosting Framework (2)
• We eliminate the necessity of using the boosting as
feature selector reinforcing the weak learning
strategy: Weighted Regression Trees.
• Adding an extra class with negative examples and
using the rejection cascade*, we can build a robust
multi-class detector.
• We have established an automatic stopping rule for
the learning process:
20* Viola and Jones, CVPR 2001
Weighted Regression Trees
• WRTs* are binary tree which can be applied to
efficiently tackle the weighted nonlinear
regression problem.
• WRTs growth have been limited strongly in
order to use them as weak classifiers.
• Boosting Weights are injected into a WRT to
refine the regression result.
21* L. Breiman et al., Classification and Regression Trees, CRC Press, 1984
Outline
• Problem overview
• Feature layout: ArCO
• Multi-class classification framework on
Riemannian Manifolds
• Experiments
• Computational considerations
• Conclusions and future work
22
Experiments
Datasets
Head Pose Classification:
• QMUL 4 Head Pose Dataset:
– 5 classes (back/front/left/right/background)
– 4000 examples/class automatically collected
– Image resolution: 128x64 pixels
• Additional Examples from INRIA Person Dataset
– 2736 head examples / ~ 2000 background examples
– head examples are manually classified in 4 classes
– Image resolution: 32x pixels
Pedestrian Detection:
• INRIA Person Dataset:
– 3580 pedestrians / 1671 person-free images;
– Pedestrian ROI resolution 128x64 pixels.
23
24
Experiments
Head Pose Classification
4 classes is used from QMUL 4 Head Pose Dataset (no background). Some examples:
Performances using
different feature subsets
Experiments
Head Pose Classification
25
Our, avg =.94, std=.05 Orozco et al., avg =.82, std=.11
• 5 classes is used from QMUL 4 Head
Pose Dataset.
26
Our, avg =.90, std=.09
Orozco et al., avg =.67, std=.36
* Data provided by QMUL * Data is extracted from INRIA Person Dataset
• 5 classes coming from the
previous dataset join extra ~ 500
examples/class for the FG and
~2000 for the BG from a more
general dataset.
Experiments
Head Pose Detection
Experiments
Pedestrian Detection
• We use the INRIA Person Dataset where a person is
contained in a ROI of 128x64 pixels and its actual
average dimension is 50x50 pixels.
• To achieve the best performances in terms of FPPW,
we imply a cascade of 5 levels.
27
True Positive
False Negative
miss rate
True Negative
False Positive
FPPW
Qualitative Experiments
28
29
Qualitative Experiments
Outline
• Problem overview
• Feature layout: ArCO
• Multi-class classification framework on
Riemannian Manifolds
• Experiments
• Computational considerations
• Conclusions and future work
30
Computational considerations
• Fixing the feature layout, ARCO decreases the
computational complexity of the learning phase
of one order of magnitude wrt the state-of-the-
art boosting framework embedding the feature
selection: from to .
• Using as unique projection point we reduce
the compitational complexity of projection from
to .
31
Number of WL per class
Number of classes
Number of candidate features
SVD
Number of candidate features
In time: from 2 weeks to 2 hours.
Computational considerations (2)
• The computation of a image integral tensors takes
, where is the number of features
considered and and are the image
dimensions.
• The complexity of using regression trees as weak
learners (fixing a priori the number of elements
per terminal node) is , with the
number of samples.
32
Conclusions and future work
• We have proposed the novel general-purpose ARCO
descriptor.
• We have built an effective and efficient multi-class
LogitBoost framework able to work on Riemannian
Manifolds.
• SCA is introduced as an effective tool to analyze the
curvature of a Riemannian Manifold.
• ARCO is able to describe visual objects at prohibitive
low resolutions.
• ARCO will be used with more general class of object.
• In the future, we plan to work on devising a more
novel and powerful classification technique, to replace
the LogitBoost framework. 34

More Related Content

What's hot

CVPR2008 tutorial generalized pca
CVPR2008 tutorial generalized pcaCVPR2008 tutorial generalized pca
CVPR2008 tutorial generalized pcazukun
 
Performance Evaluation of Object Tracking Technique Based on Position Vectors
Performance Evaluation of Object Tracking Technique Based on Position VectorsPerformance Evaluation of Object Tracking Technique Based on Position Vectors
Performance Evaluation of Object Tracking Technique Based on Position VectorsCSCJournals
 
Semantic Video Segmentation with Using Ensemble of Particular Classifiers and...
Semantic Video Segmentation with Using Ensemble of Particular Classifiers and...Semantic Video Segmentation with Using Ensemble of Particular Classifiers and...
Semantic Video Segmentation with Using Ensemble of Particular Classifiers and...ITIIIndustries
 
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...Jinwon Lee
 
Offline Character Recognition Using Monte Carlo Method and Neural Network
Offline Character Recognition Using Monte Carlo Method and Neural NetworkOffline Character Recognition Using Monte Carlo Method and Neural Network
Offline Character Recognition Using Monte Carlo Method and Neural Networkijaia
 
DeconvNet, DecoupledNet, TransferNet in Image Segmentation
DeconvNet, DecoupledNet, TransferNet in Image SegmentationDeconvNet, DecoupledNet, TransferNet in Image Segmentation
DeconvNet, DecoupledNet, TransferNet in Image SegmentationNamHyuk Ahn
 
Two strategies for large-scale multi-label classification on the YouTube-8M d...
Two strategies for large-scale multi-label classification on the YouTube-8M d...Two strategies for large-scale multi-label classification on the YouTube-8M d...
Two strategies for large-scale multi-label classification on the YouTube-8M d...Dalei Li
 
A beginner's guide to Style Transfer and recent trends
A beginner's guide to Style Transfer and recent trendsA beginner's guide to Style Transfer and recent trends
A beginner's guide to Style Transfer and recent trendsJaeJun Yoo
 
Exploring Simple Siamese Representation Learning
Exploring Simple Siamese Representation LearningExploring Simple Siamese Representation Learning
Exploring Simple Siamese Representation LearningSungchul Kim
 
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...ActiveEon
 
Lecture 06 geometric transformations and image registration
Lecture 06 geometric transformations and image registrationLecture 06 geometric transformations and image registration
Lecture 06 geometric transformations and image registrationobertksg
 
Mlp mixer an all-mlp architecture for vision
Mlp mixer  an all-mlp architecture for visionMlp mixer  an all-mlp architecture for vision
Mlp mixer an all-mlp architecture for visionJaey Jeong
 
Neural networks Self Organizing Map by Engr. Edgar Carrillo II
Neural networks Self Organizing Map by Engr. Edgar Carrillo IINeural networks Self Organizing Map by Engr. Edgar Carrillo II
Neural networks Self Organizing Map by Engr. Edgar Carrillo IIEdgar Carrillo
 
PR-305: Exploring Simple Siamese Representation Learning
PR-305: Exploring Simple Siamese Representation LearningPR-305: Exploring Simple Siamese Representation Learning
PR-305: Exploring Simple Siamese Representation LearningSungchul Kim
 
Radial basis function network ppt bySheetal,Samreen and Dhanashri
Radial basis function network ppt bySheetal,Samreen and DhanashriRadial basis function network ppt bySheetal,Samreen and Dhanashri
Radial basis function network ppt bySheetal,Samreen and Dhanashrisheetal katkar
 
Review-image-segmentation-by-deep-learning
Review-image-segmentation-by-deep-learningReview-image-segmentation-by-deep-learning
Review-image-segmentation-by-deep-learningTrong-An Bui
 
Learning Convolutional Neural Networks for Graphs
Learning Convolutional Neural Networks for GraphsLearning Convolutional Neural Networks for Graphs
Learning Convolutional Neural Networks for GraphsMathias Niepert
 
Oriented Tensor Reconstruction. Tracing Neural Pathways from DT-MRI
Oriented Tensor Reconstruction. Tracing Neural Pathways from DT-MRIOriented Tensor Reconstruction. Tracing Neural Pathways from DT-MRI
Oriented Tensor Reconstruction. Tracing Neural Pathways from DT-MRILeonid Zhukov
 

What's hot (20)

CVPR2008 tutorial generalized pca
CVPR2008 tutorial generalized pcaCVPR2008 tutorial generalized pca
CVPR2008 tutorial generalized pca
 
Performance Evaluation of Object Tracking Technique Based on Position Vectors
Performance Evaluation of Object Tracking Technique Based on Position VectorsPerformance Evaluation of Object Tracking Technique Based on Position Vectors
Performance Evaluation of Object Tracking Technique Based on Position Vectors
 
Semantic Video Segmentation with Using Ensemble of Particular Classifiers and...
Semantic Video Segmentation with Using Ensemble of Particular Classifiers and...Semantic Video Segmentation with Using Ensemble of Particular Classifiers and...
Semantic Video Segmentation with Using Ensemble of Particular Classifiers and...
 
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
 
Offline Character Recognition Using Monte Carlo Method and Neural Network
Offline Character Recognition Using Monte Carlo Method and Neural NetworkOffline Character Recognition Using Monte Carlo Method and Neural Network
Offline Character Recognition Using Monte Carlo Method and Neural Network
 
Mattar_PhD_Thesis
Mattar_PhD_ThesisMattar_PhD_Thesis
Mattar_PhD_Thesis
 
DeconvNet, DecoupledNet, TransferNet in Image Segmentation
DeconvNet, DecoupledNet, TransferNet in Image SegmentationDeconvNet, DecoupledNet, TransferNet in Image Segmentation
DeconvNet, DecoupledNet, TransferNet in Image Segmentation
 
Two strategies for large-scale multi-label classification on the YouTube-8M d...
Two strategies for large-scale multi-label classification on the YouTube-8M d...Two strategies for large-scale multi-label classification on the YouTube-8M d...
Two strategies for large-scale multi-label classification on the YouTube-8M d...
 
A beginner's guide to Style Transfer and recent trends
A beginner's guide to Style Transfer and recent trendsA beginner's guide to Style Transfer and recent trends
A beginner's guide to Style Transfer and recent trends
 
Exploring Simple Siamese Representation Learning
Exploring Simple Siamese Representation LearningExploring Simple Siamese Representation Learning
Exploring Simple Siamese Representation Learning
 
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
 
Lecture 06 geometric transformations and image registration
Lecture 06 geometric transformations and image registrationLecture 06 geometric transformations and image registration
Lecture 06 geometric transformations and image registration
 
Mlp mixer an all-mlp architecture for vision
Mlp mixer  an all-mlp architecture for visionMlp mixer  an all-mlp architecture for vision
Mlp mixer an all-mlp architecture for vision
 
Neural networks Self Organizing Map by Engr. Edgar Carrillo II
Neural networks Self Organizing Map by Engr. Edgar Carrillo IINeural networks Self Organizing Map by Engr. Edgar Carrillo II
Neural networks Self Organizing Map by Engr. Edgar Carrillo II
 
PR-305: Exploring Simple Siamese Representation Learning
PR-305: Exploring Simple Siamese Representation LearningPR-305: Exploring Simple Siamese Representation Learning
PR-305: Exploring Simple Siamese Representation Learning
 
Radial basis function network ppt bySheetal,Samreen and Dhanashri
Radial basis function network ppt bySheetal,Samreen and DhanashriRadial basis function network ppt bySheetal,Samreen and Dhanashri
Radial basis function network ppt bySheetal,Samreen and Dhanashri
 
Sefl Organizing Map
Sefl Organizing MapSefl Organizing Map
Sefl Organizing Map
 
Review-image-segmentation-by-deep-learning
Review-image-segmentation-by-deep-learningReview-image-segmentation-by-deep-learning
Review-image-segmentation-by-deep-learning
 
Learning Convolutional Neural Networks for Graphs
Learning Convolutional Neural Networks for GraphsLearning Convolutional Neural Networks for Graphs
Learning Convolutional Neural Networks for Graphs
 
Oriented Tensor Reconstruction. Tracing Neural Pathways from DT-MRI
Oriented Tensor Reconstruction. Tracing Neural Pathways from DT-MRIOriented Tensor Reconstruction. Tracing Neural Pathways from DT-MRI
Oriented Tensor Reconstruction. Tracing Neural Pathways from DT-MRI
 

Similar to Multi-class Classification on Manifolds for Video Surveillance

A Re-evaluation of Pedestrian Detection on Riemannian Manifolds
A Re-evaluation of Pedestrian Detection on Riemannian ManifoldsA Re-evaluation of Pedestrian Detection on Riemannian Manifolds
A Re-evaluation of Pedestrian Detection on Riemannian ManifoldsDiego Tosato
 
Image Segmentation Using Deep Learning : A survey
Image Segmentation Using Deep Learning : A surveyImage Segmentation Using Deep Learning : A survey
Image Segmentation Using Deep Learning : A surveyNUPUR YADAV
 
Parn pyramidal+affine+regression+networks+for+dense+semantic+correspondence
Parn pyramidal+affine+regression+networks+for+dense+semantic+correspondenceParn pyramidal+affine+regression+networks+for+dense+semantic+correspondence
Parn pyramidal+affine+regression+networks+for+dense+semantic+correspondenceNAVER Engineering
 
End-to-End Object Detection with Transformers
End-to-End Object Detection with TransformersEnd-to-End Object Detection with Transformers
End-to-End Object Detection with TransformersSeunghyun Hwang
 
Deep learning from a novice perspective
Deep learning from a novice perspectiveDeep learning from a novice perspective
Deep learning from a novice perspectiveAnirban Santara
 
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)Universitat Politècnica de Catalunya
 
MLIP - Chapter 5 - Detection, Segmentation, Captioning
MLIP - Chapter 5 - Detection, Segmentation, CaptioningMLIP - Chapter 5 - Detection, Segmentation, Captioning
MLIP - Chapter 5 - Detection, Segmentation, CaptioningCharles Deledalle
 
PR095: Modularity Matters: Learning Invariant Relational Reasoning Tasks
PR095: Modularity Matters: Learning Invariant Relational Reasoning TasksPR095: Modularity Matters: Learning Invariant Relational Reasoning Tasks
PR095: Modularity Matters: Learning Invariant Relational Reasoning TasksJinwon Lee
 
Using Feature Grouping as a Stochastic Regularizer for High Dimensional Noisy...
Using Feature Grouping as a Stochastic Regularizer for High Dimensional Noisy...Using Feature Grouping as a Stochastic Regularizer for High Dimensional Noisy...
Using Feature Grouping as a Stochastic Regularizer for High Dimensional Noisy...WiMLDSMontreal
 
Deep learning fundamental and Research project on IBM POWER9 system from NUS
Deep learning fundamental and Research project on IBM POWER9 system from NUSDeep learning fundamental and Research project on IBM POWER9 system from NUS
Deep learning fundamental and Research project on IBM POWER9 system from NUSGanesan Narayanasamy
 
Machine learning and linear regression programming
Machine learning and linear regression programmingMachine learning and linear regression programming
Machine learning and linear regression programmingSoumya Mukherjee
 
CNN vs SIFT-based Visual Localization - Laura Leal-Taixé - UPC Barcelona 2018
CNN vs SIFT-based Visual Localization - Laura Leal-Taixé - UPC Barcelona 2018CNN vs SIFT-based Visual Localization - Laura Leal-Taixé - UPC Barcelona 2018
CNN vs SIFT-based Visual Localization - Laura Leal-Taixé - UPC Barcelona 2018Universitat Politècnica de Catalunya
 
Moving object detection in complex scene
Moving object detection in complex sceneMoving object detection in complex scene
Moving object detection in complex sceneKumar Mayank
 

Similar to Multi-class Classification on Manifolds for Video Surveillance (20)

A Re-evaluation of Pedestrian Detection on Riemannian Manifolds
A Re-evaluation of Pedestrian Detection on Riemannian ManifoldsA Re-evaluation of Pedestrian Detection on Riemannian Manifolds
A Re-evaluation of Pedestrian Detection on Riemannian Manifolds
 
lecture_16.pptx
lecture_16.pptxlecture_16.pptx
lecture_16.pptx
 
Image Segmentation Using Deep Learning : A survey
Image Segmentation Using Deep Learning : A surveyImage Segmentation Using Deep Learning : A survey
Image Segmentation Using Deep Learning : A survey
 
Parn pyramidal+affine+regression+networks+for+dense+semantic+correspondence
Parn pyramidal+affine+regression+networks+for+dense+semantic+correspondenceParn pyramidal+affine+regression+networks+for+dense+semantic+correspondence
Parn pyramidal+affine+regression+networks+for+dense+semantic+correspondence
 
End-to-End Object Detection with Transformers
End-to-End Object Detection with TransformersEnd-to-End Object Detection with Transformers
End-to-End Object Detection with Transformers
 
Mini_Project
Mini_ProjectMini_Project
Mini_Project
 
ExplainableAI.pptx
ExplainableAI.pptxExplainableAI.pptx
ExplainableAI.pptx
 
lec10svm.ppt
lec10svm.pptlec10svm.ppt
lec10svm.ppt
 
Deep learning from a novice perspective
Deep learning from a novice perspectiveDeep learning from a novice perspective
Deep learning from a novice perspective
 
Seminar nov2017
Seminar nov2017Seminar nov2017
Seminar nov2017
 
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
 
MLIP - Chapter 5 - Detection, Segmentation, Captioning
MLIP - Chapter 5 - Detection, Segmentation, CaptioningMLIP - Chapter 5 - Detection, Segmentation, Captioning
MLIP - Chapter 5 - Detection, Segmentation, Captioning
 
PR095: Modularity Matters: Learning Invariant Relational Reasoning Tasks
PR095: Modularity Matters: Learning Invariant Relational Reasoning TasksPR095: Modularity Matters: Learning Invariant Relational Reasoning Tasks
PR095: Modularity Matters: Learning Invariant Relational Reasoning Tasks
 
Using Feature Grouping as a Stochastic Regularizer for High Dimensional Noisy...
Using Feature Grouping as a Stochastic Regularizer for High Dimensional Noisy...Using Feature Grouping as a Stochastic Regularizer for High Dimensional Noisy...
Using Feature Grouping as a Stochastic Regularizer for High Dimensional Noisy...
 
Deep learning fundamental and Research project on IBM POWER9 system from NUS
Deep learning fundamental and Research project on IBM POWER9 system from NUSDeep learning fundamental and Research project on IBM POWER9 system from NUS
Deep learning fundamental and Research project on IBM POWER9 system from NUS
 
Machine learning and linear regression programming
Machine learning and linear regression programmingMachine learning and linear regression programming
Machine learning and linear regression programming
 
CNN vs SIFT-based Visual Localization - Laura Leal-Taixé - UPC Barcelona 2018
CNN vs SIFT-based Visual Localization - Laura Leal-Taixé - UPC Barcelona 2018CNN vs SIFT-based Visual Localization - Laura Leal-Taixé - UPC Barcelona 2018
CNN vs SIFT-based Visual Localization - Laura Leal-Taixé - UPC Barcelona 2018
 
Inverse problems in medical imaging
Inverse problems in medical imagingInverse problems in medical imaging
Inverse problems in medical imaging
 
Moving object detection in complex scene
Moving object detection in complex sceneMoving object detection in complex scene
Moving object detection in complex scene
 
convolutional_rbm.ppt
convolutional_rbm.pptconvolutional_rbm.ppt
convolutional_rbm.ppt
 

Recently uploaded

Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Velvetech LLC
 
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptx
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptxReal-time Tracking and Monitoring with Cargo Cloud Solutions.pptx
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptxRTS corp
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtimeandrehoraa
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaHanief Utama
 
CRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. SalesforceCRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. SalesforceBrainSell Technologies
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Cizo Technology Services
 
Large Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and RepairLarge Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and RepairLionel Briand
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfAlina Yurenko
 
Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsSafe Software
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf31events.com
 
Post Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on IdentityPost Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on Identityteam-WIBU
 
Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Rob Geurden
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...OnePlan Solutions
 
VK Business Profile - provides IT solutions and Web Development
VK Business Profile - provides IT solutions and Web DevelopmentVK Business Profile - provides IT solutions and Web Development
VK Business Profile - provides IT solutions and Web Developmentvyaparkranti
 
Precise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalPrecise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalLionel Briand
 
Xen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfXen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfStefano Stabellini
 
Salesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZSalesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZABSYZ Inc
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024StefanoLambiase
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprisepreethippts
 

Recently uploaded (20)

Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...
 
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptx
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptxReal-time Tracking and Monitoring with Cargo Cloud Solutions.pptx
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptx
 
2.pdf Ejercicios de programación competitiva
2.pdf Ejercicios de programación competitiva2.pdf Ejercicios de programación competitiva
2.pdf Ejercicios de programación competitiva
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtime
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief Utama
 
CRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. SalesforceCRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. Salesforce
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
 
Large Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and RepairLarge Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and Repair
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
 
Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data Streams
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf
 
Post Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on IdentityPost Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on Identity
 
Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
 
VK Business Profile - provides IT solutions and Web Development
VK Business Profile - provides IT solutions and Web DevelopmentVK Business Profile - provides IT solutions and Web Development
VK Business Profile - provides IT solutions and Web Development
 
Precise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalPrecise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive Goal
 
Xen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfXen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdf
 
Salesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZSalesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZ
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprise
 

Multi-class Classification on Manifolds for Video Surveillance

  • 1. Multi-class Classification on Riemannian Manifolds for Video Surveillance D. Tosato, M. Farenzena, M. Cristani, M. Spera and V. Murino Dipartimento di Informatica, University of Verona, Italy Istituto Italiano di Tecnologia (IIT), Genova, Italy
  • 2. The Problem In video surveillance, classification of visual data can be very hard Small obj., << 50x50 < 60x40 Low resolution Occlusions Bad light conditions What kind of situations we want to tackle?
  • 3. The goal of this work is … • finding a feature able to describe visual objects at prohibitive low resolutions. • building a robust multi-class learning framework which marries the selected object description. Related works: • O. Tuzel, F. Porikli, P. Meer. Pedestrian detection via classification on Riemannian manifolds. IEEE PAMI, 2008. • J. Orozco, S. Gong, T. Xiang. Head pose classification in crowded scenes. BMVC 2009. • N. Dalal, B. Triggs. Histograms of oriented gradients for human detection. CVPR 2005. 3
  • 4. Outline • Problem overview • Feature layout: ARCO • Multi-class classification framework on Riemannian Manifolds • Experiments • Computational considerations • Conclusions and future work 4
  • 5. Overview Assume: • Data are typically at low resolution, often in crowded scenarios. 5 • According to a surveillance task, only a coarse categorization can be achieved. • Data must be roughly aligned for training purposes.
  • 6. Overview The method in a nutshell … 1. Features and their integral representations are calculated. 2. For a set of overlapping patches on a regular grid covariance matrices as object descriptors are built. 3. Covariance matrices are managed introducing the sectional curvature analysis (SCA) 4. For each patch a multi-class LogitBoost classifier is instantiated. 5. A majority voting is used to label an image. 6
  • 7. Outline • Problem overview • Feature layout: ARCO • Multi-class classification framework on Riemannian Manifolds • Experiments • Computational considerations • Conclusions and future work 7
  • 8. ARCO (ARray of COvariances) • An image is organized into a grid of uniformly spaced and overlapping patches. • This layout does not needs to find region/point of interest and is efficiently computed. • The patches of pixels, on a fixed grid of pixels steps. • We achieve the best classification performances both for pedestrians and heads using where is the image dimension. 8
  • 9. ARCO (ARray of COvariances) • Each patch is described by a covariance matrix of image features. • These have been exploited as powerful descriptors of pedestrians [Tuzel et al. PAMI2008]. • Their effectiveness have been explicitly investigated in a comparative study [Paisitkriangkrai et al. IET-CV2008]. • Their versatility has been shown in [Tuzel et al. ECCV2006]. • Feature set is task-dependent: 9 Head Pose Classification/Detection Pedestrian Detection
  • 10. Working with covariances • An object is described with a set of covariance matrices. • Covariance matrices live on a Riemannian manifold and typical machine learning techniques are not usable. • Covariances have to be projected on local manifold views (vectorial spaces) for learning purposes. • At the state-of-the art, classifiers are learned on the local views and combined with boosting. 10
  • 11. Interesting issues in using covariances • On the Riemannian Manifold of covariances , the distance between points is • On a tangent space the distance is the usual Euclidean distance 11 So, the question is: - is a good approximation of ? ? - if so, which tangent space must be chosen?
  • 12. Working efficiently with covariances • Covariances are very powerful descriptors with some characteristics: – covariances’ calculation is fast thanks to integral tensor representation [Tuzel et al., ECCV2006] – the computational burden necessary to utilize them is a drawback • The nonlinear manifold of covariances can be turned into a flat one using the Log-Euclidean metric, but the goodness of the approximation have to be estimated. Sectional Curvature Analysis - SCA 12 I. Chavel, Riemannian Geometry - A modern introduction. Cambridge Univ. Press, 2006.
  • 13. Sectional Curvature Analysis - SCA • The space of covariance matrices can be equipped with a Riemannian metric. • SCA is a way to describe the curvature on a Riemannian Manifold which naturally generalize the classical Gaussian curvature for surfaces. • is a homogeneous symmetric space, therefore its negative sectional curvature can be computed at . • If the SCA coefficient is close to 0 implies that the Riemannian Manifold is almost flat (= Euclidean space). 13 Idea
  • 14. SCA (2) Let and their logarithm mapping at ( e.g. ), and an approximated (geodesic) distance 14 It is a non-negative function that depends on the sectional curvature
  • 15. SCA (3) • Experimentally mean value is -10-3 that is far from the standard negative curvature -1 • In this conditions, one can choose any point on which to map the dataset. 15 Sectional Curvature
  • 16. Outline • Problem overview • Feature layout: ArCO • Multi-class classification framework on Riemannian Manifolds • Experiments • Computational considerations • Conclusions and future work 16
  • 17. Multi-class Boosting Framework • For each patch a multi-class boosting classifier is learned. • To reach a computationally feasible solution we exploit the result of SCA in a multi-class LogitBoost learning framework: • Given , where , the Riemannian Manifold of covariance matrices and a set of labels, data is mapped to by: 17
  • 18. LogitBoost [J. Friedman, T. Hastie, and R. Tibshirani, Ann Statist. 2000] 18 • LB is a real (not 1-vs-All) multi-class boosting framework which fits iteratively an additive symmetric logistic model to get the posterior over the classes • The update step combine the weak classification response coming from each class. Multi-class weak classifier Binary weak classifier
  • 19. • At each iteration, LB combine binary weak classification response fitting its own linear/non-linear regressor . • Each multi-class weak learner focuses on a sub-window on an overlapped regular grid of Np patches. • We assign a class label with a estimating where 19 LogitBoost (2)
  • 20. Multi-class Boosting Framework (2) • We eliminate the necessity of using the boosting as feature selector reinforcing the weak learning strategy: Weighted Regression Trees. • Adding an extra class with negative examples and using the rejection cascade*, we can build a robust multi-class detector. • We have established an automatic stopping rule for the learning process: 20* Viola and Jones, CVPR 2001
  • 21. Weighted Regression Trees • WRTs* are binary tree which can be applied to efficiently tackle the weighted nonlinear regression problem. • WRTs growth have been limited strongly in order to use them as weak classifiers. • Boosting Weights are injected into a WRT to refine the regression result. 21* L. Breiman et al., Classification and Regression Trees, CRC Press, 1984
  • 22. Outline • Problem overview • Feature layout: ArCO • Multi-class classification framework on Riemannian Manifolds • Experiments • Computational considerations • Conclusions and future work 22
  • 23. Experiments Datasets Head Pose Classification: • QMUL 4 Head Pose Dataset: – 5 classes (back/front/left/right/background) – 4000 examples/class automatically collected – Image resolution: 128x64 pixels • Additional Examples from INRIA Person Dataset – 2736 head examples / ~ 2000 background examples – head examples are manually classified in 4 classes – Image resolution: 32x pixels Pedestrian Detection: • INRIA Person Dataset: – 3580 pedestrians / 1671 person-free images; – Pedestrian ROI resolution 128x64 pixels. 23
  • 24. 24 Experiments Head Pose Classification 4 classes is used from QMUL 4 Head Pose Dataset (no background). Some examples: Performances using different feature subsets
  • 25. Experiments Head Pose Classification 25 Our, avg =.94, std=.05 Orozco et al., avg =.82, std=.11
  • 26. • 5 classes is used from QMUL 4 Head Pose Dataset. 26 Our, avg =.90, std=.09 Orozco et al., avg =.67, std=.36 * Data provided by QMUL * Data is extracted from INRIA Person Dataset • 5 classes coming from the previous dataset join extra ~ 500 examples/class for the FG and ~2000 for the BG from a more general dataset. Experiments Head Pose Detection
  • 27. Experiments Pedestrian Detection • We use the INRIA Person Dataset where a person is contained in a ROI of 128x64 pixels and its actual average dimension is 50x50 pixels. • To achieve the best performances in terms of FPPW, we imply a cascade of 5 levels. 27 True Positive False Negative miss rate True Negative False Positive FPPW
  • 30. Outline • Problem overview • Feature layout: ArCO • Multi-class classification framework on Riemannian Manifolds • Experiments • Computational considerations • Conclusions and future work 30
  • 31. Computational considerations • Fixing the feature layout, ARCO decreases the computational complexity of the learning phase of one order of magnitude wrt the state-of-the- art boosting framework embedding the feature selection: from to . • Using as unique projection point we reduce the compitational complexity of projection from to . 31 Number of WL per class Number of classes Number of candidate features SVD Number of candidate features In time: from 2 weeks to 2 hours.
  • 32. Computational considerations (2) • The computation of a image integral tensors takes , where is the number of features considered and and are the image dimensions. • The complexity of using regression trees as weak learners (fixing a priori the number of elements per terminal node) is , with the number of samples. 32
  • 33. Conclusions and future work • We have proposed the novel general-purpose ARCO descriptor. • We have built an effective and efficient multi-class LogitBoost framework able to work on Riemannian Manifolds. • SCA is introduced as an effective tool to analyze the curvature of a Riemannian Manifold. • ARCO is able to describe visual objects at prohibitive low resolutions. • ARCO will be used with more general class of object. • In the future, we plan to work on devising a more novel and powerful classification technique, to replace the LogitBoost framework. 34