SlideShare une entreprise Scribd logo
1  sur  22
Télécharger pour lire hors ligne
Outline 
Introduction 
Framework Overview 
Experimental Conditions 
Results and Analysis 
Conclusions and Future Work 
A MKL Based Fusion Framework for Real-Time 
Multi-View Action Recognition 
Feng Gu, Francisco Florez-Revuelta, Dorothy Monekosso and 
Paolo Remagnino 
Digital Imaging Research Centre 
Kingston University, London, UK 
December 3rd, 2014 
1 / 22
Outline 
Introduction 
Framework Overview 
Experimental Conditions 
Results and Analysis 
Conclusions and Future Work 
1 Introduction 
2 Framework Overview 
3 Experimental Conditions 
4 Results and Analysis 
5 Conclusions and Future Work 
2 / 22
Outline 
Introduction 
Framework Overview 
Experimental Conditions 
Results and Analysis 
Conclusions and Future Work 
Background and Motivations 
Real-time multi-view action recognition: 
Gain an increasing interest in video surveillance, human 
computer interaction, and multimedia retrieval etc. 
Provide complementary
eld of views (FOVs) of a monitored 
scene via multiple cameras 
Lead to a more robust decision making based on multiple 
heterogeneous video streams 
Real-time capacity enables continuous long-term monitoring 
If possible multiple cameras should be deployed to monitor 
human behaviour, where data fusion techniques can be applied. 
3 / 22
Outline 
Introduction 
Framework Overview 
Experimental Conditions 
Results and Analysis 
Conclusions and Future Work 
Illustration of the Monitored Scenario 
C4 
C1 
C2 
C3 
4 / 22
Outline 
Introduction 
Framework Overview 
Experimental Conditions 
Results and Analysis 
Conclusions and Future Work 
Motion-Based Person Detector 
We use a state-of-the-art motion-based tracker [6]: 
Each pixel modelled as a mixture of Gaussians in RGB space 
Background model to
nd foreground pixels in a new frame 
Found foreground pixels grouped to form large regions 
associated the person of interest 
Kalman
lters used to track foreground detections 
Person detections generated for every frame 
5 / 22
Outline 
Introduction 
Framework Overview 
Experimental Conditions 
Results and Analysis 
Conclusions and Future Work 
Feature Representation of Videos 
Use of STIP and improved dense trajectories (IDT) [7] as 
local descriptor to extract visual features from a video 
Person detections and frame spans to de
ne a XYT cuboid 
associated with an action performed by the monitored person 
Apply bag of words (BOWs) to compute the feature vector of 
a cuboid, where K-Means clustering used for the generation 
of a codebook 
6 / 22
Outline 
Introduction 
Framework Overview 
Experimental Conditions 
Results and Analysis 
Conclusions and Future Work 
Disciminative Models for Classi
cation 
Let xki 
2 RD, where i 2 f1; 2; : : : ;Ng is the index of a feature 
vector corresponding to a XYT cuboid and k 2 f1; 2; : : : ;Kg is the 
index of a camera view. We learn a SVM classi
er as 
f (x) = 
XN 
i=1 
i yik(xi ; x) + b (1) 
We then compute a classi
cation score via a sigmoid function as 
p(y = 1jx) = 
1 
1 + exp(f (x)) 
(2) 
7 / 22
Outline 
Introduction 
Framework Overview 
Experimental Conditions 
Results and Analysis 
Conclusions and Future Work 
Simple Fusion Strategies 
Ki1i 
Concantenation of Features: concatenate the feature 
vectors of multiple views into one single feature vector such 
that ~xi = [x; : : : ; x] 
Sum of Classi
cation Scores: compute a classi
cation score 
for each camera view p(y = 1jx) as in 2, and then average 
them as 1K 
PK 
k=1 p(y = 1jxk ) 
Product of Classi
cation Scores: apply the product rule to 
tQhe classi
cation scores of all the camera views as K 
k=1 p(y = 1jxk ) 
8 / 22
Outline 
Introduction 
Framework Overview 
Experimental Conditions 
Results and Analysis 
Conclusions and Future Work 
Multiple Kernel Learning 
Combine of multiple kernels corresponding to dierent data 
sources (e.g. camera views) via a convex function such as 
K(xi ; xj ) = 
XK 
k=1
kkk (xi ; xj ) (3) 
where
k  0 and 
PK 
k=1

Contenu connexe

Tendances

Algorithmic Analysis to Video Object Tracking and Background Segmentation and...
Algorithmic Analysis to Video Object Tracking and Background Segmentation and...Algorithmic Analysis to Video Object Tracking and Background Segmentation and...
Algorithmic Analysis to Video Object Tracking and Background Segmentation and...Editor IJCATR
 
Batch normalization
Batch normalizationBatch normalization
Batch normalizationYuichiro Iio
 
Comparison of Matrix Completion Algorithms for Background Initialization in V...
Comparison of Matrix Completion Algorithms for Background Initialization in V...Comparison of Matrix Completion Algorithms for Background Initialization in V...
Comparison of Matrix Completion Algorithms for Background Initialization in V...ActiveEon
 
IRJET- Object Detection using Machine Learning Technique
IRJET- Object Detection using Machine Learning TechniqueIRJET- Object Detection using Machine Learning Technique
IRJET- Object Detection using Machine Learning TechniqueIRJET Journal
 
Review : PolarMask: Single Shot Instance Segmentation with Polar Representati...
Review : PolarMask: Single Shot Instance Segmentation with Polar Representati...Review : PolarMask: Single Shot Instance Segmentation with Polar Representati...
Review : PolarMask: Single Shot Instance Segmentation with Polar Representati...Dongmin Choi
 
Introduction to Chainer Chemistry
Introduction to Chainer ChemistryIntroduction to Chainer Chemistry
Introduction to Chainer ChemistryPreferred Networks
 
A temporal classifier system using spiking neural networks
A temporal classifier system using spiking neural networksA temporal classifier system using spiking neural networks
A temporal classifier system using spiking neural networksDaniele Loiacono
 
MPerceptron
MPerceptronMPerceptron
MPerceptronbutest
 
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...ActiveEon
 
Review : Prototype Mixture Models for Few-shot Semantic Segmentation
Review : Prototype Mixture Models for Few-shot Semantic SegmentationReview : Prototype Mixture Models for Few-shot Semantic Segmentation
Review : Prototype Mixture Models for Few-shot Semantic SegmentationDongmin Choi
 
Image Object Detection Pipeline
Image Object Detection PipelineImage Object Detection Pipeline
Image Object Detection PipelineAbhinav Dadhich
 
[Review] BoxInst: High-Performance Instance Segmentation with Box Annotations...
[Review] BoxInst: High-Performance Instance Segmentation with Box Annotations...[Review] BoxInst: High-Performance Instance Segmentation with Box Annotations...
[Review] BoxInst: High-Performance Instance Segmentation with Box Annotations...Dongmin Choi
 
Moving object detection in video surveillance
Moving object detection in video surveillanceMoving object detection in video surveillance
Moving object detection in video surveillanceAshfaqul Haque John
 
auto-assistance system for visually impaired person
auto-assistance system for visually impaired personauto-assistance system for visually impaired person
auto-assistance system for visually impaired personshahsamkit73
 
Pose Machine
Pose MachinePose Machine
Pose MachineWei Yang
 
FPT17: An object detector based on multiscale sliding window search using a f...
FPT17: An object detector based on multiscale sliding window search using a f...FPT17: An object detector based on multiscale sliding window search using a f...
FPT17: An object detector based on multiscale sliding window search using a f...Hiroki Nakahara
 
Landmark Retrieval & Recognition
Landmark Retrieval & RecognitionLandmark Retrieval & Recognition
Landmark Retrieval & Recognitionkenluck2001
 

Tendances (19)

Algorithmic Analysis to Video Object Tracking and Background Segmentation and...
Algorithmic Analysis to Video Object Tracking and Background Segmentation and...Algorithmic Analysis to Video Object Tracking and Background Segmentation and...
Algorithmic Analysis to Video Object Tracking and Background Segmentation and...
 
Batch normalization
Batch normalizationBatch normalization
Batch normalization
 
Comparison of Matrix Completion Algorithms for Background Initialization in V...
Comparison of Matrix Completion Algorithms for Background Initialization in V...Comparison of Matrix Completion Algorithms for Background Initialization in V...
Comparison of Matrix Completion Algorithms for Background Initialization in V...
 
IRJET- Object Detection using Machine Learning Technique
IRJET- Object Detection using Machine Learning TechniqueIRJET- Object Detection using Machine Learning Technique
IRJET- Object Detection using Machine Learning Technique
 
Review : PolarMask: Single Shot Instance Segmentation with Polar Representati...
Review : PolarMask: Single Shot Instance Segmentation with Polar Representati...Review : PolarMask: Single Shot Instance Segmentation with Polar Representati...
Review : PolarMask: Single Shot Instance Segmentation with Polar Representati...
 
Introduction to Chainer Chemistry
Introduction to Chainer ChemistryIntroduction to Chainer Chemistry
Introduction to Chainer Chemistry
 
A temporal classifier system using spiking neural networks
A temporal classifier system using spiking neural networksA temporal classifier system using spiking neural networks
A temporal classifier system using spiking neural networks
 
MPerceptron
MPerceptronMPerceptron
MPerceptron
 
kanimozhi2019.pdf
kanimozhi2019.pdfkanimozhi2019.pdf
kanimozhi2019.pdf
 
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
 
Object Detection (D2L5 Insight@DCU Machine Learning Workshop 2017)
Object Detection (D2L5 Insight@DCU Machine Learning Workshop 2017)Object Detection (D2L5 Insight@DCU Machine Learning Workshop 2017)
Object Detection (D2L5 Insight@DCU Machine Learning Workshop 2017)
 
Review : Prototype Mixture Models for Few-shot Semantic Segmentation
Review : Prototype Mixture Models for Few-shot Semantic SegmentationReview : Prototype Mixture Models for Few-shot Semantic Segmentation
Review : Prototype Mixture Models for Few-shot Semantic Segmentation
 
Image Object Detection Pipeline
Image Object Detection PipelineImage Object Detection Pipeline
Image Object Detection Pipeline
 
[Review] BoxInst: High-Performance Instance Segmentation with Box Annotations...
[Review] BoxInst: High-Performance Instance Segmentation with Box Annotations...[Review] BoxInst: High-Performance Instance Segmentation with Box Annotations...
[Review] BoxInst: High-Performance Instance Segmentation with Box Annotations...
 
Moving object detection in video surveillance
Moving object detection in video surveillanceMoving object detection in video surveillance
Moving object detection in video surveillance
 
auto-assistance system for visually impaired person
auto-assistance system for visually impaired personauto-assistance system for visually impaired person
auto-assistance system for visually impaired person
 
Pose Machine
Pose MachinePose Machine
Pose Machine
 
FPT17: An object detector based on multiscale sliding window search using a f...
FPT17: An object detector based on multiscale sliding window search using a f...FPT17: An object detector based on multiscale sliding window search using a f...
FPT17: An object detector based on multiscale sliding window search using a f...
 
Landmark Retrieval & Recognition
Landmark Retrieval & RecognitionLandmark Retrieval & Recognition
Landmark Retrieval & Recognition
 

En vedette

Sparse Kernel Learning for Image Annotation
Sparse Kernel Learning for Image AnnotationSparse Kernel Learning for Image Annotation
Sparse Kernel Learning for Image AnnotationSean Moran
 
cvpr2011: human activity recognition - part 6: applications
cvpr2011: human activity recognition - part 6: applicationscvpr2011: human activity recognition - part 6: applications
cvpr2011: human activity recognition - part 6: applicationszukun
 
Visual Object Category Recognition
Visual Object Category RecognitionVisual Object Category Recognition
Visual Object Category RecognitionAshish Gupta
 
cvpr2011: human activity recognition - part 3: single layer
cvpr2011: human activity recognition - part 3: single layercvpr2011: human activity recognition - part 3: single layer
cvpr2011: human activity recognition - part 3: single layerzukun
 
Introduction to Action Recognition in Python by Bertrand Nouvel, Jonathan Kel...
Introduction to Action Recognition in Python by Bertrand Nouvel, Jonathan Kel...Introduction to Action Recognition in Python by Bertrand Nouvel, Jonathan Kel...
Introduction to Action Recognition in Python by Bertrand Nouvel, Jonathan Kel...PyData
 
Human Activity Recognition in Android
Human Activity Recognition in AndroidHuman Activity Recognition in Android
Human Activity Recognition in AndroidSurbhi Jain
 
CVML2011: human action recognition (Ivan Laptev)
CVML2011: human action recognition (Ivan Laptev)CVML2011: human action recognition (Ivan Laptev)
CVML2011: human action recognition (Ivan Laptev)zukun
 
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon TransformHuman Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon TransformFadwa Fouad
 
Multiple Kernel Learning based Approach to Representation and Feature Selecti...
Multiple Kernel Learning based Approach to Representation and Feature Selecti...Multiple Kernel Learning based Approach to Representation and Feature Selecti...
Multiple Kernel Learning based Approach to Representation and Feature Selecti...ICAC09
 
Motion recognition with Android devices
Motion recognition with Android devicesMotion recognition with Android devices
Motion recognition with Android devicesGabor Paller
 
Activity Recognition using Cell Phone Accelerometers
Activity Recognition using Cell Phone AccelerometersActivity Recognition using Cell Phone Accelerometers
Activity Recognition using Cell Phone AccelerometersIshara Amarasekera
 
Object recognition
Object recognitionObject recognition
Object recognitionsaniacorreya
 
Action Recognition (Thesis presentation)
Action Recognition (Thesis presentation)Action Recognition (Thesis presentation)
Action Recognition (Thesis presentation)nikhilus85
 
Human activity recognition
Human activity recognition Human activity recognition
Human activity recognition srikanthgadam
 
Object Detection and Recognition
Object Detection and Recognition Object Detection and Recognition
Object Detection and Recognition Intel Nervana
 
Human activity recognition
Human activity recognitionHuman activity recognition
Human activity recognitionRandhir Gupta
 
Object recognition
Object recognitionObject recognition
Object recognitionakkichester
 
Object detection
Object detectionObject detection
Object detectionSomesh Vyas
 

En vedette (20)

Sparse Kernel Learning for Image Annotation
Sparse Kernel Learning for Image AnnotationSparse Kernel Learning for Image Annotation
Sparse Kernel Learning for Image Annotation
 
cvpr2011: human activity recognition - part 6: applications
cvpr2011: human activity recognition - part 6: applicationscvpr2011: human activity recognition - part 6: applications
cvpr2011: human activity recognition - part 6: applications
 
Visual Object Category Recognition
Visual Object Category RecognitionVisual Object Category Recognition
Visual Object Category Recognition
 
cvpr2011: human activity recognition - part 3: single layer
cvpr2011: human activity recognition - part 3: single layercvpr2011: human activity recognition - part 3: single layer
cvpr2011: human activity recognition - part 3: single layer
 
Introduction to Action Recognition in Python by Bertrand Nouvel, Jonathan Kel...
Introduction to Action Recognition in Python by Bertrand Nouvel, Jonathan Kel...Introduction to Action Recognition in Python by Bertrand Nouvel, Jonathan Kel...
Introduction to Action Recognition in Python by Bertrand Nouvel, Jonathan Kel...
 
Object recognition
Object recognitionObject recognition
Object recognition
 
Human Activity Recognition in Android
Human Activity Recognition in AndroidHuman Activity Recognition in Android
Human Activity Recognition in Android
 
CVML2011: human action recognition (Ivan Laptev)
CVML2011: human action recognition (Ivan Laptev)CVML2011: human action recognition (Ivan Laptev)
CVML2011: human action recognition (Ivan Laptev)
 
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon TransformHuman Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
Human Action Recognition in Videos Employing 2DPCA on 2DHOOF and Radon Transform
 
Multiple Kernel Learning based Approach to Representation and Feature Selecti...
Multiple Kernel Learning based Approach to Representation and Feature Selecti...Multiple Kernel Learning based Approach to Representation and Feature Selecti...
Multiple Kernel Learning based Approach to Representation and Feature Selecti...
 
Motion recognition with Android devices
Motion recognition with Android devicesMotion recognition with Android devices
Motion recognition with Android devices
 
Activity Recognition using Cell Phone Accelerometers
Activity Recognition using Cell Phone AccelerometersActivity Recognition using Cell Phone Accelerometers
Activity Recognition using Cell Phone Accelerometers
 
Object recognition
Object recognitionObject recognition
Object recognition
 
Action Recognition (Thesis presentation)
Action Recognition (Thesis presentation)Action Recognition (Thesis presentation)
Action Recognition (Thesis presentation)
 
Human activity recognition
Human activity recognition Human activity recognition
Human activity recognition
 
Object Detection and Recognition
Object Detection and Recognition Object Detection and Recognition
Object Detection and Recognition
 
Human activity recognition
Human activity recognitionHuman activity recognition
Human activity recognition
 
Object recognition
Object recognitionObject recognition
Object recognition
 
Object detection
Object detectionObject detection
Object detection
 
Moving object detection
Moving object detectionMoving object detection
Moving object detection
 

Similaire à A Multiple Kernel Learning Based Fusion Framework for Real-Time Multi-View Action Recognition

SENSITIVITY OF A VIDEO SURVEILLANCE SYSTEM BASED ON MOTION DETECTION
SENSITIVITY OF A VIDEO SURVEILLANCE SYSTEM BASED ON MOTION DETECTIONSENSITIVITY OF A VIDEO SURVEILLANCE SYSTEM BASED ON MOTION DETECTION
SENSITIVITY OF A VIDEO SURVEILLANCE SYSTEM BASED ON MOTION DETECTIONsipij
 
Sparse representation based human action recognition using an action region-a...
Sparse representation based human action recognition using an action region-a...Sparse representation based human action recognition using an action region-a...
Sparse representation based human action recognition using an action region-a...Wesley De Neve
 
Optimising Autonomous Robot Swarm Parameters for Stable Formation Design
Optimising Autonomous Robot Swarm Parameters for Stable Formation DesignOptimising Autonomous Robot Swarm Parameters for Stable Formation Design
Optimising Autonomous Robot Swarm Parameters for Stable Formation DesignDaniel H. Stolfi
 
A science-gateway for workflow executions: online and non-clairvoyant self-h...
A science-gateway for workflow executions: online and non-clairvoyant self-h...A science-gateway for workflow executions: online and non-clairvoyant self-h...
A science-gateway for workflow executions: online and non-clairvoyant self-h...Rafael Ferreira da Silva
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
Self-supervised ECG Representation Learning for Affective Computing
Self-supervised ECG Representation Learning for Affective ComputingSelf-supervised ECG Representation Learning for Affective Computing
Self-supervised ECG Representation Learning for Affective ComputingPritam Sarkar
 
Semantic Concept Detection in Video Using Hybrid Model of CNN and SVM Classif...
Semantic Concept Detection in Video Using Hybrid Model of CNN and SVM Classif...Semantic Concept Detection in Video Using Hybrid Model of CNN and SVM Classif...
Semantic Concept Detection in Video Using Hybrid Model of CNN and SVM Classif...CSCJournals
 
Gait Based Person Recognition Using Partial Least Squares Selection Scheme
Gait Based Person Recognition Using Partial Least Squares Selection Scheme Gait Based Person Recognition Using Partial Least Squares Selection Scheme
Gait Based Person Recognition Using Partial Least Squares Selection Scheme ijcisjournal
 
Visualizing the Model Selection Process
Visualizing the Model Selection ProcessVisualizing the Model Selection Process
Visualizing the Model Selection ProcessBenjamin Bengfort
 
Machine learning algorithm for classification of activity of daily life’s
Machine learning algorithm for classification of activity of daily life’sMachine learning algorithm for classification of activity of daily life’s
Machine learning algorithm for classification of activity of daily life’sSiddharth Chakravarty
 
A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...
A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...
A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...Rashid Mijumbi
 
Reading group - Week 2 - Trajectory Pooled Deep-Convolutional Descriptors (TDD)
Reading group - Week 2 - Trajectory Pooled Deep-Convolutional Descriptors (TDD)Reading group - Week 2 - Trajectory Pooled Deep-Convolutional Descriptors (TDD)
Reading group - Week 2 - Trajectory Pooled Deep-Convolutional Descriptors (TDD)Saimunur Rahman
 
IRJET- Crowd Density Estimation using Novel Feature Descriptor
IRJET- Crowd Density Estimation using Novel Feature DescriptorIRJET- Crowd Density Estimation using Novel Feature Descriptor
IRJET- Crowd Density Estimation using Novel Feature DescriptorIRJET Journal
 
A Video Processing based System for Counting Vehicles
A Video Processing based System for Counting VehiclesA Video Processing based System for Counting Vehicles
A Video Processing based System for Counting VehiclesIRJET Journal
 
A scalable collaborative filtering framework based on co clustering
A scalable collaborative filtering framework based on co clusteringA scalable collaborative filtering framework based on co clustering
A scalable collaborative filtering framework based on co clusteringAllenWu
 
IRJET- A Review Analysis to Detect an Object in Video Surveillance System
IRJET- A Review Analysis to Detect an Object in Video Surveillance SystemIRJET- A Review Analysis to Detect an Object in Video Surveillance System
IRJET- A Review Analysis to Detect an Object in Video Surveillance SystemIRJET Journal
 

Similaire à A Multiple Kernel Learning Based Fusion Framework for Real-Time Multi-View Action Recognition (20)

Report
ReportReport
Report
 
SENSITIVITY OF A VIDEO SURVEILLANCE SYSTEM BASED ON MOTION DETECTION
SENSITIVITY OF A VIDEO SURVEILLANCE SYSTEM BASED ON MOTION DETECTIONSENSITIVITY OF A VIDEO SURVEILLANCE SYSTEM BASED ON MOTION DETECTION
SENSITIVITY OF A VIDEO SURVEILLANCE SYSTEM BASED ON MOTION DETECTION
 
Sparse representation based human action recognition using an action region-a...
Sparse representation based human action recognition using an action region-a...Sparse representation based human action recognition using an action region-a...
Sparse representation based human action recognition using an action region-a...
 
Optimising Autonomous Robot Swarm Parameters for Stable Formation Design
Optimising Autonomous Robot Swarm Parameters for Stable Formation DesignOptimising Autonomous Robot Swarm Parameters for Stable Formation Design
Optimising Autonomous Robot Swarm Parameters for Stable Formation Design
 
A science-gateway for workflow executions: online and non-clairvoyant self-h...
A science-gateway for workflow executions: online and non-clairvoyant self-h...A science-gateway for workflow executions: online and non-clairvoyant self-h...
A science-gateway for workflow executions: online and non-clairvoyant self-h...
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
Self-supervised ECG Representation Learning for Affective Computing
Self-supervised ECG Representation Learning for Affective ComputingSelf-supervised ECG Representation Learning for Affective Computing
Self-supervised ECG Representation Learning for Affective Computing
 
Semantic Concept Detection in Video Using Hybrid Model of CNN and SVM Classif...
Semantic Concept Detection in Video Using Hybrid Model of CNN and SVM Classif...Semantic Concept Detection in Video Using Hybrid Model of CNN and SVM Classif...
Semantic Concept Detection in Video Using Hybrid Model of CNN and SVM Classif...
 
Gait Based Person Recognition Using Partial Least Squares Selection Scheme
Gait Based Person Recognition Using Partial Least Squares Selection Scheme Gait Based Person Recognition Using Partial Least Squares Selection Scheme
Gait Based Person Recognition Using Partial Least Squares Selection Scheme
 
Visualizing the Model Selection Process
Visualizing the Model Selection ProcessVisualizing the Model Selection Process
Visualizing the Model Selection Process
 
MTVS Poster
MTVS PosterMTVS Poster
MTVS Poster
 
Machine learning algorithm for classification of activity of daily life’s
Machine learning algorithm for classification of activity of daily life’sMachine learning algorithm for classification of activity of daily life’s
Machine learning algorithm for classification of activity of daily life’s
 
Kq3518291832
Kq3518291832Kq3518291832
Kq3518291832
 
A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...
A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...
A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...
 
Reading group - Week 2 - Trajectory Pooled Deep-Convolutional Descriptors (TDD)
Reading group - Week 2 - Trajectory Pooled Deep-Convolutional Descriptors (TDD)Reading group - Week 2 - Trajectory Pooled Deep-Convolutional Descriptors (TDD)
Reading group - Week 2 - Trajectory Pooled Deep-Convolutional Descriptors (TDD)
 
IRJET- Crowd Density Estimation using Novel Feature Descriptor
IRJET- Crowd Density Estimation using Novel Feature DescriptorIRJET- Crowd Density Estimation using Novel Feature Descriptor
IRJET- Crowd Density Estimation using Novel Feature Descriptor
 
A Video Processing based System for Counting Vehicles
A Video Processing based System for Counting VehiclesA Video Processing based System for Counting Vehicles
A Video Processing based System for Counting Vehicles
 
A scalable collaborative filtering framework based on co clustering
A scalable collaborative filtering framework based on co clusteringA scalable collaborative filtering framework based on co clustering
A scalable collaborative filtering framework based on co clustering
 
IRJET- A Review Analysis to Detect an Object in Video Surveillance System
IRJET- A Review Analysis to Detect an Object in Video Surveillance SystemIRJET- A Review Analysis to Detect an Object in Video Surveillance System
IRJET- A Review Analysis to Detect an Object in Video Surveillance System
 
Ls3520052009
Ls3520052009Ls3520052009
Ls3520052009
 

Plus de Francisco (Paco) Florez-Revuelta

Visual monitoring of people in private spaces. From the “Big Brother” to the...
Visual monitoring of people in private spaces.  From the “Big Brother” to the...Visual monitoring of people in private spaces.  From the “Big Brother” to the...
Visual monitoring of people in private spaces. From the “Big Brother” to the...Francisco (Paco) Florez-Revuelta
 
Topology-Preserving Ordering of the RGB Space with an Evolutionary Algorithm
Topology-Preserving Ordering of the RGB Space with an Evolutionary AlgorithmTopology-Preserving Ordering of the RGB Space with an Evolutionary Algorithm
Topology-Preserving Ordering of the RGB Space with an Evolutionary AlgorithmFrancisco (Paco) Florez-Revuelta
 
Using Wavelet Transform to Disaggregate Electrical Power Consumption into the...
Using Wavelet Transform to Disaggregate Electrical Power Consumption into the...Using Wavelet Transform to Disaggregate Electrical Power Consumption into the...
Using Wavelet Transform to Disaggregate Electrical Power Consumption into the...Francisco (Paco) Florez-Revuelta
 
Visual Privacy by Context: A Level-Based Visualisation Scheme - UCAmI 2014
Visual Privacy by Context: A Level-Based Visualisation Scheme - UCAmI 2014Visual Privacy by Context: A Level-Based Visualisation Scheme - UCAmI 2014
Visual Privacy by Context: A Level-Based Visualisation Scheme - UCAmI 2014Francisco (Paco) Florez-Revuelta
 
Continuous human action recognition in ambient assisted living scenarios
Continuous human action recognition in ambient assisted living scenariosContinuous human action recognition in ambient assisted living scenarios
Continuous human action recognition in ambient assisted living scenariosFrancisco (Paco) Florez-Revuelta
 
Evolutionary algorithm for dense pixel matching in presence of distortions
Evolutionary algorithm for dense pixel matching in presence of distortionsEvolutionary algorithm for dense pixel matching in presence of distortions
Evolutionary algorithm for dense pixel matching in presence of distortionsFrancisco (Paco) Florez-Revuelta
 
Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...
Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...
Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...Francisco (Paco) Florez-Revuelta
 
Analysis of the topology preservation of accelerated Growing Neural Gas in th...
Analysis of the topology preservation of accelerated Growing Neural Gas in th...Analysis of the topology preservation of accelerated Growing Neural Gas in th...
Analysis of the topology preservation of accelerated Growing Neural Gas in th...Francisco (Paco) Florez-Revuelta
 
Producto Topográfico Geodésico: Mejora para medir la preservación de la topol...
Producto Topográfico Geodésico: Mejora para medir la preservación de la topol...Producto Topográfico Geodésico: Mejora para medir la preservación de la topol...
Producto Topográfico Geodésico: Mejora para medir la preservación de la topol...Francisco (Paco) Florez-Revuelta
 
Human action recognition optimization based on evolutionary feature
Human action recognition optimization based on evolutionary feature Human action recognition optimization based on evolutionary feature
Human action recognition optimization based on evolutionary feature Francisco (Paco) Florez-Revuelta
 
Specific crossover and mutation operators for a grouping problem based on int...
Specific crossover and mutation operators for a grouping problem based on int...Specific crossover and mutation operators for a grouping problem based on int...
Specific crossover and mutation operators for a grouping problem based on int...Francisco (Paco) Florez-Revuelta
 
Ambient Assisted Living. ICT for the support of the daily living of elderly a...
Ambient Assisted Living. ICT for the support of the daily living of elderly a...Ambient Assisted Living. ICT for the support of the daily living of elderly a...
Ambient Assisted Living. ICT for the support of the daily living of elderly a...Francisco (Paco) Florez-Revuelta
 

Plus de Francisco (Paco) Florez-Revuelta (17)

AAL Forum 2016
AAL Forum 2016AAL Forum 2016
AAL Forum 2016
 
Visual monitoring of people in private spaces
Visual monitoring of people in private spacesVisual monitoring of people in private spaces
Visual monitoring of people in private spaces
 
Visual monitoring of people in private spaces. From the “Big Brother” to the...
Visual monitoring of people in private spaces.  From the “Big Brother” to the...Visual monitoring of people in private spaces.  From the “Big Brother” to the...
Visual monitoring of people in private spaces. From the “Big Brother” to the...
 
Topology-Preserving Ordering of the RGB Space with an Evolutionary Algorithm
Topology-Preserving Ordering of the RGB Space with an Evolutionary AlgorithmTopology-Preserving Ordering of the RGB Space with an Evolutionary Algorithm
Topology-Preserving Ordering of the RGB Space with an Evolutionary Algorithm
 
Using Wavelet Transform to Disaggregate Electrical Power Consumption into the...
Using Wavelet Transform to Disaggregate Electrical Power Consumption into the...Using Wavelet Transform to Disaggregate Electrical Power Consumption into the...
Using Wavelet Transform to Disaggregate Electrical Power Consumption into the...
 
Visual Privacy by Context: A Level-Based Visualisation Scheme - UCAmI 2014
Visual Privacy by Context: A Level-Based Visualisation Scheme - UCAmI 2014Visual Privacy by Context: A Level-Based Visualisation Scheme - UCAmI 2014
Visual Privacy by Context: A Level-Based Visualisation Scheme - UCAmI 2014
 
Continuous human action recognition in ambient assisted living scenarios
Continuous human action recognition in ambient assisted living scenariosContinuous human action recognition in ambient assisted living scenarios
Continuous human action recognition in ambient assisted living scenarios
 
Vision-based monitoring of people in private spaces
Vision-based monitoring of people in private spacesVision-based monitoring of people in private spaces
Vision-based monitoring of people in private spaces
 
Evolutionary algorithm for dense pixel matching in presence of distortions
Evolutionary algorithm for dense pixel matching in presence of distortionsEvolutionary algorithm for dense pixel matching in presence of distortions
Evolutionary algorithm for dense pixel matching in presence of distortions
 
Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...
Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...
Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...
 
Analysis of the topology preservation of accelerated Growing Neural Gas in th...
Analysis of the topology preservation of accelerated Growing Neural Gas in th...Analysis of the topology preservation of accelerated Growing Neural Gas in th...
Analysis of the topology preservation of accelerated Growing Neural Gas in th...
 
Producto Topográfico Geodésico: Mejora para medir la preservación de la topol...
Producto Topográfico Geodésico: Mejora para medir la preservación de la topol...Producto Topográfico Geodésico: Mejora para medir la preservación de la topol...
Producto Topográfico Geodésico: Mejora para medir la preservación de la topol...
 
Human action recognition optimization based on evolutionary feature
Human action recognition optimization based on evolutionary feature Human action recognition optimization based on evolutionary feature
Human action recognition optimization based on evolutionary feature
 
Specific crossover and mutation operators for a grouping problem based on int...
Specific crossover and mutation operators for a grouping problem based on int...Specific crossover and mutation operators for a grouping problem based on int...
Specific crossover and mutation operators for a grouping problem based on int...
 
Ambient Assisted Living. ICT for the support of the daily living of elderly a...
Ambient Assisted Living. ICT for the support of the daily living of elderly a...Ambient Assisted Living. ICT for the support of the daily living of elderly a...
Ambient Assisted Living. ICT for the support of the daily living of elderly a...
 
Use of Kinect in a Smart Home
Use of Kinect in a Smart HomeUse of Kinect in a Smart Home
Use of Kinect in a Smart Home
 
Metaltic
MetalticMetaltic
Metaltic
 

Dernier

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 

Dernier (20)

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 

A Multiple Kernel Learning Based Fusion Framework for Real-Time Multi-View Action Recognition

  • 1. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work A MKL Based Fusion Framework for Real-Time Multi-View Action Recognition Feng Gu, Francisco Florez-Revuelta, Dorothy Monekosso and Paolo Remagnino Digital Imaging Research Centre Kingston University, London, UK December 3rd, 2014 1 / 22
  • 2. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work 1 Introduction 2 Framework Overview 3 Experimental Conditions 4 Results and Analysis 5 Conclusions and Future Work 2 / 22
  • 3. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work Background and Motivations Real-time multi-view action recognition: Gain an increasing interest in video surveillance, human computer interaction, and multimedia retrieval etc. Provide complementary
  • 4. eld of views (FOVs) of a monitored scene via multiple cameras Lead to a more robust decision making based on multiple heterogeneous video streams Real-time capacity enables continuous long-term monitoring If possible multiple cameras should be deployed to monitor human behaviour, where data fusion techniques can be applied. 3 / 22
  • 5. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work Illustration of the Monitored Scenario C4 C1 C2 C3 4 / 22
  • 6. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work Motion-Based Person Detector We use a state-of-the-art motion-based tracker [6]: Each pixel modelled as a mixture of Gaussians in RGB space Background model to
  • 7. nd foreground pixels in a new frame Found foreground pixels grouped to form large regions associated the person of interest Kalman
  • 8. lters used to track foreground detections Person detections generated for every frame 5 / 22
  • 9. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work Feature Representation of Videos Use of STIP and improved dense trajectories (IDT) [7] as local descriptor to extract visual features from a video Person detections and frame spans to de
  • 10. ne a XYT cuboid associated with an action performed by the monitored person Apply bag of words (BOWs) to compute the feature vector of a cuboid, where K-Means clustering used for the generation of a codebook 6 / 22
  • 11. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work Disciminative Models for Classi
  • 12. cation Let xki 2 RD, where i 2 f1; 2; : : : ;Ng is the index of a feature vector corresponding to a XYT cuboid and k 2 f1; 2; : : : ;Kg is the index of a camera view. We learn a SVM classi
  • 13. er as f (x) = XN i=1 i yik(xi ; x) + b (1) We then compute a classi
  • 14. cation score via a sigmoid function as p(y = 1jx) = 1 1 + exp(f (x)) (2) 7 / 22
  • 15. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work Simple Fusion Strategies Ki1i Concantenation of Features: concatenate the feature vectors of multiple views into one single feature vector such that ~xi = [x; : : : ; x] Sum of Classi
  • 17. cation score for each camera view p(y = 1jx) as in 2, and then average them as 1K PK k=1 p(y = 1jxk ) Product of Classi
  • 18. cation Scores: apply the product rule to tQhe classi
  • 19. cation scores of all the camera views as K k=1 p(y = 1jxk ) 8 / 22
  • 20. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work Multiple Kernel Learning Combine of multiple kernels corresponding to dierent data sources (e.g. camera views) via a convex function such as K(xi ; xj ) = XK k=1
  • 21. kkk (xi ; xj ) (3) where
  • 22. k 0 and PK k=1
  • 23. k = 1 and each kernel kk only uses a distinct set of features from a data source. 9 / 22
  • 24. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work Two-Step Optimisation We need to learn the kernel parameters weights (k ) and bias (bk ) of a SVM model, and the combination parameters
  • 25. k in 3. This can be solved as follows: Step-1: optimise over the kernel parameters k and bk while
  • 26. xing the combination parameters
  • 27. k (quadratic programming) Step-2: optimise over the combination parameters
  • 29. xing the kernel parameters k and bk (gradient decent) Alternates between two steps iteratively until the system converges to an optimal solution 10 / 22
  • 30. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work IXMAS Multi-View Dataset Created for view-invariant human action recognition [8] Include 13 daily actions, each of which performed 3 times by 12 actors Video sequences collected via 5 cameras, at 23 frames per second and 390 291 resolution We use all 12 actors and 5 cameras and evaluate 11 actions as in [9] Leave-one-subject-out cross validation used in the experiments 11 / 22
  • 31. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work Implementation Details A codebook sized 4000 quantised from 100000 randomly selected descriptor features of the training set STIP descriptor uses the entire image plane and the frame span of an action given in the ground truth to de
  • 32. ne a cuboid IDT descriptor relies on the person detections in addition to the frame span All the SVM models use `1 normalisation and the 2 kernel 12 / 22
  • 33. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work Person Detection Results cam0 cam1 cam2 cam3 cam4 Figure: Detection results of the motion-based tracker of the
  • 34. rst run of the subject `Alba', for all the camera views. 13 / 22
  • 35. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work Results of STIP (Internal Comparison) 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 check watch sit down cross arms scratch head get up turn around walk wave punch kick pick up SVM−COM SVM−SUM SVM−PRD SVM−MKL Figure: Class-wise mean recognition rates of all the folds of the compared methods using STIP descriptor, where SVMCOM = 0:819, SVMSUM = 0:820, SVMPRD = 0:815, and SVMMKL = 0:842. 14 / 22
  • 36. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work Results of IDT (Internal Comparison) 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 check watch sit down cross arms scratch head get up turn around walk wave punch kick pick up SVM−COM SVM−SUM SVM−PRD SVM−MKL Figure: Class-wise mean recognition rates of all the folds of the compared methods using IDT descriptor, where SVMCOM = 0:915, SVMSUM = 0:927, SVMPRD = 0:921, and SVMMKL = 0:950. 15 / 22
  • 37. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work Comparison with State-of-the-Art (External Comparison) Method Actions Actors Views Rate FPS Cilla et al. [3] 11 12 5 0.913 N/A Weiland et al. [10] 11 10 5 0.933 N/A Cilla et al. [4] 11 10 5 0.940 N/A Holte et al. [5] 13 12 5 1.000 N/A Weinland et al. [9] 11 10 5 0.835 500 Chaaraoui et al. [1] 11 12 5 0.859 26 Chaaraoui et al. [2] 11 12 5 0.914 207 SVM-MKL (IDT+BOWs) 11 12 5 0.950 25 Table: Comparison of the proposed MKL method using (IDT) descriptor and BOWs, where the methods with `N/A' in the FPS column are oine. 16 / 22
  • 38. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work Conlusions and Future Work Proposed MKL based framework outperforms the simple fusion techniques, and the state-of-the-art methods IDT descriptor superior than STIP descriptor for feature representation in action recognition The proposed framework capable of performing real-time action recognition at 25 frames per second For the future, apply to other similar vision problems, and study alternative feature representation and fusion techniques. 17 / 22
  • 39. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work Thank you very much! Any questions? 18 / 22
  • 40. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work A. A. Chaaraoui, P. Climent-Perez, and F. Florez-Revuelta. Silhouette-based human action recognition using sequences of key poses. Pattern Recognition Letters, 34:1799{1807, 2013. A. A. Chaaraoui, J. R. Padilla-Lopez, F. J. Ferrandez-Pastor, M. Nieto-Hidalgo, and F. Florez-Revuelta. A vision-based system for intelligent monitoring: human behaviour analysis and privacy by context. Sensors, 14:8895{8925, 2014. R. Cilla, M. A. Patricio, and A. Berlanga. A probabilistic, discriminative and distributed system for the recognition of human actions from multiple views. Neurocomputing, 75:78{87, 2012. R. Cilla, M. A. Patricio, A. Berlanga, and J. M. Molina. 19 / 22
  • 41. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work Human action recognition with sparse classi
  • 42. cation and multiple-view learning. Expert Systems, DOI: 10.1111/exsy.12040, 2013. M. Holte, B. Chakraborty, J. Gonzalez, and T. Moeslund. A local 3-D motion descriptor for mult-view human action recognition from 4-D spatio-temporal interest points. IEEE Journal of Selected Topics in Signal Processing, 6:553{565, 2012. C. Stauer and W. Grimson. Learning patterns of activity using real time tracking. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 22(8):747{767, 2000. H. Wang, M. M. Ullah, A. Klaser, I. Laptev, and C. Schmid. 20 / 22
  • 43. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work Evaluation of local spatio-temporal features for action recognition. In British Machine Vision Conference (BMVC), 2009. D. Weinland, E. Boyer, and R. Ronfard. Action recognition from arbitrary views using 3d exemplars. In IEEE International Conference on Computer Vision (ICCV), pages 1{7, 2007. D. Weinland, M. Ozuysal, and P. Fua. Making action recognition robust to occlusions and viewpoint changes. In European Conference on Computer Vision, 2010. D. Weinland, R. Ronfard, and E. Boyer. Free viewpoint action recognition using motion history volumes. 21 / 22
  • 44. Outline Introduction Framework Overview Experimental Conditions Results and Analysis Conclusions and Future Work Computer Vision and Image Understanding, 104(2-3):249{257, 2006. 22 / 22