Silhouette Analysis-Based
Action Recognition via
Exploiting Human Poses
CONTENTS
• Abstract & Objective
• Introduction
• Software Requirement
• Hardware Requirement
• Existing System & Disadvantages
• Proposed System & Advantages
• Literature Survey
• Application
• Conclusion
• References
Abstract
• In this paper, we propose a novel scheme for human action recognition that combines the advantages of both local and global representations.
• We explore human silhouettes for human action representation by taking into account the correlation between sequential poses in an action.
• A modified bag-of-words model, named bag of correlated poses (BoCP), is introduced to encode temporally local features of actions.
• To utilize the property of visual word ambiguity, we adopt a soft-assignment strategy to reduce the dimensionality of our model.
• To compensate for the loss of structural information, we propose an extended motion template, i.e., extensions of the motion history image (MHI), to capture the holistic structural features.
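The motion history image mentioned in the abstract can be illustrated with a minimal sketch. It shows only the standard MHI update rule (a pixel is set to tau where motion is detected and otherwise decays by one per frame), not the extended template proposed in the paper, whose details are not given on these slides. The function names, the list-of-lists image format, and the toy input are assumptions made for illustration.

```python
def update_mhi(mhi, motion_mask, tau):
    """Return the next motion history image (standard MHI rule, not the extended one).

    mhi         -- 2D list of ints, the previous history image
    motion_mask -- 2D list of 0/1, 1 where motion was detected in the new frame
    tau         -- int, number of frames a motion trace stays visible
    """
    return [
        [tau if moving else max(0, old - 1)
         for old, moving in zip(mhi_row, mask_row)]
        for mhi_row, mask_row in zip(mhi, motion_mask)
    ]


def motion_history(masks, tau):
    """Fold a sequence of binary motion masks into a single MHI."""
    height, width = len(masks[0]), len(masks[0][0])
    mhi = [[0] * width for _ in range(height)]
    for mask in masks:
        mhi = update_mhi(mhi, mask, tau)
    return mhi


# Toy input (assumed): a 1x4 "video" in which one moving pixel travels left to right.
masks = [
    [[1, 0, 0, 0]],
    [[0, 1, 0, 0]],
    [[0, 0, 1, 0]],
]
print(motion_history(masks, tau=3))  # [[1, 2, 3, 0]]
```

More recent motion gets a larger value, so a single image summarizes where and in which order the silhouette moved.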
OBJECTIVE
• The objective of vision-based human action
recognition is to label the video sequence with its
corresponding action category.
Software Requirement
• Operating System: Windows XP
• Language: MATLAB
• Version: MATLAB 7.9
Hardware Requirement
• Pentium IV – 2.7 GHz
• 1 GB DDR RAM
• 250 GB Hard Disk
Existing System
• STIPs detected using a temporal Gabor filter and a spatial Gaussian filter.
• STIP detectors and descriptors such as Harris3D, Cuboid, 3D-Hessian, dense sampling, spatiotemporal regularity-based feature, HOG/HOF, HOG3D, extended SURF, and MoSIFT.
• A PageRank-based centrality measure to select key poses according to the recovered geometric structure.
• Properties of the solution to the Poisson equation, used to extract space-time features.
• Differences between frames, calculated and used as intermediate features.
• An action recognition framework fusing local 3D-SIFT descriptors and holistic Zernike motion energy image (MEI) features.
Disadvantages
• Segmentation and tracking are not possible.
• Computing the feature points is too time-consuming.
• Sparse representations, such as bag of visual words (BoVW), discard the geometric relationship of the features and are less discriminative.
• Hard-assignment quantization is used during the codebook construction for BoVW.
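The difference between hard and soft assignment to a codebook can be sketched as follows. This is an illustrative example under stated assumptions, not the scheme used in the paper: the Gaussian kernel, the `sigma` parameter, and the two-word codebook are all hypothetical choices.

```python
import math


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def hard_assign(feature, codebook):
    """One-hot vector: all weight goes to the nearest codeword."""
    distances = [squared_distance(feature, word) for word in codebook]
    nearest = distances.index(min(distances))
    return [1.0 if i == nearest else 0.0 for i in range(len(codebook))]


def soft_assign(feature, codebook, sigma=1.0):
    """Gaussian-kernel weights over all codewords, normalized to sum to 1.

    sigma is a hypothetical smoothing parameter chosen for illustration.
    """
    weights = [
        math.exp(-squared_distance(feature, word) / (2.0 * sigma ** 2))
        for word in codebook
    ]
    total = sum(weights)
    return [w / total for w in weights]


# Assumed toy codebook with two codewords.
codebook = [(0.0, 0.0), (2.0, 0.0)]
feature = (0.9, 0.0)  # almost halfway between the two codewords

print(hard_assign(feature, codebook))  # [1.0, 0.0]
print(soft_assign(feature, codebook))  # both codewords receive some weight
```

Hard assignment throws away the fact that the feature is nearly as close to the second codeword, which is one source of quantization error; soft assignment keeps that ambiguity in the weights.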
Proposed System
• We propose a method to recognize actions from human silhouettes.
• We extract the BoCP (bag of correlated poses) feature.
• The BoCP feature is extracted in a sequence of steps:
– PCA feature extraction, followed by k-means clustering and construction of the correlogram matrix.
– The dimension of the correlogram is reduced using LDA.
• The BoCP feature descriptor and the extended-MHI together form the feature vector.
• An SVM (support vector machine) is trained on the features and predicts the result.
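The correlogram step of the pipeline above can be sketched in a few lines. The sketch assumes each frame has already been reduced with PCA and assigned to one pose codeword by k-means, and it uses hard labels for simplicity; the LDA and SVM stages are omitted. The function names, the single temporal `offset`, and the normalization are assumptions made for illustration rather than the paper's exact definition.

```python
def pose_correlogram(labels, num_words, offset):
    """Count how often codeword j appears `offset` frames after codeword i.

    labels    -- list of ints, the codeword index assigned to each frame
    num_words -- codebook size K
    offset    -- temporal distance (in frames) between the two poses
    Returns a K x K matrix normalized so that its entries sum to 1
    (all zeros if the sequence is too short to form any pair).
    """
    counts = [[0.0] * num_words for _ in range(num_words)]
    pairs = 0
    for t in range(len(labels) - offset):
        counts[labels[t]][labels[t + offset]] += 1.0
        pairs += 1
    if pairs == 0:
        return counts
    return [[c / pairs for c in row] for row in counts]


def flatten(matrix):
    """Turn the K x K correlogram into a feature vector of length K*K."""
    return [value for row in matrix for value in row]


# Assumed toy sequence: six frames quantized to a codebook of three poses.
labels = [0, 0, 1, 2, 2, 0]
corr = pose_correlogram(labels, num_words=3, offset=1)
print(flatten(corr))
```

Unlike a plain bag-of-words histogram, the matrix records which pose follows which, so two actions built from the same poses in a different order produce different features. The flattened vector has K*K entries, which is why a dimensionality reduction step such as LDA follows.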
Advantages
• Reduces computational complexity and quantization error.
• Takes advantage of both local and global features.
• Provides a discriminative representation for human actions.
Literature survey
Action recognition using context and appearance distribution features
• We first propose a new spatio-temporal context distribution feature of interest points for human action recognition.
• Each action video is expressed as a set of relative XYT coordinates between pairwise interest points in a local region.
• We learn a global GMM (referred to as Universal Background Model, UBM) using the relative coordinate features from all the training videos, and then represent each video as the normalized parameters of a video-specific GMM adapted from the global GMM.
• In order to capture the spatio-temporal relationships at different levels, multiple GMMs are utilized to describe the context distributions of interest points over multi-scale local regions.
• To describe the appearance information of an action video, we also propose to use GMM to characterize the distribution of local appearance features from the cuboids centered around the interest points.
• Accordingly, an action video can be represented by two types of distribution features:
– 1) multiple GMM distributions of spatio-temporal context;
– 2) GMM distribution of local video appearance.
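The relative XYT coordinates described above can be illustrated with a small sketch. It covers only the pairwise-offset step; fitting the GMM/UBM is not shown. Treating the "local region" as a Euclidean ball of a given `radius` in XYT space is an assumption made for illustration, as are the function name and the sample points.

```python
def relative_xyt(points, radius):
    """Relative (dx, dy, dt) offsets between interest points that are close.

    points -- list of (x, y, t) tuples
    radius -- hypothetical neighbourhood size; ordered pairs farther apart
              than this (Euclidean distance in XYT) are ignored
    """
    features = []
    for i, (xi, yi, ti) in enumerate(points):
        for j, (xj, yj, tj) in enumerate(points):
            if i == j:
                continue
            dx, dy, dt = xj - xi, yj - yi, tj - ti
            if dx * dx + dy * dy + dt * dt <= radius * radius:
                features.append((dx, dy, dt))
    return features


# Assumed toy interest points: two close together, one far away.
points = [(10, 10, 0), (12, 10, 1), (40, 40, 5)]
print(relative_xyt(points, radius=5))  # [(2, 0, 1), (-2, 0, -1)]
```

Running the same function with several radii would give the multi-scale variant mentioned in the summary, with one set of offsets per scale.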
Action Recognition using Space-time Shape Difference Images
• In this paper, we present a novel motion representation based on difference images.
• In this paper we have presented a new method of extracting useful features from human action videos for action recognition.
• We show that this representation exploits the dynamics of motion, and show its effectiveness in action recognition.
• We showed the effectiveness of our method and compared our results against other well-established algorithms; the comparison shows that our algorithm has competitive accuracy, is fast, and is not very sensitive to video resolution, partial shape deformation of actions, or the number of clusters used.
• Future work can include combining other features containing additional shape information, and improving the quality of silhouette extraction.
Making action recognition robust to occlusions and viewpoint changes
• We propose a novel approach to providing robustness to both occlusions and viewpoint changes that yields significant improvements over existing techniques.
• At its heart is a local partitioning and hierarchical classification of the 3D Histogram of Oriented Gradients (HOG) descriptor to represent sequences of images that have been concatenated into a data volume.
• We achieve robustness to occlusions and viewpoint changes by combining training data from all viewpoints to train classifiers that estimate action labels independently over sets of HOG blocks.
• A top level classifier combines these local labels into a global action class decision.
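The idea of combining independent local labels into one global decision can be sketched with a simple majority vote. This is a stand-in chosen for illustration: the summarized paper uses a trained top-level classifier, not voting. Marking unavailable (for example, occluded) blocks with `None` is also an assumption of this sketch.

```python
from collections import Counter


def combine_local_labels(block_labels):
    """Pick the most frequent label among local block predictions.

    block_labels -- list of labels, one per block set; None marks a block
                    whose prediction is unavailable (for example, occluded)
    Returns None when no block produced a label.
    """
    votes = Counter(label for label in block_labels if label is not None)
    if not votes:
        return None
    return votes.most_common(1)[0][0]


# Two of five block sets are occluded; the remaining ones still agree on "walk".
print(combine_local_labels(["walk", None, "walk", "wave", None]))  # walk
```

Because each block set votes on its own, losing some of them to occlusion degrades the decision gradually instead of corrupting a single global descriptor.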
Action recognition using
correlogram of body poses and
spectral regression

• In this paper, we propose a novel representation for
human actions using Correlogram of Body Poses
(CBP) which takes advantage of both the probabilistic
distribution and the temporal relationship of human
poses.
• To reduce the high dimensionality of the CBP
representation, an efficient subspace learning
technique called Spectral Regression Discriminant
Analysis (SRDA) is explored.
• Experimental results on the challenging IXMAS
dataset show that the proposed algorithm
outperforms the state-of-the-art methods on action
recognition.
Evaluation of local spatio-temporal features for action recognition
• The purpose of this paper is to evaluate and compare previously proposed space-time features in a common experimental setup.
• In particular, we consider four different feature detectors and six local feature descriptors and use a standard bag-of-features SVM approach for action recognition.
• We investigate the performance of these methods on a total of 25 action classes distributed over three datasets with varying difficulty.
• Among interesting conclusions, we demonstrate that regular sampling of space-time features consistently outperforms all tested space-time interest point detectors for human actions in realistic settings.
• We also demonstrate a consistent ranking for the majority of methods over different datasets and discuss their advantages and limitations.
Applications
• Video Surveillance
• Robotics
• Human–Computer Interaction
• User Interface Design
• Multimedia Video Retrieval
Conclusion
• In this paper, we proposed two new representations, namely BoCP and the extended-MHI, for action recognition.
• BoCP is a temporally local feature descriptor, and the extended-MHI is a holistic motion descriptor.
• The extension of MHI compensates for information loss in the original approach, and we verified the conjecture that local and holistic features are complementary to each other.
• Our system showed promising performance and produced better results than any published paper on the IXMAS dataset.
• With more sophisticated feature descriptors and advanced dimensionality reduction methods, we expect better performance.
Future Work
• We propose to replace PCA (Principal Component
Analysis) feature extraction by ICA (Independent
Component Analysis), so that the accuracy of
recognition can be improved.
References
• X. Wu, D. Xu, L. Duan, and J. Luo, “Action
recognition using context and appearance distribution
features,”
• H. Qu, L. Wang, and C. Leckie, “Action recognition
using space-time shape difference images,”
• D. Weinland, M. Özuysal, and P. Fua, “Making
action recognition robust to occlusions and viewpoint
changes,”
• L. Shao, D. Wu, and X. Chen, “Action recognition
using correlogram of body poses and spectral
regression,”
• H. Wang, M. Ullah, A. Klaser, I. Laptev, and C.
Schmid, “Evaluation of local spatio-temporal features
for action recognition,”