SlideShare une entreprise Scribd logo
1  sur  5
Télécharger pour lire hors ligne
IJSRD - International Journal for Scientific Research & Development| Vol. 1, Issue 3, 2013 | ISSN (online): 2321-0613
All rights reserved by www.ijsrd.com 617
Speech Recognition using HMM & GMM Models: A Review on
Techniques and Approaches
Ritenkumar R. Patel1
Prof. M. A. Lokhandwala2
Prof. Mitul Patel3
1
M. E. student 2, 3
Assistant Professor
1, 2, 3
E & C Department
1, 2, 3
Parul institute of Engineering & Technology, Limda, Waghodia, Gujarat, India
Abstract— Many ways of communications are used
between human and computer, while using gesture is
considered to be one of the most natural ways in a virtual
reality system.
Speech recognition is one of the typical methods of
non-verbal communication for human beings and we
naturally use various gestures to express our own intentions
in everyday life.
Gesture recognizers are supposed to capture and
analyze the information transmitted by the hands of a person
who communicates in sign language. This is a prerequisite
for automatic sign-to-spoken-language translation, which
has the potential to support the integration of deaf people
into society.
This paper present part of literature review on
ongoing research and findings on different technique and
approaches in gesture recognition using Hidden Markov
Models for vision-based approach.
Keywords: sign language, gesture recognition, hidden
markov model
I. INTRODUCTION
The video-based approaches claim to allow the signer to
move freely without any instrumentation attached to the
body. Trajectory, hand shape and hand locations are tracked
and detected by a camera (or an array of cameras). By
doing so, the signer is constrained to sign in a closed, some-
how controlled environment. The amount of data that has to
be processed to extract and track hands in the image also
imposes a restriction on memory, speed and complexity on
the computer equipment.
A recognition system that we proposed comprises of a
framework of two most important entity, which are
preprocessing (image processing) and recognition
classification (artificial intelligence recognition) as both will
have to work relatively in defining the meaning of an input
sign language.
Vision-based technique has been proposed as we are
using single Matrox camera to collect gesture data. This
technique can cover both face and hands signer in which
signer does not need to wear data gloves device. All
processing tasks are solved by using computer vision
techniques which are more flexible and useful than devised
based measurement approach.
Since sign language is gesticulated fluently and
interactively like other spoken languages, a sign language
recognizer must able to recognize continuous sign
vocabularies in real-time. The authors try to build such a
system for the Bahasa Melayu Sign Language. Gesture in
this paper will always refer to the hand.
Basically, different technique and approaches in
recognizing a gesture for sign language will be reviewed.
II. PREVIOUS WORK
Sensing of human expression is very important for human-
computer interactive applications such as virtual reality,
gesture recognition, and communication. In recent years, a
large number of studies have been made on machine
recognition of human expression.
A number of systems have been implemented that
recognize various types of sign languages. For example,
Starner was able to recognize a distinct forty word lexicon
consisting of pronouns, nouns, verbs, and adjectives taken
from American Sign Language with accuracies of over 90%
[10]. A system developed by Kadous [17] recognized a 95
word lexicon from Australian Sign Language and achieved
accuracies of approximately 80%. Murakami and Taguchi
were able to adequately recognize 42 finger-alphabet
symbols and 10 sign-language words taken from Japanese
Sign Language [19]. Lu [21] and Matuso [18] have also
developed systems for recognizing a subset of Japanese Sign
Language. Finally, Takahashi and Kishino were able to
accurately recognize 34 speech recognition is used in the
Japanese Kana Manual Alphabet [20].
A common approach to describing human motion
is to use a state-based model, such as a Hidden Markov
Model (HMM), to convert a series of motions into a
description of activity. Such systems [1, 2, 3, 4, 5, 6] operate
by training a HMM (or some variant thereon) to parse a
stream of short-term tracked motions.
Another system [3] recognizes simple human
gestures (against a black backdrop), while an earlier system
[2] recognizes simple actions (pick up, put down, push, pull,
etc.), also based on a trained HMM.
An early effort by Yamato et al. [7] uses discrete
HMM’s to successfully recognize image sequences of six
different tennis strokes among three subjects. This
experiment is significant because it used a 25x25 pixel
quantized sub sampled camera image as a feature vector that
is converted into a discrete label by a vector quantizer. The
labels are classified based on discrete HMMs. Even with
such low-level information, the model could learn the set of
motions to perform respectable recognition rates.
According to Iwai, Shimizu, and Yachida [8], a lot
of methods for the recognition of gestures from images have
been proposed. Takahasi used spatiotemporal edges for the
recognition of gestures based on a DP matching method.
Speech Recognition using HMM & GMM Models: A Review on Techniques and Approaches
(IJSRD/Vol. 1/Issue 3/2013/0055)
All rights reserved by www.ijsrd.com
618
Yamato used matrices sampled sparsely from images and
Hid-den Markov Models for the recognition. The sparse
matrices are quantized by the Categorized VQ method and
are used as input for a discrete HMM. Campbell used the 3-
D positions and 3-D motion of face, left hand, and right
hand extracted from stereo images and HMM for the
recognition. HMM method has a better performance than
DP matching method for recognition of public gestures
because of statistically learning.
Research by Starner and Pentland did not attempt
to capture detailed information about the hands [9, 10]. In
this recognition system, sentences of the form (‘personal
pronoun, verb, noun, adjective, (the same) personal
pronoun” are to be recognized with a vocabulary of 40
signs. These systems have mostly concentrated on finger
signing, where the user spells each word with hand signs
corresponding to the letters of the alphabet. They proposed
an extensible system which uses a single color camera to
track hands in real time and interprets American Sign
Language (ASL) using HMM. The hand tracking stage of
the system does not attempt to produce a fine grain
description of hand shape; studies have shown that such
detailed information may not be necessary for humans to
interpret sign language [11, 12]. Instead, the tracking
process produces only a coarse description of hand shape,
orientation, and trajectory as inputs to HMMs. Signs are
modeled with four-states-HMM. They use a single camera
and plain cotton gloves in two colors or no gloves for a
second test. The shape, orientation, and trajectory
information is then input to a HMM for recognition of the
signed words. They achieve recognition accuracies between
75% and 99% allowing only the simplest syntactical
structures.
In [2] Siskind and Morris conjecture that human
event perception does not presuppose object recognition. In
other words, they think visual event recognition is
performed by a visual pathway which is separated from
object recognition. To verify the conjecture, they analyze
motion profiles of objects that participate in different simple
spatial motion events. Their tracker uses a mixture of color
based and motion based techniques. Color based techniques
are used to track objects defined by set of colored pixels
whose saturation and value are above certain thresholds in
each frame. These pixels are then clustered into regions
using a histogram based on hue. Moving pixels are
extracted from frame differences and divided into clusters
based on proximity. Next, each region (generated by color
or motion) in each frame is abstracted by an ellipse. Finally,
feature vector for each frame is generated by computing the
absolute and relative ellipse positions, orientations,
velocities and accelerations. To classify visual events, they
use a set of Hidden Markov Models (HMMs) which are
used as generative models and trained on movies of each
visual event represented by a set of feature vectors. After
training, a new observation is classified as being generated
by the model that assigns the highest likelihood.
Experiments on a set of 6 simple gestures, “pick up,” “put
down,” “push,” “pull,” “drop,” and “throw,” demonstrate
that gestures can be classified based on motion profiles.
Sorrentino, Cuzzolin and Frezza [13] present an
original technique for speech recognitio based on a dynamic
shape representation by combining size functions and
hidden Markov models (HMM). Each gesture is described
by a different probabilistic finite state machine which
models a succession of so called canonical postures of the
hand. Conceptually, they model a gesture as a sequence of
hand postures which project into sets of size functions. The
state dynamics describe the transition between canonical
postures while the observation equations are maps from the
set of canonical postures to size functions. In their proposed
technique, a fundamental hypothesis has been made;
gestures may be modeled as sequences of a finite number of
“canonical" postures of the hand. Each posture is associated
to a state of a probabilistic finite state machine, in particular
of a Hidden Markov Model. Each gesture is identified with
a HMM with an appropriate number of states and transition
probabilities. They come with the result that HMM
paradigm deserves to be confirmed, whereas future work
should concentrate upon a better shape representation and
also the measuring functions proposed in [14] must be
improved since they are particularly sensitive to small
changes in the position of the center of mass of the images.
Further developments on integrating the qualitative
modeling philosophy presented in this paper with
quantitative techniques which describe the 27 degrees of
freedom kinematics of the hand.
Chen, Fu and Huang in [15] have introduced a
speech recognition system to recognize continuous gesture
before stationary background using 2D video input. Reliable
method of image processes has been applied: motion
detection, skin color extraction, edge detection, movement
justification and background subtraction as a real-time
image processing subsystem. The system consists of four
modules: a real time hand tracking and extraction, feature
extraction, hidden Markov model (HMM) training, and
gesture recognition. First, they apply a real-time hand
tracking and extraction algorithm to trace the moving hand
and extract the hand region, then the Fourier descriptor (FD)
has been used to characterize spatial features and the motion
analysis to characterize the temporal features. A spatial and
temporal feature has been combined of the input image
sequence as our feature vector. After the feature vectors
have been extracted, we apply HMMs to recognize the input
gesture. The gesture to be recognized is separately scored
against different HMMs. The model with the highest score
indicates the corresponding gesture. In the experiments, we
have tested our system to recognize 20 different gestures,
and the recognizing rate is above 90%.
Garver [16] has come out with his research to
construct interface environment that allowing mouse control
through pointing gestures and commands executed by
simple arm movements as the goal of having natural
gestures as a modality for basic computer control. Hand
segmentation system has been performed for individual
frames. The hand segmentation system is made up of three
image processing algorithms: dynamic background
subtraction, statistical skin color segmentation, and depth
thresholding. Using the basic filters for segmentation and
locating hands using a very simple blob extraction algorithm
the tracking system is ready to use. HMM has been
proposed for command gesture system to trained gesture
classification. After preparing the gesture for executing a
Speech Recognition using HMM & GMM Models: A Review on Techniques and Approaches
(IJSRD/Vol. 1/Issue 3/2013/0055)
All rights reserved by www.ijsrd.com
619
command, produced gesture sequence that is then processed
by a series of HMMs. Each HMM represents a specific
gesture, and the performed gesture is classified as the
representative model with the highest probability of
correspondence. However only the hand segmentation and
tracking follow well. The initial classification of gestures as
pointing or command works with reasonable accuracy, need
to be improved.
III. GESTURE AND HIDDEN MARKOV MODEL
A. Speech recognition
The vast majority of automatic recognition systems are for
deictic gestures (pointing), emblematic gestures (isolated
signs) and sign languages (with a limited vocabulary and
syntax). Some are components of bimodal systems,
integrated with speech recognition. Some produce precise
hand and arm configuration while others only coarse
motion.
The kinematics of the posture of a human hand can
be described by a mechanical system with 27 degrees of
freedom [1, 2].
Common experience suggests, however, that a
gesture may be characterized by a sequence of only a finite
number of hand postures and that, therefore, it is not
necessary to describe hand posture as a continuum.
Feature extraction and analysis is a robust way to
recognize hand postures and gestures. It can be used not
only to recognize simple hand postures and gestures but
complex ones as well. Its main drawback is that it can
become computationally expensive when a large number of
features are being collected, which slows down system
response time. This slowdown is extremely significant in
virtual environment applications, but should diminish with
the advent of faster machines.
The concept of gesture is loosely defined, and
depends on the context of the interaction. Recognition of
natural, continuous gestures requires temporally segmenting
gestures. Automatically segmenting gestures is difficult, and
is often finessed or ignored in current systems by requiring a
starting position in time and/or space. Similar to this is the
problem of distinguishing intentional gestures from other
“random” movements. There is no standard way to do
gesture recognition – a variety of representations and
classification schemes are used. However, most gesture
recognition systems share some common structure.
B. Overview of Hidden Markov Model
HMM were introduced in the mid 1990’s, and quickly
became the recognition method of choice, due to its implicit
solution to the segmentation problem.
In describing hidden Markov models it is
convenient first to consider Markov chains. Markov chains
are simply finite-state automata in which each state
transition arc has an associated probability value; the
probability values of the arcs leaving a single state sum to
one. Markov chains impose the restriction on the finite-state
automaton that a state can have only one transition arc with
a given output; a restriction that makes Markov chains
deterministic. A hidden Markov model (HMM) can be
considered a generalization of a Markov chain without this
Markov-chain restriction [23]. Since HMMs can have more
than one arc with the same output symbol, they are
nondeterministic, and it is impossible to directly determine
the state sequence for a set of inputs simply by looking at
the output (hence the “hidden” in “hidden Markov model”).
More formally, a HMM is defined as a set of states of which
one state is the initial state, a set of output symbols, and a set
of state transitions. Each state transition is represented by
the state from which the transition starts, the state to which
transition moves, the output symbol generated, and the
probability that the transition is taken [23]. In the context of
speech recognition, each state could represent a set of
possible hand positions. The state transitions represent the
probability that a certain hand position transitions into
another; the corresponding output symbol represents a
specific posture and sequences of output symbols represent
speech recognition. One then uses a group of HMMs, one
for each gesture, and runs a sequence of input data through
each HMM. The input data, derived from pixels in a vision-
based solution or from bend sensor values in a glove-based
solution, can be represented in many different ways, the
most common by feature vectors [9]. The HMM with the
highest forward probability (described later in this section)
determines the users’ most likely gesture. An HMM can also
be used for hand posture recognition; see Liang and
Ouhyoung [24] for details.
Gaolin Fang and Wen Gao [25] stated that there are
some limitations of classical HMMs for sign language
recognition. Firstly, it is the assumption that the
distributions of individual observation parameters can be
well represented as a mixture of Gaussian or autoregressive
densities. This assumption isn’t always consistent with the
fact. Secondly, HMMs have the poorer discrimination than
neural networks. In the HMMs training, each word model is
estimated separately using the corresponding labeled
training observation sequences without considering the
confused data (other models with similar behavior).
IV. RESEARCH METHODOLOGY
Current phase of our development is still under image
processing. Hand region has been successfully detected by
implementing five steps image processing as shown in
figure 1 under the preprocessing stage.
Fig. 1: Basic system flow of recognition system
Speech Recognition using HMM & GMM Models: A Review on Techniques and Approaches
(IJSRD/Vol. 1/Issue 3/2013/0055)
All rights reserved by www.ijsrd.com
620
Thus, review on technique for gesture recognition
using HMM will contribute the idea on how we are going to
model our gesture.
Future plan on the image processing will be
discussed on the next part as the output of it will be used for
the recognition process. The output is supposed to give
much information to model the gesture and perform
recognition process.
V. DISCUSSION
The main requirement for the use of gesture recognition in
human-computer interfaces is a continuous online
recognition with a minimum time delay of the recognition
output, but such a system performance is extremely difficult
to achieve, because of the following reasons: The fact that
the starting and ending point of the gesture is not known is a
very complicated problem.
A frame captured by the frame grabber has to be
immediately preprocessed and the features have to be
extracted and used for the classification, before the next
frame is captured.
Assumption on the lighting, background, signers
location need to be done since the Matrox camera that we
use capable of capturing images into grayscale image.
VI. CONCLUSION AND FUTURE DIRECTION
In this paper, a few literature reviews on technique of
gesture recognition using HMM has been reviewed and
analyzed. Most of them are using colored images as better
result can be achieved.
As the input of gesture recognition, further process
need to be done on the detected hand. Edge detection is
proposed. Age detection used to mark points at which
image intensity changes sharply. So we will get only the
hand region.
Hand motion, posture and orientation are other
areas that need to be explore. Combination of hand
detection with those will be useful input for recognition
process.
REFERENCES
[1]. H. Buxton, “Learning and understanding dynamic
scene activity: a review,” Image and Vision
Computing, 21(1):125–136, January 2003.
[2]. J.M. Siskind and Q. Morris, “A maximum-likelihood
approach to visual event classification,” In Proc. 4th
European Conference on Computer Vision, pages II:
347–360, 1996.
[3]. Y.A. Ivanov and A.F. Bobick, “Recognition of visual
activities and interactions by stochastic parsing,” IEEE
Trans. Pattern Analysis and Machine Intelligence,
22(8):852–872, August 2000.
[4]. J. Lou, Q. Liu, T. Tan, and W. Hu, “Semantic
interpretation of object activities in a surveillance
system,” In Proc. International Conference on Pattern
Recognition, pages III: 777–780, 2002.
[5]. N.M. Oliver, B. Rosario, and A.P. Pentland, “A
Bayesian computer vision system for modeling human
interactions,” IEEE Trans. Pattern Analysis and
Machine Intelligence, 22(8):831–843, August 2000.
[6]. V. Nair and J.J. Clark, “Automated visual surveillance
using hidden markov models,” In International
Conference on Vision Interface, pages 88–93, 2002.
[7]. J. Yamato, J. Ohya, and K. Ishii, “Recognizing human
action in time-sequential images using hidden markov
models,” In Proc. 1992 ICCV, p. 379-385.
[8]. Y.Iwai, H.Shimuzu, and M.Yachida, “Real-Time
Context-based Gesture Recognition Using HMM and
Automation,” Department of Systems and Human
Science, Osaka University, Toyonaka, Osaka 560-
8531, JAPAN, iwai@sys.es.osaka-u.ac.jp
[9]. A T. Staner and A. Pentland, “Visual Recognition of
American Sign Language using Hidden Markov
Models,” Technical Report TR-306, Media Lab, MIT,
1995
[10]. A T. Staner and A. Pentland, “Real-Time American
Sign Language Recognition from Video using Hidden
Markov Models,” Technical Report TR-306, Media
Lab, MIT, 1995
[11]. H. Poizner, U. Bellugi, and V. Lutes-Driscoll,
“Perception of american sign language in dynamic
point-light displays,” 7: 430-440, 1981.
[12]. G. Sperling, M. Landy, Y. Cohen, and M. Pavel,
“Intelligible encoding of ASL image sequences at
extremely low information rates,” Comp. Vision,
Graphics, and Image Proc., 31:335-391, 1985.
[13]. A.Sorrentino F.Cuzzolin, and R.Frezza, “Using
Hidden Markov Models and Dynamic Size Functions
for Gesture Recognition,” British Machine Vision
Conference, 1997.
[14]. A. Verri and C. Uras. “Metric-topological approach to
shape representation and recognition,” Image Vis. C.,
14 (3):189–207, 1996.
[15]. Sheng Chen, Chih-Ming Chu, Chung-Lin Huang,
“Speech recognition using Real Time Tracking
Method and Hidden Markov models,” Institute of
Electrical Engineering, National Tsing Hua University,
Hsin Chu 300, Taiwan, ROC
[16]. R.Garver, “Vision Based Gesture Recognition,
Interaction Lab,” University of California at Santa
Barbara
[17]. Kadous, Waleed, “GRASP: Recognition of Australian
Sign Language Using Instrumented Gloves,”
Bachelor’s thesis, University of New South Wales,
1995.
[18]. Matsuo, Hideaki, Seiji Igi, Shan Lu, Yuji Nagashima,
Yuji Takata, and Terutaka Teshima, “The Recognition
Algorithm with Non-contact for Japanese Sign
Language Using Morphological Analysis,” In
Proceedings of the International Gesture Workshop’97
, Berlin, 273-284, 1997.
[19]. Murakami, Kouichi, and Hitomi, “Taguchi. Gesture
Recognition Using Recurrent Neural Networks,” In
Proceedings of CHI’91 Human Factors in Computing
Systems, 237-242, 1991.
[20]. Takahashi, Tomoichi, and Fumio Kishino, “Speech
recognition Coding Based on Experiments Using a
Speech recognition Interface Device,” SIGCHI
Bulletin 23(2):67-73, 1991.
[21]. 21 Lu, Shan, Seiji Igi, Hideaki Matsuo, and Yuji
Nagashima, “Towards a Dialogue System Based on
Speech Recognition using HMM & GMM Models: A Review on Techniques and Approaches
(IJSRD/Vol. 1/Issue 3/2013/0055)
All rights reserved by www.ijsrd.com
621
Recognition and Synthesis of Japanese Sign
Language,” In Proceedings of the International
Gesture Workshop’97, Berlin, 259-272, 1997.
[22]. 22 J.J. LaViola Jr, A survey of Hand Posture and
Gesture Recognition Techniques and Technology,
Department of Computer Science, Brown University,
Providence, Rhode Island 02912, June 1999
[23]. 23 Charniak, Eugene. Statistical Language
Learning.MIT Press, Cambridge, 1993.
[24]. 24 Liang, Rung-Huei, and Ming Ouhyoung. A Sign
Language Recognition System Using Hidden Markov
Model and Context Sensitive Search. In Proceedings
of the ACM Symposium on Virtual Reality Software
and Technology’96, ACM Press, 59-66, 1996.
[25]. 25 Gaolin Fang, Wen Gao, Jiyong Ma. Signer-
Independent Sign Language Recognition Based on
SOFM/HMM Depatment of Computer Science and
Engineering, Harbin Institute of Technology, Harbin,
150001, China

Contenu connexe

Tendances

Ijaia040203
Ijaia040203Ijaia040203
Ijaia040203ijaia
 
ttA sign language recognition approach for
ttA sign language recognition approach forttA sign language recognition approach for
ttA sign language recognition approach forijcseit
 
A Real-Time Letter Recognition Model for Arabic Sign Language Using Kinect an...
A Real-Time Letter Recognition Model for Arabic Sign Language Using Kinect an...A Real-Time Letter Recognition Model for Arabic Sign Language Using Kinect an...
A Real-Time Letter Recognition Model for Arabic Sign Language Using Kinect an...INFOGAIN PUBLICATION
 
Gesture Acquisition and Recognition of Sign Language
Gesture Acquisition and Recognition of Sign LanguageGesture Acquisition and Recognition of Sign Language
Gesture Acquisition and Recognition of Sign LanguageIRJET Journal
 
Review on Hand Gesture Recognition
Review on Hand Gesture RecognitionReview on Hand Gesture Recognition
Review on Hand Gesture Recognitiondbpublications
 
Appearance based static hand gesture alphabet recognition
Appearance based static hand gesture alphabet recognitionAppearance based static hand gesture alphabet recognition
Appearance based static hand gesture alphabet recognitionIAEME Publication
 
A Traffic Sign Classifier Model using Sage Maker
A Traffic Sign Classifier Model using Sage MakerA Traffic Sign Classifier Model using Sage Maker
A Traffic Sign Classifier Model using Sage Makerijtsrd
 
OPTIMIZED FINGERPRINT COMPRESSION WITHOUT LOSS OF DATAProposed workblessy up...
OPTIMIZED FINGERPRINT COMPRESSION WITHOUT LOSS OF DATAProposed workblessy  up...OPTIMIZED FINGERPRINT COMPRESSION WITHOUT LOSS OF DATAProposed workblessy  up...
OPTIMIZED FINGERPRINT COMPRESSION WITHOUT LOSS OF DATAProposed workblessy up...feature software solutions pvt ltd
 
Fingerprint Feature Extraction, Identification and Authentication: A Review
Fingerprint Feature Extraction, Identification and Authentication: A ReviewFingerprint Feature Extraction, Identification and Authentication: A Review
Fingerprint Feature Extraction, Identification and Authentication: A Reviewpaperpublications3
 
To identify the person using gait knn based approach
To identify the person using gait   knn based approachTo identify the person using gait   knn based approach
To identify the person using gait knn based approacheSAT Journals
 
Character Recognition (Devanagari Script)
Character Recognition (Devanagari Script)Character Recognition (Devanagari Script)
Character Recognition (Devanagari Script)IJERA Editor
 
Artificial Neural Network For Recognition Of Handwritten Devanagari Character
Artificial Neural Network For Recognition Of Handwritten Devanagari CharacterArtificial Neural Network For Recognition Of Handwritten Devanagari Character
Artificial Neural Network For Recognition Of Handwritten Devanagari CharacterIOSR Journals
 
Reduction of False Acceptance Rate Using Cross Validation for Fingerprint Rec...
Reduction of False Acceptance Rate Using Cross Validation for Fingerprint Rec...Reduction of False Acceptance Rate Using Cross Validation for Fingerprint Rec...
Reduction of False Acceptance Rate Using Cross Validation for Fingerprint Rec...IJTET Journal
 
Bangladesh Army University of Science and Technology (BAUST), Saidpur // ...
Bangladesh Army University of Science and Technology (BAUST), Saidpur     // ...Bangladesh Army University of Science and Technology (BAUST), Saidpur     // ...
Bangladesh Army University of Science and Technology (BAUST), Saidpur // ...Naiyan Noor
 
A Novel Approach to Fingerprint Identification Using Gabor Filter-Bank
A Novel Approach to Fingerprint Identification Using Gabor Filter-BankA Novel Approach to Fingerprint Identification Using Gabor Filter-Bank
A Novel Approach to Fingerprint Identification Using Gabor Filter-BankIDES Editor
 
Segmentation, tracking and feature extraction
Segmentation, tracking and feature extractionSegmentation, tracking and feature extraction
Segmentation, tracking and feature extractionijcsa
 
A Comprehensive Approach for Multi Biometric Recognition Using Sclera Vein an...
A Comprehensive Approach for Multi Biometric Recognition Using Sclera Vein an...A Comprehensive Approach for Multi Biometric Recognition Using Sclera Vein an...
A Comprehensive Approach for Multi Biometric Recognition Using Sclera Vein an...IJTET Journal
 
IRJET- Sign Language Interpreter using Image Processing and Machine Learning
IRJET- Sign Language Interpreter using Image Processing and Machine LearningIRJET- Sign Language Interpreter using Image Processing and Machine Learning
IRJET- Sign Language Interpreter using Image Processing and Machine LearningIRJET Journal
 

Tendances (20)

Ijaia040203
Ijaia040203Ijaia040203
Ijaia040203
 
42 128-1-pb
42 128-1-pb42 128-1-pb
42 128-1-pb
 
ttA sign language recognition approach for
ttA sign language recognition approach forttA sign language recognition approach for
ttA sign language recognition approach for
 
A Real-Time Letter Recognition Model for Arabic Sign Language Using Kinect an...
A Real-Time Letter Recognition Model for Arabic Sign Language Using Kinect an...A Real-Time Letter Recognition Model for Arabic Sign Language Using Kinect an...
A Real-Time Letter Recognition Model for Arabic Sign Language Using Kinect an...
 
Gesture Acquisition and Recognition of Sign Language
Gesture Acquisition and Recognition of Sign LanguageGesture Acquisition and Recognition of Sign Language
Gesture Acquisition and Recognition of Sign Language
 
Review on Hand Gesture Recognition
Review on Hand Gesture RecognitionReview on Hand Gesture Recognition
Review on Hand Gesture Recognition
 
Appearance based static hand gesture alphabet recognition
Appearance based static hand gesture alphabet recognitionAppearance based static hand gesture alphabet recognition
Appearance based static hand gesture alphabet recognition
 
A Traffic Sign Classifier Model using Sage Maker
A Traffic Sign Classifier Model using Sage MakerA Traffic Sign Classifier Model using Sage Maker
A Traffic Sign Classifier Model using Sage Maker
 
Palmprint Identification Using FRIT
Palmprint Identification Using FRITPalmprint Identification Using FRIT
Palmprint Identification Using FRIT
 
OPTIMIZED FINGERPRINT COMPRESSION WITHOUT LOSS OF DATAProposed workblessy up...
OPTIMIZED FINGERPRINT COMPRESSION WITHOUT LOSS OF DATAProposed workblessy  up...OPTIMIZED FINGERPRINT COMPRESSION WITHOUT LOSS OF DATAProposed workblessy  up...
OPTIMIZED FINGERPRINT COMPRESSION WITHOUT LOSS OF DATAProposed workblessy up...
 
Fingerprint Feature Extraction, Identification and Authentication: A Review
Fingerprint Feature Extraction, Identification and Authentication: A ReviewFingerprint Feature Extraction, Identification and Authentication: A Review
Fingerprint Feature Extraction, Identification and Authentication: A Review
 
To identify the person using gait knn based approach
To identify the person using gait   knn based approachTo identify the person using gait   knn based approach
To identify the person using gait knn based approach
 
Character Recognition (Devanagari Script)
Character Recognition (Devanagari Script)Character Recognition (Devanagari Script)
Character Recognition (Devanagari Script)
 
Artificial Neural Network For Recognition Of Handwritten Devanagari Character
Artificial Neural Network For Recognition Of Handwritten Devanagari CharacterArtificial Neural Network For Recognition Of Handwritten Devanagari Character
Artificial Neural Network For Recognition Of Handwritten Devanagari Character
 
Reduction of False Acceptance Rate Using Cross Validation for Fingerprint Rec...
Reduction of False Acceptance Rate Using Cross Validation for Fingerprint Rec...Reduction of False Acceptance Rate Using Cross Validation for Fingerprint Rec...
Reduction of False Acceptance Rate Using Cross Validation for Fingerprint Rec...
 
Bangladesh Army University of Science and Technology (BAUST), Saidpur // ...
Bangladesh Army University of Science and Technology (BAUST), Saidpur     // ...Bangladesh Army University of Science and Technology (BAUST), Saidpur     // ...
Bangladesh Army University of Science and Technology (BAUST), Saidpur // ...
 
A Novel Approach to Fingerprint Identification Using Gabor Filter-Bank
A Novel Approach to Fingerprint Identification Using Gabor Filter-BankA Novel Approach to Fingerprint Identification Using Gabor Filter-Bank
A Novel Approach to Fingerprint Identification Using Gabor Filter-Bank
 
Segmentation, tracking and feature extraction
Segmentation, tracking and feature extractionSegmentation, tracking and feature extraction
Segmentation, tracking and feature extraction
 
A Comprehensive Approach for Multi Biometric Recognition Using Sclera Vein an...
A Comprehensive Approach for Multi Biometric Recognition Using Sclera Vein an...A Comprehensive Approach for Multi Biometric Recognition Using Sclera Vein an...
A Comprehensive Approach for Multi Biometric Recognition Using Sclera Vein an...
 
IRJET- Sign Language Interpreter using Image Processing and Machine Learning
IRJET- Sign Language Interpreter using Image Processing and Machine LearningIRJET- Sign Language Interpreter using Image Processing and Machine Learning
IRJET- Sign Language Interpreter using Image Processing and Machine Learning
 

En vedette

Presentation MCB seminar 09032011
Presentation MCB seminar 09032011Presentation MCB seminar 09032011
Presentation MCB seminar 09032011Pierre Jacob
 
Applying Hidden Markov Models to Bioinformatics
Applying Hidden Markov Models to BioinformaticsApplying Hidden Markov Models to Bioinformatics
Applying Hidden Markov Models to Bioinformaticsbutest
 
Statistical inference of generative network models - Tiago P. Peixoto
Statistical inference of generative network models - Tiago P. PeixotoStatistical inference of generative network models - Tiago P. Peixoto
Statistical inference of generative network models - Tiago P. PeixotoLake Como School of Advanced Studies
 
Introduction to HMMs in Bioinformatics
Introduction to HMMs in BioinformaticsIntroduction to HMMs in Bioinformatics
Introduction to HMMs in Bioinformaticsavrilcoghlan
 
Stock Market Analysis Markov Models
Stock Market Analysis Markov ModelsStock Market Analysis Markov Models
Stock Market Analysis Markov ModelsGabriel Policiuc
 
Search Engine Friendly Web Design: Designing For People Who Use Search Engine...
Search Engine Friendly Web Design: Designing For People Who Use Search Engine...Search Engine Friendly Web Design: Designing For People Who Use Search Engine...
Search Engine Friendly Web Design: Designing For People Who Use Search Engine...Search Marketing Expo - SMX
 
Hidden Markov Model & Stock Prediction
Hidden Markov Model & Stock PredictionHidden Markov Model & Stock Prediction
Hidden Markov Model & Stock PredictionDavid Chiu
 
Data Science - Part XIII - Hidden Markov Models
Data Science - Part XIII - Hidden Markov ModelsData Science - Part XIII - Hidden Markov Models
Data Science - Part XIII - Hidden Markov ModelsDerek Kane
 
Stock Market Prediction using Hidden Markov Models and Investor sentiment
Stock Market Prediction using Hidden Markov Models and Investor sentimentStock Market Prediction using Hidden Markov Models and Investor sentiment
Stock Market Prediction using Hidden Markov Models and Investor sentimentPatrick Nicolas
 
Hidden Markov Models
Hidden Markov ModelsHidden Markov Models
Hidden Markov ModelsVu Pham
 
Markov Models
Markov ModelsMarkov Models
Markov ModelsVu Pham
 
10 Steps to Great SEO: InfusionCon 2011
10 Steps to Great SEO: InfusionCon 201110 Steps to Great SEO: InfusionCon 2011
10 Steps to Great SEO: InfusionCon 2011Rand Fishkin
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognitionRichie
 

En vedette (14)

Presentation MCB seminar 09032011
Presentation MCB seminar 09032011Presentation MCB seminar 09032011
Presentation MCB seminar 09032011
 
Applying Hidden Markov Models to Bioinformatics
Applying Hidden Markov Models to BioinformaticsApplying Hidden Markov Models to Bioinformatics
Applying Hidden Markov Models to Bioinformatics
 
Statistical inference of generative network models - Tiago P. Peixoto
Statistical inference of generative network models - Tiago P. PeixotoStatistical inference of generative network models - Tiago P. Peixoto
Statistical inference of generative network models - Tiago P. Peixoto
 
Introduction to HMMs in Bioinformatics
Introduction to HMMs in BioinformaticsIntroduction to HMMs in Bioinformatics
Introduction to HMMs in Bioinformatics
 
Stock Market Analysis Markov Models
Stock Market Analysis Markov ModelsStock Market Analysis Markov Models
Stock Market Analysis Markov Models
 
Search Engine Friendly Web Design: Designing For People Who Use Search Engine...
Search Engine Friendly Web Design: Designing For People Who Use Search Engine...Search Engine Friendly Web Design: Designing For People Who Use Search Engine...
Search Engine Friendly Web Design: Designing For People Who Use Search Engine...
 
Hidden Markov Model & Stock Prediction
Hidden Markov Model & Stock PredictionHidden Markov Model & Stock Prediction
Hidden Markov Model & Stock Prediction
 
HIDDEN MARKOV MODEL AND ITS APPLICATION
HIDDEN MARKOV MODEL AND ITS APPLICATIONHIDDEN MARKOV MODEL AND ITS APPLICATION
HIDDEN MARKOV MODEL AND ITS APPLICATION
 
Data Science - Part XIII - Hidden Markov Models
Data Science - Part XIII - Hidden Markov ModelsData Science - Part XIII - Hidden Markov Models
Data Science - Part XIII - Hidden Markov Models
 
Stock Market Prediction using Hidden Markov Models and Investor sentiment
Stock Market Prediction using Hidden Markov Models and Investor sentimentStock Market Prediction using Hidden Markov Models and Investor sentiment
Stock Market Prediction using Hidden Markov Models and Investor sentiment
 
Hidden Markov Models
Hidden Markov ModelsHidden Markov Models
Hidden Markov Models
 
Markov Models
Markov ModelsMarkov Models
Markov Models
 
10 Steps to Great SEO: InfusionCon 2011
10 Steps to Great SEO: InfusionCon 201110 Steps to Great SEO: InfusionCon 2011
10 Steps to Great SEO: InfusionCon 2011
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
 

Similaire à Speech Recognition using HMM & GMM Models: A Review on Techniques and Approaches

Novel Approach to Use HU Moments with Image Processing Techniques for Real Ti...
Novel Approach to Use HU Moments with Image Processing Techniques for Real Ti...Novel Approach to Use HU Moments with Image Processing Techniques for Real Ti...
Novel Approach to Use HU Moments with Image Processing Techniques for Real Ti...CSCJournals
 
Vision Based Approach to Sign Language Recognition
Vision Based Approach to Sign Language RecognitionVision Based Approach to Sign Language Recognition
Vision Based Approach to Sign Language RecognitionIJAAS Team
 
Real time Myanmar Sign Language Recognition System using PCA and SVM
Real time Myanmar Sign Language Recognition System using PCA and SVMReal time Myanmar Sign Language Recognition System using PCA and SVM
Real time Myanmar Sign Language Recognition System using PCA and SVMijtsrd
 
Hand Gesture Recognition System for Human-Computer Interaction with Web-Cam
Hand Gesture Recognition System for Human-Computer Interaction with Web-CamHand Gesture Recognition System for Human-Computer Interaction with Web-Cam
Hand Gesture Recognition System for Human-Computer Interaction with Web-Camijsrd.com
 
Iaetsd appearance based american sign language recognition
Iaetsd appearance based  american sign language recognitionIaetsd appearance based  american sign language recognition
Iaetsd appearance based american sign language recognitionIaetsd Iaetsd
 
hand gesture based interactive photo silder
hand gesture based interactive photo silderhand gesture based interactive photo silder
hand gesture based interactive photo sildersampada muley
 
Computer Based Human Gesture Recognition With Study Of Algorithms
Computer Based Human Gesture Recognition With Study Of AlgorithmsComputer Based Human Gesture Recognition With Study Of Algorithms
Computer Based Human Gesture Recognition With Study Of AlgorithmsIOSR Journals
 
Recognition of Facial Emotions Based on Sparse Coding
Recognition of Facial Emotions Based on Sparse CodingRecognition of Facial Emotions Based on Sparse Coding
Recognition of Facial Emotions Based on Sparse CodingIJERA Editor
 
Design and Development of Motion Based Continuous Sign Language Detection
Design and Development of Motion Based Continuous Sign Language DetectionDesign and Development of Motion Based Continuous Sign Language Detection
Design and Development of Motion Based Continuous Sign Language DetectionIRJET Journal
 
HSV Brightness Factor Matching for Gesture Recognition System
HSV Brightness Factor Matching for Gesture Recognition SystemHSV Brightness Factor Matching for Gesture Recognition System
HSV Brightness Factor Matching for Gesture Recognition SystemCSCJournals
 
Natural Hand Gestures Recognition System for Intelligent HCI: A Survey
Natural Hand Gestures Recognition System for Intelligent HCI: A SurveyNatural Hand Gestures Recognition System for Intelligent HCI: A Survey
Natural Hand Gestures Recognition System for Intelligent HCI: A SurveyEditor IJCATR
 
Gesture Recognition Technology
Gesture Recognition TechnologyGesture Recognition Technology
Gesture Recognition TechnologyNikith Kumar Reddy
 
Gesture recognition using artificial neural network,a technology for identify...
Gesture recognition using artificial neural network,a technology for identify...Gesture recognition using artificial neural network,a technology for identify...
Gesture recognition using artificial neural network,a technology for identify...NidhinRaj Saikripa
 
Sign Language Recognition with Gesture Analysis
Sign Language Recognition with Gesture AnalysisSign Language Recognition with Gesture Analysis
Sign Language Recognition with Gesture Analysispaperpublications3
 
A gesture recognition system for the Colombian sign language based on convolu...
A gesture recognition system for the Colombian sign language based on convolu...A gesture recognition system for the Colombian sign language based on convolu...
A gesture recognition system for the Colombian sign language based on convolu...journalBEEI
 
Pakistan sign language to Urdu translator using Kinect
Pakistan sign language to Urdu translator using KinectPakistan sign language to Urdu translator using Kinect
Pakistan sign language to Urdu translator using KinectCSITiaesprime
 
Hand Segmentation Techniques to Hand Gesture Recognition for Natural Human Co...
Hand Segmentation Techniques to Hand Gesture Recognition for Natural Human Co...Hand Segmentation Techniques to Hand Gesture Recognition for Natural Human Co...
Hand Segmentation Techniques to Hand Gesture Recognition for Natural Human Co...Waqas Tariq
 
Sign Language Recognition with Gesture Analysis
Sign Language Recognition with Gesture AnalysisSign Language Recognition with Gesture Analysis
Sign Language Recognition with Gesture Analysispaperpublications3
 

Similaire à Speech Recognition using HMM & GMM Models: A Review on Techniques and Approaches (20)

Novel Approach to Use HU Moments with Image Processing Techniques for Real Ti...
Novel Approach to Use HU Moments with Image Processing Techniques for Real Ti...Novel Approach to Use HU Moments with Image Processing Techniques for Real Ti...
Novel Approach to Use HU Moments with Image Processing Techniques for Real Ti...
 
Vision Based Approach to Sign Language Recognition
Vision Based Approach to Sign Language RecognitionVision Based Approach to Sign Language Recognition
Vision Based Approach to Sign Language Recognition
 
Real time Myanmar Sign Language Recognition System using PCA and SVM
Real time Myanmar Sign Language Recognition System using PCA and SVMReal time Myanmar Sign Language Recognition System using PCA and SVM
Real time Myanmar Sign Language Recognition System using PCA and SVM
 
Hand Gesture Recognition System for Human-Computer Interaction with Web-Cam
Hand Gesture Recognition System for Human-Computer Interaction with Web-CamHand Gesture Recognition System for Human-Computer Interaction with Web-Cam
Hand Gesture Recognition System for Human-Computer Interaction with Web-Cam
 
Iaetsd appearance based american sign language recognition
Iaetsd appearance based  american sign language recognitionIaetsd appearance based  american sign language recognition
Iaetsd appearance based american sign language recognition
 
hand gesture based interactive photo silder
hand gesture based interactive photo silderhand gesture based interactive photo silder
hand gesture based interactive photo silder
 
Computer Based Human Gesture Recognition With Study Of Algorithms
Computer Based Human Gesture Recognition With Study Of AlgorithmsComputer Based Human Gesture Recognition With Study Of Algorithms
Computer Based Human Gesture Recognition With Study Of Algorithms
 
Recognition of Facial Emotions Based on Sparse Coding
Recognition of Facial Emotions Based on Sparse CodingRecognition of Facial Emotions Based on Sparse Coding
Recognition of Facial Emotions Based on Sparse Coding
 
Design and Development of Motion Based Continuous Sign Language Detection
Design and Development of Motion Based Continuous Sign Language DetectionDesign and Development of Motion Based Continuous Sign Language Detection
Design and Development of Motion Based Continuous Sign Language Detection
 
HSV Brightness Factor Matching for Gesture Recognition System
HSV Brightness Factor Matching for Gesture Recognition SystemHSV Brightness Factor Matching for Gesture Recognition System
HSV Brightness Factor Matching for Gesture Recognition System
 
Paper
PaperPaper
Paper
 
Natural Hand Gestures Recognition System for Intelligent HCI: A Survey
Natural Hand Gestures Recognition System for Intelligent HCI: A SurveyNatural Hand Gestures Recognition System for Intelligent HCI: A Survey
Natural Hand Gestures Recognition System for Intelligent HCI: A Survey
 
Nikppt
NikpptNikppt
Nikppt
 
Gesture Recognition Technology
Gesture Recognition TechnologyGesture Recognition Technology
Gesture Recognition Technology
 
Gesture recognition using artificial neural network,a technology for identify...
Gesture recognition using artificial neural network,a technology for identify...Gesture recognition using artificial neural network,a technology for identify...
Gesture recognition using artificial neural network,a technology for identify...
 
Sign Language Recognition with Gesture Analysis
Sign Language Recognition with Gesture AnalysisSign Language Recognition with Gesture Analysis
Sign Language Recognition with Gesture Analysis
 
A gesture recognition system for the Colombian sign language based on convolu...
A gesture recognition system for the Colombian sign language based on convolu...A gesture recognition system for the Colombian sign language based on convolu...
A gesture recognition system for the Colombian sign language based on convolu...
 
Pakistan sign language to Urdu translator using Kinect
Pakistan sign language to Urdu translator using KinectPakistan sign language to Urdu translator using Kinect
Pakistan sign language to Urdu translator using Kinect
 
Hand Segmentation Techniques to Hand Gesture Recognition for Natural Human Co...
Hand Segmentation Techniques to Hand Gesture Recognition for Natural Human Co...Hand Segmentation Techniques to Hand Gesture Recognition for Natural Human Co...
Hand Segmentation Techniques to Hand Gesture Recognition for Natural Human Co...
 
Sign Language Recognition with Gesture Analysis
Sign Language Recognition with Gesture AnalysisSign Language Recognition with Gesture Analysis
Sign Language Recognition with Gesture Analysis
 

Plus de ijsrd.com

IoT Enabled Smart Grid
IoT Enabled Smart GridIoT Enabled Smart Grid
IoT Enabled Smart Gridijsrd.com
 
A Survey Report on : Security & Challenges in Internet of Things
A Survey Report on : Security & Challenges in Internet of ThingsA Survey Report on : Security & Challenges in Internet of Things
A Survey Report on : Security & Challenges in Internet of Thingsijsrd.com
 
IoT for Everyday Life
IoT for Everyday LifeIoT for Everyday Life
IoT for Everyday Lifeijsrd.com
 
Study on Issues in Managing and Protecting Data of IOT
Study on Issues in Managing and Protecting Data of IOTStudy on Issues in Managing and Protecting Data of IOT
Study on Issues in Managing and Protecting Data of IOTijsrd.com
 
Interactive Technologies for Improving Quality of Education to Build Collabor...
Interactive Technologies for Improving Quality of Education to Build Collabor...Interactive Technologies for Improving Quality of Education to Build Collabor...
Interactive Technologies for Improving Quality of Education to Build Collabor...ijsrd.com
 
Internet of Things - Paradigm Shift of Future Internet Application for Specia...
Internet of Things - Paradigm Shift of Future Internet Application for Specia...Internet of Things - Paradigm Shift of Future Internet Application for Specia...
Internet of Things - Paradigm Shift of Future Internet Application for Specia...ijsrd.com
 
A Study of the Adverse Effects of IoT on Student's Life
A Study of the Adverse Effects of IoT on Student's LifeA Study of the Adverse Effects of IoT on Student's Life
A Study of the Adverse Effects of IoT on Student's Lifeijsrd.com
 
Pedagogy for Effective use of ICT in English Language Learning
Pedagogy for Effective use of ICT in English Language LearningPedagogy for Effective use of ICT in English Language Learning
Pedagogy for Effective use of ICT in English Language Learningijsrd.com
 
Virtual Eye - Smart Traffic Navigation System
Virtual Eye - Smart Traffic Navigation SystemVirtual Eye - Smart Traffic Navigation System
Virtual Eye - Smart Traffic Navigation Systemijsrd.com
 
Ontological Model of Educational Programs in Computer Science (Bachelor and M...
Ontological Model of Educational Programs in Computer Science (Bachelor and M...Ontological Model of Educational Programs in Computer Science (Bachelor and M...
Ontological Model of Educational Programs in Computer Science (Bachelor and M...ijsrd.com
 
Understanding IoT Management for Smart Refrigerator
Understanding IoT Management for Smart RefrigeratorUnderstanding IoT Management for Smart Refrigerator
Understanding IoT Management for Smart Refrigeratorijsrd.com
 
DESIGN AND ANALYSIS OF DOUBLE WISHBONE SUSPENSION SYSTEM USING FINITE ELEMENT...
DESIGN AND ANALYSIS OF DOUBLE WISHBONE SUSPENSION SYSTEM USING FINITE ELEMENT...DESIGN AND ANALYSIS OF DOUBLE WISHBONE SUSPENSION SYSTEM USING FINITE ELEMENT...
DESIGN AND ANALYSIS OF DOUBLE WISHBONE SUSPENSION SYSTEM USING FINITE ELEMENT...ijsrd.com
 
A Review: Microwave Energy for materials processing
A Review: Microwave Energy for materials processingA Review: Microwave Energy for materials processing
A Review: Microwave Energy for materials processingijsrd.com
 
Web Usage Mining: A Survey on User's Navigation Pattern from Web Logs
Web Usage Mining: A Survey on User's Navigation Pattern from Web LogsWeb Usage Mining: A Survey on User's Navigation Pattern from Web Logs
Web Usage Mining: A Survey on User's Navigation Pattern from Web Logsijsrd.com
 
APPLICATION OF STATCOM to IMPROVED DYNAMIC PERFORMANCE OF POWER SYSTEM
APPLICATION OF STATCOM to IMPROVED DYNAMIC PERFORMANCE OF POWER SYSTEMAPPLICATION OF STATCOM to IMPROVED DYNAMIC PERFORMANCE OF POWER SYSTEM
APPLICATION OF STATCOM to IMPROVED DYNAMIC PERFORMANCE OF POWER SYSTEMijsrd.com
 
Making model of dual axis solar tracking with Maximum Power Point Tracking
Making model of dual axis solar tracking with Maximum Power Point TrackingMaking model of dual axis solar tracking with Maximum Power Point Tracking
Making model of dual axis solar tracking with Maximum Power Point Trackingijsrd.com
 
A REVIEW PAPER ON PERFORMANCE AND EMISSION TEST OF 4 STROKE DIESEL ENGINE USI...
A REVIEW PAPER ON PERFORMANCE AND EMISSION TEST OF 4 STROKE DIESEL ENGINE USI...A REVIEW PAPER ON PERFORMANCE AND EMISSION TEST OF 4 STROKE DIESEL ENGINE USI...
A REVIEW PAPER ON PERFORMANCE AND EMISSION TEST OF 4 STROKE DIESEL ENGINE USI...ijsrd.com
 
Study and Review on Various Current Comparators
Study and Review on Various Current ComparatorsStudy and Review on Various Current Comparators
Study and Review on Various Current Comparatorsijsrd.com
 
Reducing Silicon Real Estate and Switching Activity Using Low Power Test Patt...
Reducing Silicon Real Estate and Switching Activity Using Low Power Test Patt...Reducing Silicon Real Estate and Switching Activity Using Low Power Test Patt...
Reducing Silicon Real Estate and Switching Activity Using Low Power Test Patt...ijsrd.com
 
Defending Reactive Jammers in WSN using a Trigger Identification Service.
Defending Reactive Jammers in WSN using a Trigger Identification Service.Defending Reactive Jammers in WSN using a Trigger Identification Service.
Defending Reactive Jammers in WSN using a Trigger Identification Service.ijsrd.com
 

Plus de ijsrd.com (20)

IoT Enabled Smart Grid
IoT Enabled Smart GridIoT Enabled Smart Grid
IoT Enabled Smart Grid
 
A Survey Report on : Security & Challenges in Internet of Things
A Survey Report on : Security & Challenges in Internet of ThingsA Survey Report on : Security & Challenges in Internet of Things
A Survey Report on : Security & Challenges in Internet of Things
 
IoT for Everyday Life
IoT for Everyday LifeIoT for Everyday Life
IoT for Everyday Life
 
Study on Issues in Managing and Protecting Data of IOT
Study on Issues in Managing and Protecting Data of IOTStudy on Issues in Managing and Protecting Data of IOT
Study on Issues in Managing and Protecting Data of IOT
 
Interactive Technologies for Improving Quality of Education to Build Collabor...
Interactive Technologies for Improving Quality of Education to Build Collabor...Interactive Technologies for Improving Quality of Education to Build Collabor...
Interactive Technologies for Improving Quality of Education to Build Collabor...
 
Internet of Things - Paradigm Shift of Future Internet Application for Specia...
Internet of Things - Paradigm Shift of Future Internet Application for Specia...Internet of Things - Paradigm Shift of Future Internet Application for Specia...
Internet of Things - Paradigm Shift of Future Internet Application for Specia...
 
A Study of the Adverse Effects of IoT on Student's Life
A Study of the Adverse Effects of IoT on Student's LifeA Study of the Adverse Effects of IoT on Student's Life
A Study of the Adverse Effects of IoT on Student's Life
 
Pedagogy for Effective use of ICT in English Language Learning
Pedagogy for Effective use of ICT in English Language LearningPedagogy for Effective use of ICT in English Language Learning
Pedagogy for Effective use of ICT in English Language Learning
 
Virtual Eye - Smart Traffic Navigation System
Virtual Eye - Smart Traffic Navigation SystemVirtual Eye - Smart Traffic Navigation System
Virtual Eye - Smart Traffic Navigation System
 
Ontological Model of Educational Programs in Computer Science (Bachelor and M...
Ontological Model of Educational Programs in Computer Science (Bachelor and M...Ontological Model of Educational Programs in Computer Science (Bachelor and M...
Ontological Model of Educational Programs in Computer Science (Bachelor and M...
 
Understanding IoT Management for Smart Refrigerator
Understanding IoT Management for Smart RefrigeratorUnderstanding IoT Management for Smart Refrigerator
Understanding IoT Management for Smart Refrigerator
 
DESIGN AND ANALYSIS OF DOUBLE WISHBONE SUSPENSION SYSTEM USING FINITE ELEMENT...
DESIGN AND ANALYSIS OF DOUBLE WISHBONE SUSPENSION SYSTEM USING FINITE ELEMENT...DESIGN AND ANALYSIS OF DOUBLE WISHBONE SUSPENSION SYSTEM USING FINITE ELEMENT...
DESIGN AND ANALYSIS OF DOUBLE WISHBONE SUSPENSION SYSTEM USING FINITE ELEMENT...
 
A Review: Microwave Energy for materials processing
A Review: Microwave Energy for materials processingA Review: Microwave Energy for materials processing
A Review: Microwave Energy for materials processing
 
Web Usage Mining: A Survey on User's Navigation Pattern from Web Logs
Web Usage Mining: A Survey on User's Navigation Pattern from Web LogsWeb Usage Mining: A Survey on User's Navigation Pattern from Web Logs
Web Usage Mining: A Survey on User's Navigation Pattern from Web Logs
 
APPLICATION OF STATCOM to IMPROVED DYNAMIC PERFORMANCE OF POWER SYSTEM
APPLICATION OF STATCOM to IMPROVED DYNAMIC PERFORMANCE OF POWER SYSTEMAPPLICATION OF STATCOM to IMPROVED DYNAMIC PERFORMANCE OF POWER SYSTEM
APPLICATION OF STATCOM to IMPROVED DYNAMIC PERFORMANCE OF POWER SYSTEM
 
Making model of dual axis solar tracking with Maximum Power Point Tracking
Making model of dual axis solar tracking with Maximum Power Point TrackingMaking model of dual axis solar tracking with Maximum Power Point Tracking
Making model of dual axis solar tracking with Maximum Power Point Tracking
 
A REVIEW PAPER ON PERFORMANCE AND EMISSION TEST OF 4 STROKE DIESEL ENGINE USI...
A REVIEW PAPER ON PERFORMANCE AND EMISSION TEST OF 4 STROKE DIESEL ENGINE USI...A REVIEW PAPER ON PERFORMANCE AND EMISSION TEST OF 4 STROKE DIESEL ENGINE USI...
A REVIEW PAPER ON PERFORMANCE AND EMISSION TEST OF 4 STROKE DIESEL ENGINE USI...
 
Study and Review on Various Current Comparators
Study and Review on Various Current ComparatorsStudy and Review on Various Current Comparators
Study and Review on Various Current Comparators
 
Reducing Silicon Real Estate and Switching Activity Using Low Power Test Patt...
Reducing Silicon Real Estate and Switching Activity Using Low Power Test Patt...Reducing Silicon Real Estate and Switching Activity Using Low Power Test Patt...
Reducing Silicon Real Estate and Switching Activity Using Low Power Test Patt...
 
Defending Reactive Jammers in WSN using a Trigger Identification Service.
Defending Reactive Jammers in WSN using a Trigger Identification Service.Defending Reactive Jammers in WSN using a Trigger Identification Service.
Defending Reactive Jammers in WSN using a Trigger Identification Service.
 

Dernier

University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdfKamal Acharya
 
Vivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design SpainVivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design Spaintimesproduction05
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdfankushspencer015
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdfSuman Jyoti
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...ranjana rawat
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college projectTonystark477637
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...Call Girls in Nagpur High Profile
 
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLPVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLManishPatel169454
 
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringmulugeta48
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VDineshKumar4165
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 

Dernier (20)

University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
Vivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design SpainVivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design Spain
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLPVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
 
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineering
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 

Speech Recognition using HMM & GMM Models: A Review on Techniques and Approaches

  • 1. IJSRD - International Journal for Scientific Research & Development| Vol. 1, Issue 3, 2013 | ISSN (online): 2321-0613 All rights reserved by www.ijsrd.com 617 Speech Recognition using HMM & GMM Models: A Review on Techniques and Approaches Ritenkumar R. Patel1 Prof. M. A. Lokhandwala2 Prof. Mitul Patel3 1 M. E. student 2, 3 Assistant Professor 1, 2, 3 E & C Department 1, 2, 3 Parul institute of Engineering & Technology, Limda, Waghodia, Gujarat, India Abstract— Many ways of communications are used between human and computer, while using gesture is considered to be one of the most natural ways in a virtual reality system. Speech recognition is one of the typical methods of non-verbal communication for human beings and we naturally use various gestures to express our own intentions in everyday life. Gesture recognizers are supposed to capture and analyze the information transmitted by the hands of a person who communicates in sign language. This is a prerequisite for automatic sign-to-spoken-language translation, which has the potential to support the integration of deaf people into society. This paper present part of literature review on ongoing research and findings on different technique and approaches in gesture recognition using Hidden Markov Models for vision-based approach. Keywords: sign language, gesture recognition, hidden markov model I. INTRODUCTION The video-based approaches claim to allow the signer to move freely without any instrumentation attached to the body. Trajectory, hand shape and hand locations are tracked and detected by a camera (or an array of cameras). By doing so, the signer is constrained to sign in a closed, some- how controlled environment. The amount of data that has to be processed to extract and track hands in the image also imposes a restriction on memory, speed and complexity on the computer equipment. A recognition system that we proposed comprises of a framework of two most important entity, which are preprocessing (image processing) and recognition classification (artificial intelligence recognition) as both will have to work relatively in defining the meaning of an input sign language. Vision-based technique has been proposed as we are using single Matrox camera to collect gesture data. This technique can cover both face and hands signer in which signer does not need to wear data gloves device. All processing tasks are solved by using computer vision techniques which are more flexible and useful than devised based measurement approach. Since sign language is gesticulated fluently and interactively like other spoken languages, a sign language recognizer must able to recognize continuous sign vocabularies in real-time. The authors try to build such a system for the Bahasa Melayu Sign Language. Gesture in this paper will always refer to the hand. Basically, different technique and approaches in recognizing a gesture for sign language will be reviewed. II. PREVIOUS WORK Sensing of human expression is very important for human- computer interactive applications such as virtual reality, gesture recognition, and communication. In recent years, a large number of studies have been made on machine recognition of human expression. A number of systems have been implemented that recognize various types of sign languages. For example, Starner was able to recognize a distinct forty word lexicon consisting of pronouns, nouns, verbs, and adjectives taken from American Sign Language with accuracies of over 90% [10]. A system developed by Kadous [17] recognized a 95 word lexicon from Australian Sign Language and achieved accuracies of approximately 80%. Murakami and Taguchi were able to adequately recognize 42 finger-alphabet symbols and 10 sign-language words taken from Japanese Sign Language [19]. Lu [21] and Matuso [18] have also developed systems for recognizing a subset of Japanese Sign Language. Finally, Takahashi and Kishino were able to accurately recognize 34 speech recognition is used in the Japanese Kana Manual Alphabet [20]. A common approach to describing human motion is to use a state-based model, such as a Hidden Markov Model (HMM), to convert a series of motions into a description of activity. Such systems [1, 2, 3, 4, 5, 6] operate by training a HMM (or some variant thereon) to parse a stream of short-term tracked motions. Another system [3] recognizes simple human gestures (against a black backdrop), while an earlier system [2] recognizes simple actions (pick up, put down, push, pull, etc.), also based on a trained HMM. An early effort by Yamato et al. [7] uses discrete HMM’s to successfully recognize image sequences of six different tennis strokes among three subjects. This experiment is significant because it used a 25x25 pixel quantized sub sampled camera image as a feature vector that is converted into a discrete label by a vector quantizer. The labels are classified based on discrete HMMs. Even with such low-level information, the model could learn the set of motions to perform respectable recognition rates. According to Iwai, Shimizu, and Yachida [8], a lot of methods for the recognition of gestures from images have been proposed. Takahasi used spatiotemporal edges for the recognition of gestures based on a DP matching method.
  • 2. Speech Recognition using HMM & GMM Models: A Review on Techniques and Approaches (IJSRD/Vol. 1/Issue 3/2013/0055) All rights reserved by www.ijsrd.com 618 Yamato used matrices sampled sparsely from images and Hid-den Markov Models for the recognition. The sparse matrices are quantized by the Categorized VQ method and are used as input for a discrete HMM. Campbell used the 3- D positions and 3-D motion of face, left hand, and right hand extracted from stereo images and HMM for the recognition. HMM method has a better performance than DP matching method for recognition of public gestures because of statistically learning. Research by Starner and Pentland did not attempt to capture detailed information about the hands [9, 10]. In this recognition system, sentences of the form (‘personal pronoun, verb, noun, adjective, (the same) personal pronoun” are to be recognized with a vocabulary of 40 signs. These systems have mostly concentrated on finger signing, where the user spells each word with hand signs corresponding to the letters of the alphabet. They proposed an extensible system which uses a single color camera to track hands in real time and interprets American Sign Language (ASL) using HMM. The hand tracking stage of the system does not attempt to produce a fine grain description of hand shape; studies have shown that such detailed information may not be necessary for humans to interpret sign language [11, 12]. Instead, the tracking process produces only a coarse description of hand shape, orientation, and trajectory as inputs to HMMs. Signs are modeled with four-states-HMM. They use a single camera and plain cotton gloves in two colors or no gloves for a second test. The shape, orientation, and trajectory information is then input to a HMM for recognition of the signed words. They achieve recognition accuracies between 75% and 99% allowing only the simplest syntactical structures. In [2] Siskind and Morris conjecture that human event perception does not presuppose object recognition. In other words, they think visual event recognition is performed by a visual pathway which is separated from object recognition. To verify the conjecture, they analyze motion profiles of objects that participate in different simple spatial motion events. Their tracker uses a mixture of color based and motion based techniques. Color based techniques are used to track objects defined by set of colored pixels whose saturation and value are above certain thresholds in each frame. These pixels are then clustered into regions using a histogram based on hue. Moving pixels are extracted from frame differences and divided into clusters based on proximity. Next, each region (generated by color or motion) in each frame is abstracted by an ellipse. Finally, feature vector for each frame is generated by computing the absolute and relative ellipse positions, orientations, velocities and accelerations. To classify visual events, they use a set of Hidden Markov Models (HMMs) which are used as generative models and trained on movies of each visual event represented by a set of feature vectors. After training, a new observation is classified as being generated by the model that assigns the highest likelihood. Experiments on a set of 6 simple gestures, “pick up,” “put down,” “push,” “pull,” “drop,” and “throw,” demonstrate that gestures can be classified based on motion profiles. Sorrentino, Cuzzolin and Frezza [13] present an original technique for speech recognitio based on a dynamic shape representation by combining size functions and hidden Markov models (HMM). Each gesture is described by a different probabilistic finite state machine which models a succession of so called canonical postures of the hand. Conceptually, they model a gesture as a sequence of hand postures which project into sets of size functions. The state dynamics describe the transition between canonical postures while the observation equations are maps from the set of canonical postures to size functions. In their proposed technique, a fundamental hypothesis has been made; gestures may be modeled as sequences of a finite number of “canonical" postures of the hand. Each posture is associated to a state of a probabilistic finite state machine, in particular of a Hidden Markov Model. Each gesture is identified with a HMM with an appropriate number of states and transition probabilities. They come with the result that HMM paradigm deserves to be confirmed, whereas future work should concentrate upon a better shape representation and also the measuring functions proposed in [14] must be improved since they are particularly sensitive to small changes in the position of the center of mass of the images. Further developments on integrating the qualitative modeling philosophy presented in this paper with quantitative techniques which describe the 27 degrees of freedom kinematics of the hand. Chen, Fu and Huang in [15] have introduced a speech recognition system to recognize continuous gesture before stationary background using 2D video input. Reliable method of image processes has been applied: motion detection, skin color extraction, edge detection, movement justification and background subtraction as a real-time image processing subsystem. The system consists of four modules: a real time hand tracking and extraction, feature extraction, hidden Markov model (HMM) training, and gesture recognition. First, they apply a real-time hand tracking and extraction algorithm to trace the moving hand and extract the hand region, then the Fourier descriptor (FD) has been used to characterize spatial features and the motion analysis to characterize the temporal features. A spatial and temporal feature has been combined of the input image sequence as our feature vector. After the feature vectors have been extracted, we apply HMMs to recognize the input gesture. The gesture to be recognized is separately scored against different HMMs. The model with the highest score indicates the corresponding gesture. In the experiments, we have tested our system to recognize 20 different gestures, and the recognizing rate is above 90%. Garver [16] has come out with his research to construct interface environment that allowing mouse control through pointing gestures and commands executed by simple arm movements as the goal of having natural gestures as a modality for basic computer control. Hand segmentation system has been performed for individual frames. The hand segmentation system is made up of three image processing algorithms: dynamic background subtraction, statistical skin color segmentation, and depth thresholding. Using the basic filters for segmentation and locating hands using a very simple blob extraction algorithm the tracking system is ready to use. HMM has been proposed for command gesture system to trained gesture classification. After preparing the gesture for executing a
  • 3. Speech Recognition using HMM & GMM Models: A Review on Techniques and Approaches (IJSRD/Vol. 1/Issue 3/2013/0055) All rights reserved by www.ijsrd.com 619 command, produced gesture sequence that is then processed by a series of HMMs. Each HMM represents a specific gesture, and the performed gesture is classified as the representative model with the highest probability of correspondence. However only the hand segmentation and tracking follow well. The initial classification of gestures as pointing or command works with reasonable accuracy, need to be improved. III. GESTURE AND HIDDEN MARKOV MODEL A. Speech recognition The vast majority of automatic recognition systems are for deictic gestures (pointing), emblematic gestures (isolated signs) and sign languages (with a limited vocabulary and syntax). Some are components of bimodal systems, integrated with speech recognition. Some produce precise hand and arm configuration while others only coarse motion. The kinematics of the posture of a human hand can be described by a mechanical system with 27 degrees of freedom [1, 2]. Common experience suggests, however, that a gesture may be characterized by a sequence of only a finite number of hand postures and that, therefore, it is not necessary to describe hand posture as a continuum. Feature extraction and analysis is a robust way to recognize hand postures and gestures. It can be used not only to recognize simple hand postures and gestures but complex ones as well. Its main drawback is that it can become computationally expensive when a large number of features are being collected, which slows down system response time. This slowdown is extremely significant in virtual environment applications, but should diminish with the advent of faster machines. The concept of gesture is loosely defined, and depends on the context of the interaction. Recognition of natural, continuous gestures requires temporally segmenting gestures. Automatically segmenting gestures is difficult, and is often finessed or ignored in current systems by requiring a starting position in time and/or space. Similar to this is the problem of distinguishing intentional gestures from other “random” movements. There is no standard way to do gesture recognition – a variety of representations and classification schemes are used. However, most gesture recognition systems share some common structure. B. Overview of Hidden Markov Model HMM were introduced in the mid 1990’s, and quickly became the recognition method of choice, due to its implicit solution to the segmentation problem. In describing hidden Markov models it is convenient first to consider Markov chains. Markov chains are simply finite-state automata in which each state transition arc has an associated probability value; the probability values of the arcs leaving a single state sum to one. Markov chains impose the restriction on the finite-state automaton that a state can have only one transition arc with a given output; a restriction that makes Markov chains deterministic. A hidden Markov model (HMM) can be considered a generalization of a Markov chain without this Markov-chain restriction [23]. Since HMMs can have more than one arc with the same output symbol, they are nondeterministic, and it is impossible to directly determine the state sequence for a set of inputs simply by looking at the output (hence the “hidden” in “hidden Markov model”). More formally, a HMM is defined as a set of states of which one state is the initial state, a set of output symbols, and a set of state transitions. Each state transition is represented by the state from which the transition starts, the state to which transition moves, the output symbol generated, and the probability that the transition is taken [23]. In the context of speech recognition, each state could represent a set of possible hand positions. The state transitions represent the probability that a certain hand position transitions into another; the corresponding output symbol represents a specific posture and sequences of output symbols represent speech recognition. One then uses a group of HMMs, one for each gesture, and runs a sequence of input data through each HMM. The input data, derived from pixels in a vision- based solution or from bend sensor values in a glove-based solution, can be represented in many different ways, the most common by feature vectors [9]. The HMM with the highest forward probability (described later in this section) determines the users’ most likely gesture. An HMM can also be used for hand posture recognition; see Liang and Ouhyoung [24] for details. Gaolin Fang and Wen Gao [25] stated that there are some limitations of classical HMMs for sign language recognition. Firstly, it is the assumption that the distributions of individual observation parameters can be well represented as a mixture of Gaussian or autoregressive densities. This assumption isn’t always consistent with the fact. Secondly, HMMs have the poorer discrimination than neural networks. In the HMMs training, each word model is estimated separately using the corresponding labeled training observation sequences without considering the confused data (other models with similar behavior). IV. RESEARCH METHODOLOGY Current phase of our development is still under image processing. Hand region has been successfully detected by implementing five steps image processing as shown in figure 1 under the preprocessing stage. Fig. 1: Basic system flow of recognition system
  • 4. Speech Recognition using HMM & GMM Models: A Review on Techniques and Approaches (IJSRD/Vol. 1/Issue 3/2013/0055) All rights reserved by www.ijsrd.com 620 Thus, review on technique for gesture recognition using HMM will contribute the idea on how we are going to model our gesture. Future plan on the image processing will be discussed on the next part as the output of it will be used for the recognition process. The output is supposed to give much information to model the gesture and perform recognition process. V. DISCUSSION The main requirement for the use of gesture recognition in human-computer interfaces is a continuous online recognition with a minimum time delay of the recognition output, but such a system performance is extremely difficult to achieve, because of the following reasons: The fact that the starting and ending point of the gesture is not known is a very complicated problem. A frame captured by the frame grabber has to be immediately preprocessed and the features have to be extracted and used for the classification, before the next frame is captured. Assumption on the lighting, background, signers location need to be done since the Matrox camera that we use capable of capturing images into grayscale image. VI. CONCLUSION AND FUTURE DIRECTION In this paper, a few literature reviews on technique of gesture recognition using HMM has been reviewed and analyzed. Most of them are using colored images as better result can be achieved. As the input of gesture recognition, further process need to be done on the detected hand. Edge detection is proposed. Age detection used to mark points at which image intensity changes sharply. So we will get only the hand region. Hand motion, posture and orientation are other areas that need to be explore. Combination of hand detection with those will be useful input for recognition process. REFERENCES [1]. H. Buxton, “Learning and understanding dynamic scene activity: a review,” Image and Vision Computing, 21(1):125–136, January 2003. [2]. J.M. Siskind and Q. Morris, “A maximum-likelihood approach to visual event classification,” In Proc. 4th European Conference on Computer Vision, pages II: 347–360, 1996. [3]. Y.A. Ivanov and A.F. Bobick, “Recognition of visual activities and interactions by stochastic parsing,” IEEE Trans. Pattern Analysis and Machine Intelligence, 22(8):852–872, August 2000. [4]. J. Lou, Q. Liu, T. Tan, and W. Hu, “Semantic interpretation of object activities in a surveillance system,” In Proc. International Conference on Pattern Recognition, pages III: 777–780, 2002. [5]. N.M. Oliver, B. Rosario, and A.P. Pentland, “A Bayesian computer vision system for modeling human interactions,” IEEE Trans. Pattern Analysis and Machine Intelligence, 22(8):831–843, August 2000. [6]. V. Nair and J.J. Clark, “Automated visual surveillance using hidden markov models,” In International Conference on Vision Interface, pages 88–93, 2002. [7]. J. Yamato, J. Ohya, and K. Ishii, “Recognizing human action in time-sequential images using hidden markov models,” In Proc. 1992 ICCV, p. 379-385. [8]. Y.Iwai, H.Shimuzu, and M.Yachida, “Real-Time Context-based Gesture Recognition Using HMM and Automation,” Department of Systems and Human Science, Osaka University, Toyonaka, Osaka 560- 8531, JAPAN, iwai@sys.es.osaka-u.ac.jp [9]. A T. Staner and A. Pentland, “Visual Recognition of American Sign Language using Hidden Markov Models,” Technical Report TR-306, Media Lab, MIT, 1995 [10]. A T. Staner and A. Pentland, “Real-Time American Sign Language Recognition from Video using Hidden Markov Models,” Technical Report TR-306, Media Lab, MIT, 1995 [11]. H. Poizner, U. Bellugi, and V. Lutes-Driscoll, “Perception of american sign language in dynamic point-light displays,” 7: 430-440, 1981. [12]. G. Sperling, M. Landy, Y. Cohen, and M. Pavel, “Intelligible encoding of ASL image sequences at extremely low information rates,” Comp. Vision, Graphics, and Image Proc., 31:335-391, 1985. [13]. A.Sorrentino F.Cuzzolin, and R.Frezza, “Using Hidden Markov Models and Dynamic Size Functions for Gesture Recognition,” British Machine Vision Conference, 1997. [14]. A. Verri and C. Uras. “Metric-topological approach to shape representation and recognition,” Image Vis. C., 14 (3):189–207, 1996. [15]. Sheng Chen, Chih-Ming Chu, Chung-Lin Huang, “Speech recognition using Real Time Tracking Method and Hidden Markov models,” Institute of Electrical Engineering, National Tsing Hua University, Hsin Chu 300, Taiwan, ROC [16]. R.Garver, “Vision Based Gesture Recognition, Interaction Lab,” University of California at Santa Barbara [17]. Kadous, Waleed, “GRASP: Recognition of Australian Sign Language Using Instrumented Gloves,” Bachelor’s thesis, University of New South Wales, 1995. [18]. Matsuo, Hideaki, Seiji Igi, Shan Lu, Yuji Nagashima, Yuji Takata, and Terutaka Teshima, “The Recognition Algorithm with Non-contact for Japanese Sign Language Using Morphological Analysis,” In Proceedings of the International Gesture Workshop’97 , Berlin, 273-284, 1997. [19]. Murakami, Kouichi, and Hitomi, “Taguchi. Gesture Recognition Using Recurrent Neural Networks,” In Proceedings of CHI’91 Human Factors in Computing Systems, 237-242, 1991. [20]. Takahashi, Tomoichi, and Fumio Kishino, “Speech recognition Coding Based on Experiments Using a Speech recognition Interface Device,” SIGCHI Bulletin 23(2):67-73, 1991. [21]. 21 Lu, Shan, Seiji Igi, Hideaki Matsuo, and Yuji Nagashima, “Towards a Dialogue System Based on
  • 5. Speech Recognition using HMM & GMM Models: A Review on Techniques and Approaches (IJSRD/Vol. 1/Issue 3/2013/0055) All rights reserved by www.ijsrd.com 621 Recognition and Synthesis of Japanese Sign Language,” In Proceedings of the International Gesture Workshop’97, Berlin, 259-272, 1997. [22]. 22 J.J. LaViola Jr, A survey of Hand Posture and Gesture Recognition Techniques and Technology, Department of Computer Science, Brown University, Providence, Rhode Island 02912, June 1999 [23]. 23 Charniak, Eugene. Statistical Language Learning.MIT Press, Cambridge, 1993. [24]. 24 Liang, Rung-Huei, and Ming Ouhyoung. A Sign Language Recognition System Using Hidden Markov Model and Context Sensitive Search. In Proceedings of the ACM Symposium on Virtual Reality Software and Technology’96, ACM Press, 59-66, 1996. [25]. 25 Gaolin Fang, Wen Gao, Jiyong Ma. Signer- Independent Sign Language Recognition Based on SOFM/HMM Depatment of Computer Science and Engineering, Harbin Institute of Technology, Harbin, 150001, China