SlideShare une entreprise Scribd logo
1  sur  20
Visual-speech to text 
conversion applicable 
to telephone 
communication for deaf 
individuals 
30TH APRIL 2013
Visual-speech to text conversion applicable to telephone communication for deaf individuals 
INTRODUCTION 
 Lip-reading technique, 
 speech can be understood by interpreting 
movements of lips, face and tongue. 
 not one-to-one 
 Impossible to distinguish phonemes using 
visual information alone
Visual-speech to text conversion applicable to telephone communication for deaf individuals 
 the Cued Speech system 
 developed by Cornett 
 contains two components: 
the hand shape the hand position relative to the 
face. 
 Hand shapes- consonant phonemes 
 hand positions -vowel phonemes. 
 improves speech perception to a large extent
Visual-speech to text conversion applicable to telephone communication for deaf individuals 
the Cued Speech system
Visual-speech to text conversion applicable to telephone communication for deaf individuals 
AIM OF NEW SYSTEM 
 To investigate the designing of a system able to 
automatically recognize Cued Speech and convert it 
to text. 
 Possible for deaf or speech-impaired individuals to 
communicate with each other and also with normal-hearing 
persons 
 Using gestures 
 captured by devices equipped by a camera
Visual-speech to text conversion applicable to telephone communication for deaf individuals 
METHODS 
 Corpus, feature extraction, and 
statistical modeling 
 The speakers’ lips were painted blue, and color 
marks were placed on the speakers’ fingers. . 
 The data were derived from a video recording of 
the cuers pronouncing and coding in Cued 
Speech 
 landmarks with different colors were placed on 
the fingers
Visual-speech to text conversion applicable to telephone communication for deaf individuals 
 faster and more accurate image processing 
stage. 
 The audio part of the video recording was 
synchronized with the image. 
 An automatic image processing method was 
appliedli pt ow idththe ( Av)i,d eo 
 lip aperture (B), 
 lip area (S). 
 pinching of the upper lip (Bsup) 
 lower (Binf) lip
Visual-speech to text conversion applicable to telephone communication for deaf individuals 
 Concatenative feature fusion 
 Tracks and extracts the xy coordinates 
each time frame, 
 uses those values as features in the 
HMM modeling. 
 uses the concatenation of the 
synchronous lip shape and hand features 
as the joint feature vector given by,
Visual-speech to text conversion applicable to telephone communication for deaf individuals 
Joint lip hand 
feature vector, 
Lip shape 
feature vector, 
Hand feature 
vector, 
Dimensionality of the 
joint feature vector 
 Parameters used for lip 
shape modeling.
Visual-speech to text conversion applicable to telephone communication for deaf individuals 
RESULTS 
 Isolated word recognition 
1. Recognition in normal-hearing subject
Visual-speech to text conversion applicable to telephone communication for deaf individuals 
2. Recognition in deaf subject
Visual-speech to text conversion applicable to telephone communication for deaf individuals 
3. Multi-speaker isolated word recognition: 
 investigate whether it is possible to train speaker-independent 
HMMs for Cued Speech recognition. 
 The training data consisted of 750 words from the 
normal-hearing subject, and 750 words from the 
deaf subject. 
 For testing 700 words from normal-hearing subject 
and 700 words from the deaf subject were used, 
respectively. 
 Each state was modeled with a mixture of 4 
Gaussian distributions. 
 For lip shape and hand shape integration, 
concatenative feature fusion was used.
Visual-speech to text conversion applicable to telephone communication for deaf individuals
Visual-speech to text conversion applicable to telephone communication for deaf individuals 
4. Continuous phoneme recognition 
 Phoneme correct for continuous phoneme word 
recognition in the case of a normal-hearing subject.
Visual-speech to text conversion applicable to telephone communication for deaf individuals 
Phoneme correct for continuous phoneme word 
recognition in the case of a deaf subject.
Visual-speech to text conversion applicable to telephone communication for deaf individuals 
CONCLUSION 
 Hand shapes and lips shape were integrated 
using concatenative feature fusion and HMM-based 
automatic recognition was conducted. 
 For continuous phoneme recognition, a 86% 
phoneme correct was achieved for the normal-hearing 
cuer and a 82.7% phoneme correct for 
the dead cuer were achieved, respectively. 
 Speech in both normal-hearing and deaf 
subjects were also conducted obtaining a 
94.9% and a 89% accuracy, respectively. 
.
Visual-speech to text conversion applicable to telephone communication for deaf individuals 
CONCLUSION 
 A multi-speaker experiment using data 
from both normal-hearing and deaf subject 
showed a 89.6% word accuracy, on 
average. 
 This result indicates that training speaker-independent 
HMMs for Cued Speech using 
a large number of subjects should not face 
particular difficulties
Visual-speech to text conversion applicable to telephone communication for deaf individuals 
REFERENCES 
 G. Potamianos, C. Neti, G. Gravier, A. Garg, and A.W. Senior, 
“recent Advances in the automatic recognition of audiovisual 
speech,” in Proceedings of the IEEE, vol. 91, issue 9, pp. 
1306–1326, 2003. 
 S. Nakamura, K. Kumatani, and S. Tamura, “Multi-modal 
temporal asynchronicity modeling by product hmms for 
robust audio-visual speech recognition,” in Proceedings of 
Fourth IEEE International Conference on Multimodal 
Interfaces (ICMI’02), p. 305, 2002. 
 R. O. Cornett, “Cued speech,” American Annals of the Deaf, 
vol. 112, pp. 3–13, 1967. 
 J. Leybaert, “Phonology acquired through the eyes and 
spelling in deaf children,”Journal of Experimental Child 
Psychology, vol. 75, pp. 291– 318, 2000
Thank you!
Visual-speech to text conversion applicable to telephone communication for deaf individuals 
ANY 
QUESTION 
S?

Contenu connexe

Tendances

Speech Recognition
Speech RecognitionSpeech Recognition
Speech RecognitionHugo Moreno
 
Speech Recognition by Iqbal
Speech Recognition by IqbalSpeech Recognition by Iqbal
Speech Recognition by IqbalIqbal
 
Speech Recognition
Speech Recognition Speech Recognition
Speech Recognition Goa App
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminarDiptimaya Sarangi
 
Artificial intelligence Speech recognition system
Artificial intelligence Speech recognition systemArtificial intelligence Speech recognition system
Artificial intelligence Speech recognition systemREHMAT ULLAH
 
Speech recognition final presentation
Speech recognition final presentationSpeech recognition final presentation
Speech recognition final presentationhimanshubhatti
 
Voice To Text Presentation
Voice To Text PresentationVoice To Text Presentation
Voice To Text Presentationshahinmehr
 
Speech recognition project report
Speech recognition project reportSpeech recognition project report
Speech recognition project reportSarang Afle
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition TechnologySeminar Links
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognitionCharu Joshi
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognitionManthan Gandhi
 
speech processing and recognition basic in data mining
speech processing and recognition basic in  data miningspeech processing and recognition basic in  data mining
speech processing and recognition basic in data miningJimit Rupani
 
Deep Learning For Speech Recognition
Deep Learning For Speech RecognitionDeep Learning For Speech Recognition
Deep Learning For Speech Recognitionananth
 
Voice Recognition
Voice RecognitionVoice Recognition
Voice RecognitionAmrita More
 
Speech recognition techniques
Speech recognition techniquesSpeech recognition techniques
Speech recognition techniquessonukumar142
 
Speech recognition An overview
Speech recognition An overviewSpeech recognition An overview
Speech recognition An overviewsajanazoya
 

Tendances (20)

Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Speech Recognition by Iqbal
Speech Recognition by IqbalSpeech Recognition by Iqbal
Speech Recognition by Iqbal
 
Speech Recognition
Speech Recognition Speech Recognition
Speech Recognition
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminar
 
Artificial intelligence Speech recognition system
Artificial intelligence Speech recognition systemArtificial intelligence Speech recognition system
Artificial intelligence Speech recognition system
 
Speech recognition final presentation
Speech recognition final presentationSpeech recognition final presentation
Speech recognition final presentation
 
Voice To Text Presentation
Voice To Text PresentationVoice To Text Presentation
Voice To Text Presentation
 
VOICE BROWSER
VOICE BROWSERVOICE BROWSER
VOICE BROWSER
 
Speech recognition project report
Speech recognition project reportSpeech recognition project report
Speech recognition project report
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognition
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
 
speech processing and recognition basic in data mining
speech processing and recognition basic in  data miningspeech processing and recognition basic in  data mining
speech processing and recognition basic in data mining
 
Deep Learning For Speech Recognition
Deep Learning For Speech RecognitionDeep Learning For Speech Recognition
Deep Learning For Speech Recognition
 
Voice Recognition
Voice RecognitionVoice Recognition
Voice Recognition
 
Hand gesture recognition
Hand gesture recognitionHand gesture recognition
Hand gesture recognition
 
Speech Recognition System
Speech Recognition SystemSpeech Recognition System
Speech Recognition System
 
Speech recognition techniques
Speech recognition techniquesSpeech recognition techniques
Speech recognition techniques
 
Speech recognition An overview
Speech recognition An overviewSpeech recognition An overview
Speech recognition An overview
 
Sign language recognizer
Sign language recognizerSign language recognizer
Sign language recognizer
 

En vedette

Chat Room System using Java Swing
Chat Room System using Java SwingChat Room System using Java Swing
Chat Room System using Java SwingTejas Garodia
 
Maestria en gestion de la innovación uniminuto
Maestria en gestion de la innovación uniminutoMaestria en gestion de la innovación uniminuto
Maestria en gestion de la innovación uniminutoOscar Lunatico
 
10 pasos a la felicidad
10 pasos a la felicidad10 pasos a la felicidad
10 pasos a la felicidadlopaumoval
 
STOP DIABETES CON FLP PERU
STOP DIABETES CON FLP PERUSTOP DIABETES CON FLP PERU
STOP DIABETES CON FLP PERUVictor Ravines
 
Foro - Ley electoral
Foro - Ley electoralForo - Ley electoral
Foro - Ley electoralPorOtraCuba
 
The Future of News, Publishing, and Media (INMA 2010 Presentation)
The Future of News, Publishing, and Media (INMA 2010 Presentation)The Future of News, Publishing, and Media (INMA 2010 Presentation)
The Future of News, Publishing, and Media (INMA 2010 Presentation)Gerd Leonhard
 
Industrial Investment Engineering Presentation
Industrial Investment Engineering PresentationIndustrial Investment Engineering Presentation
Industrial Investment Engineering PresentationDavid1Mayagoitia
 
eCommerce Helsinki 2016_Anders Innovations & GlobalSign_16th march, 2016, Hel...
eCommerce Helsinki 2016_Anders Innovations & GlobalSign_16th march, 2016, Hel...eCommerce Helsinki 2016_Anders Innovations & GlobalSign_16th march, 2016, Hel...
eCommerce Helsinki 2016_Anders Innovations & GlobalSign_16th march, 2016, Hel...Timo Halima
 
Color vision
Color visionColor vision
Color visionguisbond
 
Better email response time using Microsoft Exchange 2013 with the Dell PowerE...
Better email response time using Microsoft Exchange 2013 with the Dell PowerE...Better email response time using Microsoft Exchange 2013 with the Dell PowerE...
Better email response time using Microsoft Exchange 2013 with the Dell PowerE...Principled Technologies
 
Agua de mar es salud
Agua de mar es saludAgua de mar es salud
Agua de mar es saludPrema Perez
 
13 insights (Troiano Branding)
13 insights (Troiano Branding)13 insights (Troiano Branding)
13 insights (Troiano Branding)Luis Rasquilha
 
Helden - Jugend gestern Und heute
Helden - Jugend gestern Und heuteHelden - Jugend gestern Und heute
Helden - Jugend gestern Und heuteMH1970
 
DAFO PERSONAL - Catedra Bancaja UPF-idec 12 feb2011 _lluis soldevila
DAFO PERSONAL - Catedra Bancaja UPF-idec 12 feb2011 _lluis soldevilaDAFO PERSONAL - Catedra Bancaja UPF-idec 12 feb2011 _lluis soldevila
DAFO PERSONAL - Catedra Bancaja UPF-idec 12 feb2011 _lluis soldevilaEmprèn UPF
 

En vedette (20)

Chat Room System using Java Swing
Chat Room System using Java SwingChat Room System using Java Swing
Chat Room System using Java Swing
 
Maestria en gestion de la innovación uniminuto
Maestria en gestion de la innovación uniminutoMaestria en gestion de la innovación uniminuto
Maestria en gestion de la innovación uniminuto
 
(2012-12-12) HIPERTENSION ARTERIAL (DOC)
(2012-12-12) HIPERTENSION ARTERIAL (DOC)(2012-12-12) HIPERTENSION ARTERIAL (DOC)
(2012-12-12) HIPERTENSION ARTERIAL (DOC)
 
Proyecto marketing móvil
Proyecto marketing móvilProyecto marketing móvil
Proyecto marketing móvil
 
resume
resumeresume
resume
 
Noches románticas
Noches románticasNoches románticas
Noches románticas
 
10 pasos a la felicidad
10 pasos a la felicidad10 pasos a la felicidad
10 pasos a la felicidad
 
STOP DIABETES CON FLP PERU
STOP DIABETES CON FLP PERUSTOP DIABETES CON FLP PERU
STOP DIABETES CON FLP PERU
 
Foro - Ley electoral
Foro - Ley electoralForo - Ley electoral
Foro - Ley electoral
 
The Future of News, Publishing, and Media (INMA 2010 Presentation)
The Future of News, Publishing, and Media (INMA 2010 Presentation)The Future of News, Publishing, and Media (INMA 2010 Presentation)
The Future of News, Publishing, and Media (INMA 2010 Presentation)
 
Industrial Investment Engineering Presentation
Industrial Investment Engineering PresentationIndustrial Investment Engineering Presentation
Industrial Investment Engineering Presentation
 
eCommerce Helsinki 2016_Anders Innovations & GlobalSign_16th march, 2016, Hel...
eCommerce Helsinki 2016_Anders Innovations & GlobalSign_16th march, 2016, Hel...eCommerce Helsinki 2016_Anders Innovations & GlobalSign_16th march, 2016, Hel...
eCommerce Helsinki 2016_Anders Innovations & GlobalSign_16th march, 2016, Hel...
 
Color vision
Color visionColor vision
Color vision
 
Actividad 2
Actividad 2Actividad 2
Actividad 2
 
Better email response time using Microsoft Exchange 2013 with the Dell PowerE...
Better email response time using Microsoft Exchange 2013 with the Dell PowerE...Better email response time using Microsoft Exchange 2013 with the Dell PowerE...
Better email response time using Microsoft Exchange 2013 with the Dell PowerE...
 
Agua de mar es salud
Agua de mar es saludAgua de mar es salud
Agua de mar es salud
 
13 insights (Troiano Branding)
13 insights (Troiano Branding)13 insights (Troiano Branding)
13 insights (Troiano Branding)
 
Helden - Jugend gestern Und heute
Helden - Jugend gestern Und heuteHelden - Jugend gestern Und heute
Helden - Jugend gestern Und heute
 
Historia De Mi Vida
Historia De Mi Vida Historia De Mi Vida
Historia De Mi Vida
 
DAFO PERSONAL - Catedra Bancaja UPF-idec 12 feb2011 _lluis soldevila
DAFO PERSONAL - Catedra Bancaja UPF-idec 12 feb2011 _lluis soldevilaDAFO PERSONAL - Catedra Bancaja UPF-idec 12 feb2011 _lluis soldevila
DAFO PERSONAL - Catedra Bancaja UPF-idec 12 feb2011 _lluis soldevila
 

Similaire à Visual speech to text conversion applicable to telephone communication

LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...IRJET Journal
 
lips _reading_nagham _salim compute.pptx
lips _reading_nagham _salim compute.pptxlips _reading_nagham _salim compute.pptx
lips _reading_nagham _salim compute.pptxnaghamallella
 
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTSEFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTSijnlc
 
Effect of MFCC Based Features for Speech Signal Alignments
Effect of MFCC Based Features for Speech Signal AlignmentsEffect of MFCC Based Features for Speech Signal Alignments
Effect of MFCC Based Features for Speech Signal Alignmentskevig
 
lips _reading _in computer_ vision_n.ppt
lips _reading _in computer_ vision_n.pptlips _reading _in computer_ vision_n.ppt
lips _reading _in computer_ vision_n.pptnaghamallella
 
Performance estimation based recurrent-convolutional encoder decoder for spee...
Performance estimation based recurrent-convolutional encoder decoder for spee...Performance estimation based recurrent-convolutional encoder decoder for spee...
Performance estimation based recurrent-convolutional encoder decoder for spee...karthik annam
 
Speech Recognition Application for the Speech Impaired using the Android-base...
Speech Recognition Application for the Speech Impaired using the Android-base...Speech Recognition Application for the Speech Impaired using the Android-base...
Speech Recognition Application for the Speech Impaired using the Android-base...TELKOMNIKA JOURNAL
 
LIP READING: VISUAL SPEECH RECOGNITION USING LIP READING
LIP READING: VISUAL SPEECH RECOGNITION USING LIP READINGLIP READING: VISUAL SPEECH RECOGNITION USING LIP READING
LIP READING: VISUAL SPEECH RECOGNITION USING LIP READINGIRJET Journal
 
INDIAN SIGN LANGUAGE TRANSLATION FOR HARD-OF-HEARING AND HARD-OF-SPEAKING COM...
INDIAN SIGN LANGUAGE TRANSLATION FOR HARD-OF-HEARING AND HARD-OF-SPEAKING COM...INDIAN SIGN LANGUAGE TRANSLATION FOR HARD-OF-HEARING AND HARD-OF-SPEAKING COM...
INDIAN SIGN LANGUAGE TRANSLATION FOR HARD-OF-HEARING AND HARD-OF-SPEAKING COM...IRJET Journal
 
Speech and Language Processing
Speech and Language ProcessingSpeech and Language Processing
Speech and Language ProcessingVikalp Mahendra
 
silent sound technology pdf
silent sound technology pdfsilent sound technology pdf
silent sound technology pdfrahul mishra
 
Procedia Computer Science 94 ( 2016 ) 295 – 301 Avail.docx
 Procedia Computer Science   94  ( 2016 )  295 – 301 Avail.docx Procedia Computer Science   94  ( 2016 )  295 – 301 Avail.docx
Procedia Computer Science 94 ( 2016 ) 295 – 301 Avail.docxaryan532920
 
Effect of Dynamic Time Warping on Alignment of Phrases and Phonemes
Effect of Dynamic Time Warping on Alignment of Phrases and PhonemesEffect of Dynamic Time Warping on Alignment of Phrases and Phonemes
Effect of Dynamic Time Warping on Alignment of Phrases and Phonemeskevig
 
Advances in Automatic Speech Recognition: From Audio-Only To Audio-Visual Sp...
Advances in Automatic Speech Recognition: From Audio-Only  To Audio-Visual Sp...Advances in Automatic Speech Recognition: From Audio-Only  To Audio-Visual Sp...
Advances in Automatic Speech Recognition: From Audio-Only To Audio-Visual Sp...IOSR Journals
 
EFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMES
EFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMESEFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMES
EFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMESkevig
 
Incremental Difference as Feature for Lipreading
Incremental Difference as Feature for LipreadingIncremental Difference as Feature for Lipreading
Incremental Difference as Feature for LipreadingIDES Editor
 
MULTILINGUAL SPEECH TO TEXT USING DEEP LEARNING BASED ON MFCC FEATURES
MULTILINGUAL SPEECH TO TEXT USING DEEP LEARNING BASED ON MFCC FEATURESMULTILINGUAL SPEECH TO TEXT USING DEEP LEARNING BASED ON MFCC FEATURES
MULTILINGUAL SPEECH TO TEXT USING DEEP LEARNING BASED ON MFCC FEATURESmlaij
 
Hearing by seeing: Can improving the visibility of the speaker's lips make yo...
Hearing by seeing: Can improving the visibility of the speaker's lips make yo...Hearing by seeing: Can improving the visibility of the speaker's lips make yo...
Hearing by seeing: Can improving the visibility of the speaker's lips make yo...HCI Lab
 

Similaire à Visual speech to text conversion applicable to telephone communication (20)

LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
 
lips _reading_nagham _salim compute.pptx
lips _reading_nagham _salim compute.pptxlips _reading_nagham _salim compute.pptx
lips _reading_nagham _salim compute.pptx
 
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTSEFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
EFFECT OF MFCC BASED FEATURES FOR SPEECH SIGNAL ALIGNMENTS
 
Effect of MFCC Based Features for Speech Signal Alignments
Effect of MFCC Based Features for Speech Signal AlignmentsEffect of MFCC Based Features for Speech Signal Alignments
Effect of MFCC Based Features for Speech Signal Alignments
 
lips _reading _in computer_ vision_n.ppt
lips _reading _in computer_ vision_n.pptlips _reading _in computer_ vision_n.ppt
lips _reading _in computer_ vision_n.ppt
 
Web AI.pptx
Web AI.pptxWeb AI.pptx
Web AI.pptx
 
Performance estimation based recurrent-convolutional encoder decoder for spee...
Performance estimation based recurrent-convolutional encoder decoder for spee...Performance estimation based recurrent-convolutional encoder decoder for spee...
Performance estimation based recurrent-convolutional encoder decoder for spee...
 
Speech Recognition Application for the Speech Impaired using the Android-base...
Speech Recognition Application for the Speech Impaired using the Android-base...Speech Recognition Application for the Speech Impaired using the Android-base...
Speech Recognition Application for the Speech Impaired using the Android-base...
 
LIP READING: VISUAL SPEECH RECOGNITION USING LIP READING
LIP READING: VISUAL SPEECH RECOGNITION USING LIP READINGLIP READING: VISUAL SPEECH RECOGNITION USING LIP READING
LIP READING: VISUAL SPEECH RECOGNITION USING LIP READING
 
INDIAN SIGN LANGUAGE TRANSLATION FOR HARD-OF-HEARING AND HARD-OF-SPEAKING COM...
INDIAN SIGN LANGUAGE TRANSLATION FOR HARD-OF-HEARING AND HARD-OF-SPEAKING COM...INDIAN SIGN LANGUAGE TRANSLATION FOR HARD-OF-HEARING AND HARD-OF-SPEAKING COM...
INDIAN SIGN LANGUAGE TRANSLATION FOR HARD-OF-HEARING AND HARD-OF-SPEAKING COM...
 
Speech and Language Processing
Speech and Language ProcessingSpeech and Language Processing
Speech and Language Processing
 
silent sound technology pdf
silent sound technology pdfsilent sound technology pdf
silent sound technology pdf
 
Procedia Computer Science 94 ( 2016 ) 295 – 301 Avail.docx
 Procedia Computer Science   94  ( 2016 )  295 – 301 Avail.docx Procedia Computer Science   94  ( 2016 )  295 – 301 Avail.docx
Procedia Computer Science 94 ( 2016 ) 295 – 301 Avail.docx
 
Mobile asl
Mobile aslMobile asl
Mobile asl
 
Effect of Dynamic Time Warping on Alignment of Phrases and Phonemes
Effect of Dynamic Time Warping on Alignment of Phrases and PhonemesEffect of Dynamic Time Warping on Alignment of Phrases and Phonemes
Effect of Dynamic Time Warping on Alignment of Phrases and Phonemes
 
Advances in Automatic Speech Recognition: From Audio-Only To Audio-Visual Sp...
Advances in Automatic Speech Recognition: From Audio-Only  To Audio-Visual Sp...Advances in Automatic Speech Recognition: From Audio-Only  To Audio-Visual Sp...
Advances in Automatic Speech Recognition: From Audio-Only To Audio-Visual Sp...
 
EFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMES
EFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMESEFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMES
EFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMES
 
Incremental Difference as Feature for Lipreading
Incremental Difference as Feature for LipreadingIncremental Difference as Feature for Lipreading
Incremental Difference as Feature for Lipreading
 
MULTILINGUAL SPEECH TO TEXT USING DEEP LEARNING BASED ON MFCC FEATURES
MULTILINGUAL SPEECH TO TEXT USING DEEP LEARNING BASED ON MFCC FEATURESMULTILINGUAL SPEECH TO TEXT USING DEEP LEARNING BASED ON MFCC FEATURES
MULTILINGUAL SPEECH TO TEXT USING DEEP LEARNING BASED ON MFCC FEATURES
 
Hearing by seeing: Can improving the visibility of the speaker's lips make yo...
Hearing by seeing: Can improving the visibility of the speaker's lips make yo...Hearing by seeing: Can improving the visibility of the speaker's lips make yo...
Hearing by seeing: Can improving the visibility of the speaker's lips make yo...
 

Plus de Swathi Venugopal

A new low cost shrm for adjustable-speed pump applications
A new low cost  shrm  for adjustable-speed pump applicationsA new low cost  shrm  for adjustable-speed pump applications
A new low cost shrm for adjustable-speed pump applicationsSwathi Venugopal
 
Harnessing high altitude wind power
Harnessing high altitude wind powerHarnessing high altitude wind power
Harnessing high altitude wind powerSwathi Venugopal
 
Micro stepping mode for stepper motor
Micro stepping mode for stepper motorMicro stepping mode for stepper motor
Micro stepping mode for stepper motorSwathi Venugopal
 
A Frequency-based RF Partial Discharge Detector for Low-power Wireless Sens...
A Frequency-based  RF Partial Discharge Detector  for Low-power Wireless Sens...A Frequency-based  RF Partial Discharge Detector  for Low-power Wireless Sens...
A Frequency-based RF Partial Discharge Detector for Low-power Wireless Sens...Swathi Venugopal
 
Estimation of induction motor operating power factor.
Estimation of induction motor operating power factor.Estimation of induction motor operating power factor.
Estimation of induction motor operating power factor.Swathi Venugopal
 
Save energy save enviornment ii
Save energy save enviornment iiSave energy save enviornment ii
Save energy save enviornment iiSwathi Venugopal
 
Grid integration issues and solutions
Grid integration issues and solutionsGrid integration issues and solutions
Grid integration issues and solutionsSwathi Venugopal
 

Plus de Swathi Venugopal (7)

A new low cost shrm for adjustable-speed pump applications
A new low cost  shrm  for adjustable-speed pump applicationsA new low cost  shrm  for adjustable-speed pump applications
A new low cost shrm for adjustable-speed pump applications
 
Harnessing high altitude wind power
Harnessing high altitude wind powerHarnessing high altitude wind power
Harnessing high altitude wind power
 
Micro stepping mode for stepper motor
Micro stepping mode for stepper motorMicro stepping mode for stepper motor
Micro stepping mode for stepper motor
 
A Frequency-based RF Partial Discharge Detector for Low-power Wireless Sens...
A Frequency-based  RF Partial Discharge Detector  for Low-power Wireless Sens...A Frequency-based  RF Partial Discharge Detector  for Low-power Wireless Sens...
A Frequency-based RF Partial Discharge Detector for Low-power Wireless Sens...
 
Estimation of induction motor operating power factor.
Estimation of induction motor operating power factor.Estimation of induction motor operating power factor.
Estimation of induction motor operating power factor.
 
Save energy save enviornment ii
Save energy save enviornment iiSave energy save enviornment ii
Save energy save enviornment ii
 
Grid integration issues and solutions
Grid integration issues and solutionsGrid integration issues and solutions
Grid integration issues and solutions
 

Dernier

Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...drmkjayanthikannan
 
Work-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptxWork-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptxJuliansyahHarahap1
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdfKamal Acharya
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfJiananWang21
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startQuintin Balsdon
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayEpec Engineered Technologies
 
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptxOrlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptxMuhammadAsimMuhammad6
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdfKamal Acharya
 
PE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiesPE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiessarkmank1
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VDineshKumar4165
 
Verification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptxVerification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptxchumtiyababu
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdfKamal Acharya
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapRishantSharmaFr
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueBhangaleSonal
 
kiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadkiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadhamedmustafa094
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . pptDineshKumar4165
 
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best ServiceTamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Servicemeghakumariji156
 
Computer Networks Basics of Network Devices
Computer Networks  Basics of Network DevicesComputer Networks  Basics of Network Devices
Computer Networks Basics of Network DevicesChandrakantDivate1
 

Dernier (20)

Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
 
Work-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptxWork-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptx
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdf
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
 
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptxOrlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
 
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
 
PE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiesPE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and properties
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Verification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptxVerification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptx
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
kiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadkiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal load
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best ServiceTamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
 
Computer Networks Basics of Network Devices
Computer Networks  Basics of Network DevicesComputer Networks  Basics of Network Devices
Computer Networks Basics of Network Devices
 

Visual speech to text conversion applicable to telephone communication

  • 1. Visual-speech to text conversion applicable to telephone communication for deaf individuals 30TH APRIL 2013
  • 2. Visual-speech to text conversion applicable to telephone communication for deaf individuals INTRODUCTION  Lip-reading technique,  speech can be understood by interpreting movements of lips, face and tongue.  not one-to-one  Impossible to distinguish phonemes using visual information alone
  • 3. Visual-speech to text conversion applicable to telephone communication for deaf individuals  the Cued Speech system  developed by Cornett  contains two components: the hand shape the hand position relative to the face.  Hand shapes- consonant phonemes  hand positions -vowel phonemes.  improves speech perception to a large extent
  • 4. Visual-speech to text conversion applicable to telephone communication for deaf individuals the Cued Speech system
  • 5. Visual-speech to text conversion applicable to telephone communication for deaf individuals AIM OF NEW SYSTEM  To investigate the designing of a system able to automatically recognize Cued Speech and convert it to text.  Possible for deaf or speech-impaired individuals to communicate with each other and also with normal-hearing persons  Using gestures  captured by devices equipped by a camera
  • 6. Visual-speech to text conversion applicable to telephone communication for deaf individuals METHODS  Corpus, feature extraction, and statistical modeling  The speakers’ lips were painted blue, and color marks were placed on the speakers’ fingers. .  The data were derived from a video recording of the cuers pronouncing and coding in Cued Speech  landmarks with different colors were placed on the fingers
  • 7. Visual-speech to text conversion applicable to telephone communication for deaf individuals  faster and more accurate image processing stage.  The audio part of the video recording was synchronized with the image.  An automatic image processing method was appliedli pt ow idththe ( Av)i,d eo  lip aperture (B),  lip area (S).  pinching of the upper lip (Bsup)  lower (Binf) lip
  • 8. Visual-speech to text conversion applicable to telephone communication for deaf individuals  Concatenative feature fusion  Tracks and extracts the xy coordinates each time frame,  uses those values as features in the HMM modeling.  uses the concatenation of the synchronous lip shape and hand features as the joint feature vector given by,
  • 9. Visual-speech to text conversion applicable to telephone communication for deaf individuals Joint lip hand feature vector, Lip shape feature vector, Hand feature vector, Dimensionality of the joint feature vector  Parameters used for lip shape modeling.
  • 10. Visual-speech to text conversion applicable to telephone communication for deaf individuals RESULTS  Isolated word recognition 1. Recognition in normal-hearing subject
  • 11. Visual-speech to text conversion applicable to telephone communication for deaf individuals 2. Recognition in deaf subject
  • 12. Visual-speech to text conversion applicable to telephone communication for deaf individuals 3. Multi-speaker isolated word recognition:  investigate whether it is possible to train speaker-independent HMMs for Cued Speech recognition.  The training data consisted of 750 words from the normal-hearing subject, and 750 words from the deaf subject.  For testing 700 words from normal-hearing subject and 700 words from the deaf subject were used, respectively.  Each state was modeled with a mixture of 4 Gaussian distributions.  For lip shape and hand shape integration, concatenative feature fusion was used.
  • 13. Visual-speech to text conversion applicable to telephone communication for deaf individuals
  • 14. Visual-speech to text conversion applicable to telephone communication for deaf individuals 4. Continuous phoneme recognition  Phoneme correct for continuous phoneme word recognition in the case of a normal-hearing subject.
  • 15. Visual-speech to text conversion applicable to telephone communication for deaf individuals Phoneme correct for continuous phoneme word recognition in the case of a deaf subject.
  • 16. Visual-speech to text conversion applicable to telephone communication for deaf individuals CONCLUSION  Hand shapes and lips shape were integrated using concatenative feature fusion and HMM-based automatic recognition was conducted.  For continuous phoneme recognition, a 86% phoneme correct was achieved for the normal-hearing cuer and a 82.7% phoneme correct for the dead cuer were achieved, respectively.  Speech in both normal-hearing and deaf subjects were also conducted obtaining a 94.9% and a 89% accuracy, respectively. .
  • 17. Visual-speech to text conversion applicable to telephone communication for deaf individuals CONCLUSION  A multi-speaker experiment using data from both normal-hearing and deaf subject showed a 89.6% word accuracy, on average.  This result indicates that training speaker-independent HMMs for Cued Speech using a large number of subjects should not face particular difficulties
  • 18. Visual-speech to text conversion applicable to telephone communication for deaf individuals REFERENCES  G. Potamianos, C. Neti, G. Gravier, A. Garg, and A.W. Senior, “recent Advances in the automatic recognition of audiovisual speech,” in Proceedings of the IEEE, vol. 91, issue 9, pp. 1306–1326, 2003.  S. Nakamura, K. Kumatani, and S. Tamura, “Multi-modal temporal asynchronicity modeling by product hmms for robust audio-visual speech recognition,” in Proceedings of Fourth IEEE International Conference on Multimodal Interfaces (ICMI’02), p. 305, 2002.  R. O. Cornett, “Cued speech,” American Annals of the Deaf, vol. 112, pp. 3–13, 1967.  J. Leybaert, “Phonology acquired through the eyes and spelling in deaf children,”Journal of Experimental Child Psychology, vol. 75, pp. 291– 318, 2000
  • 20. Visual-speech to text conversion applicable to telephone communication for deaf individuals ANY QUESTION S?