SlideShare a Scribd company logo
1 of 4
Download to read offline
IOSR Journal of Computer Engineering (IOSRJCE)
ISSN: 2278-0661 Volume 3, Issue 4 (July-Aug. 2012), PP 07-10
www.iosrjournals.org
www.iosrjournals.org 7 | Page
Substitution Error Analysis for Improving the Word Accuracy in
Telugu Language Automatic Speech Recognition System
M. Nagamani, P. N. Girija
University of Hyderabad, Hyderabad.
Abstract: Use of natural languages for the computer communication is one of the current research topics. The
speech recognition plays a central role in communicating the computer by means of speech. Speech Recognition
is the process of converting analog signal into the symbolic gesture form known as text. An Automatic Speech
Recognition (ASR) is the process of converting input speech signal given to the system, into text. This input
signal is any human spoken word. Though the last Four decades, research is going on bringing the system
perception near to the human being, in recognition word accuracy. Many domains play a role in degrading the
of system performance in which language and its pronunciation variants caused by different reasons like accent,
gender, dialects of the speech are in general factors. In specific, system environment, articulatory phonetics,
acoustic phonetics, (acoustic system), lexical model (pronunciation dictionary) and language model including
the mood of the speaker. Acoustic modeling can be done in most of the cases by using signal processing, where
as lexicon model require a sufficient human intervention as, it is based on the language and human perception.
Once sufficient primary data is built then automatic processing can be done using different modeling techniques
to derive more data for proper training of the speech recognition system. Here the linguistics are play a major
role in building the robust system.
In lexical model design language plays a major role. Each language has its own rhythm in speech and
language aspects. Based on language rhythm worldly languages are classified as stress timed and syllable timed
rhythm. Most of ASR systems use the lexical model that is built for stressed timed languages. All Indian
languages are syllable timed rhythmic languages one such is Telugu Language. In this paper analysis of the
decoding results of ASR system using two different lexical model environments. One is CMU lexicon which is
based on stress timed language as the tool is used American accent English phonemes and another UOH lexicon
which is handcraft lexicon for Telugu language which is also a syllable timed language.. Further studied are the
gender and accents (pronunciation variant factors) effecting the Substitutional errors in ASR system. The
confusion matrix for vowel and consonants alone analyzed for both cases and also for isolated word recognition
where the confusion matrix gives the most common phonemes substituted. In all the cases the UOH lexicon
based ASR system gives the improvement of word accuracy around 20 to 30%.
Speech is a process used to communicate from a speaker to listener. Pronunciation relates to speech,
and humans have an intuitive feel for pronunciation. For instance, people chuckle when words are
mispronounced and notice when foreign accent colors a speaker’s pronunciations.[1]. If the words were always
pronounced in the same way, ASR would be relatively easy. However, for various reasons words are almost
always pronounced differently and varied from one speaker to another and from once situation to another. The
variability is due to co-articulation, reasonal accents, speaking rate, speaking style etc.
I. Language need in ASR study:
There are 6912 languages are there the purpose of communication between human being around the
world. The need of the language is to computerization of many human need domains, Ubiquitous information
access, phone-based information access, mobile devices which demand speech as modality to interact,
globalization in cross-cultural human-human interaction, multilingual communities, Humanitarian needs like
disaster, healthcare, military applications to communicate with local people and main focus on Human machine
interaction where people expect speech-driven applications in their mother tongue will specifically demand the
speech recognition research to work in regional language . such one application here we like to develop a
speech recognition system working in Telugu Language.
II. Telugu Language and ASR
According to the 1997 senses total 69,600,000 population in India who are speaking Telugu.
Population tatal all countries around 69,758, 890 are the Telugu language speaking people. The language
coverage regions are Andhra Pradesh and neighboring states. Also in Bahrain, Canada, Fiji,
Malaysia(Peninsular), Mauritius, Singapore, South Africa, united Arab Emirates and United States. This Telugu
language also called with alternate names as Andhra, Gentoo, Tailangi, Telangire, Telegu, Telgi, Tengu,
Substitution error analysis for improving the word Accuracy in Telugu Language Automatic Speech
www.iosrjournals.org 8 | Page
Terangi, Tolangan. The dialects of this language are Berad, Dasari, Dommara, Golari, Kamathi, Komtao,
Konda-Reddi, Salewari, Telangana, Telugu, Vadaga, Srikakula, Vishakhapatnam, East Godaveri, Rayalseema,
Nellore, Guntur, Vadari, Yanadi. This also classified as Dravidian, South-central and Telugu. There are
5,000,000 second language speakers. Fully developed Bible from 1854-2002. That is importance of Telugu
language.
Telugu is one of the oldest language in the world. The old form of Telugu dates back to 1000BC.
Approximately 74 million people speak Telugu as their native language. It is the third most widely spoken
language in India. It is a language primary spoken in south India. It belongs to Dravidian group of language.
Including non native speakers there are nearly 90 million people speaking Telugu in Andhra Pradesh alone. Also
Telugu speakers available in USA, UK, Malaysia, Fiji Islands, Mauritius and South Africa. Nature of Telugu
language is Syllabic as similar to the languages of India. Each symbol in Telugu script represent a complete
syllable. There is a very little scope for confusion and speaking problems contest in Telugu unthinkable, since
everyone will score 100%. In that sense it is a WYSIWYG script. This form of script is considered to be most
scientific by the linguists. This syllabic script has been achieved by the use of a set of basic symbols, a set of
modifier symbols and rules for modification. Officially there are eighteen Vowels, thirty six consonants and
three dual symbols. Only thirteen vowels, thirty five consonants are in common usage. Telugu script has the
capacity to represent almost the entire phonetic spectrum of all Indian (and most world) languages. Telugu
stands as one of the best script in the world while maintaining an extensive sound base. The Telugu basic
symbol pronunciation table along with the Romanization of Telugu sounds i.e RIT form of Telugu
pronunciation using Telugu Lipi standards is shown in Table 1. Telugu Vuccharana pattika(Pronunciation
Table)
III. Effect of speaker variant speech:
The performance of an ASR system tested by different speakers’ variant tends to markedly degrade as
gender and non-native speakers make pronunciation variants as compared to native speakers. In this section the
effect of non-native and gender speech on the performance of an ASR system constructed from native speech is
discussed. In particular, Telugu is selected the target language for this paper and non-native speaker is from
Hindi. To complete this work, a baseline Telugu ASR system is first constructed with CMU phonemes mapped
with the Telugu Phonemes are constructed by considering all 51 akshara’s of basic Telugu glyphs, and its
performance is then evaluated using the Telugu words uttered by the Hindi speakers and Telugu speakers with
different region people.
Baseline Telugu ASR
A set of Telugu words are considered(Nagamani, 2010) Telugu wordlist of 665 words is used as the
training set for the native-Telugu ASR system. Here after these Telugu word list called UoH word list0. UWL0
is a 665 words of 10 speakers data with regional and nativity variant speech covering syllables of Telugu
language word list.
Substitution error analysis for improving the word Accuracy in Telugu Language Automatic Speech
www.iosrjournals.org 9 | Page
Phonetically rich text corpora
To cover all the symbols or akshara are considered for define the phone list for the Telugu Language.
Total 51 akshara are considered hence base form it required 51 symbols of alphabets to represent these phones.
First step all the CMU based phones are tried to represent the 51 aksharas. The selected symbols are the derived
phone list from the CMU phone list. Human languages are mapped to the Computer language by means of
ASCII code. CMU phone list represent 39 phones which are not sufficient and also they are build based on
American English pronunciation.
Phonetically rich data can be selected from a medium corpus of text. A set of such words were derived
by processing Telugu on-line tutors available in internet and also from primary school syllables of Telugu
Language subject to build corpus[3-1 ]. This process involved translating the Telugu word corpus printed in
English(RIT) form to generating set of words that are phonetically rich(i.e., the words contains most phonemes
of the language). Here we considered on text based words around 600 which covered most syllables of
morphemes and also speaker variability purpose male and female variation study.
Analysis of the pronunciation variability
Pronunciation lexicon for Telugu Speech Recognition system using UOH defined Phonelist. Refine the
pronunciation lexicon by adding new pronunciation for the same utterance based on acoustic signal, add or
deleted phonemes in the phonelist, refine the acoustic signal or wave file for single speaker.
Vowels:
Substitution error analysis for improving the word Accuracy in Telugu Language Automatic Speech
www.iosrjournals.org 10 | Page
Consonents:
Table of Confusion pair words in error analysis:
IV. Result analysis
The ASR system recognition is performed by taking the Vowel sounds, consonant sounds and Isolated
words of different size of letters present in it. Each experiment is carried out by correcting the Substitutional
errors. Substitution errors are caused by Insertion and deletion errors will be corrected by adjusting the phone
set and acoustic signal and finally reached to 100% word accuracy in individual and combination of Vowel and
consonant test data. The confusion matrix for vowel and consonants shown in Figure 4.1 and Figure 4.2 will
give the substitution of phones list in lower and upper triangles shown in the matrix. The diagonal will
represent the recognition accuracy.
1.1.1 Confusion matrix for Vowels
Figure 4.1. confustion matrix drawn for Vowels in 10 experiments
1.1.1.1 Confustion matrix for Consonents
Figure 4.2 Confusion matrix for Telugu Consonents.
References:
[1] “Automatic Speech Understanding” http://ewh.ieee.org/r10/bombay//news6/AutoSpeechRecog/ASR.htm
[2] http://www.research.ibm.com/thinkresearch/pages/2002/20020918_speech.shtml
[2] http://cslu.cse.ogi.edu/asr/
[3] http://www-4.ibm.com/software/speech/
[4] A. Gallardo-Antolín*, J. Ferreiros, J. Macías-Guarasa, R. de Córdoba and J. M. Pardo “Incorporating multiple-hmm acoustic
modeling in a Modular large vocabulary speech recognition system in Telephone environment” ICSIP2000.
[5] Yoo Rhee Oh, Jae Sam Yoon, Hong Kook Kim “Acoustic model adaptation based on pronunciation variability analysis for non-
native speech recognition” Speech Communication, Volume 49, Issue 1, January 2007, Pages 59-70

More Related Content

What's hot

2014-Article 7 ISC
2014-Article 7 ISC2014-Article 7 ISC
2014-Article 7 ISCNatashaPDA
 
2014-Article 10 ISC
2014-Article 10 ISC2014-Article 10 ISC
2014-Article 10 ISCNatashaPDA
 
CODE SWITCHING IN EFL CLASSROOM: TEACHER'S ATTITUDE
CODE SWITCHING IN EFL CLASSROOM: TEACHER'S ATTITUDECODE SWITCHING IN EFL CLASSROOM: TEACHER'S ATTITUDE
CODE SWITCHING IN EFL CLASSROOM: TEACHER'S ATTITUDEJoy Avelino
 
Innovation Of Arranged Input In Foreign Language Acquisition At Indonesian Pe...
Innovation Of Arranged Input In Foreign Language Acquisition At Indonesian Pe...Innovation Of Arranged Input In Foreign Language Acquisition At Indonesian Pe...
Innovation Of Arranged Input In Foreign Language Acquisition At Indonesian Pe...SubmissionResearchpa
 
Speech Ability of Grade 10 Students
Speech Ability of Grade 10 StudentsSpeech Ability of Grade 10 Students
Speech Ability of Grade 10 Studentsijtsrd
 
ELTLT CONFERENCE
ELTLT CONFERENCEELTLT CONFERENCE
ELTLT CONFERENCERidhaIlma1
 
Applied Linguistics & Language Teaching
               Applied Linguistics & Language Teaching               Applied Linguistics & Language Teaching
Applied Linguistics & Language TeachingFarhad Mohammad
 
communicative languag
communicative languagcommunicative languag
communicative languagkhalafi
 
GENDER DIFFERENCES IN THE APPROPRIATENESS OF ADVICE-GIVING AMONG IRANIAN EFL ...
GENDER DIFFERENCES IN THE APPROPRIATENESS OF ADVICE-GIVING AMONG IRANIAN EFL ...GENDER DIFFERENCES IN THE APPROPRIATENESS OF ADVICE-GIVING AMONG IRANIAN EFL ...
GENDER DIFFERENCES IN THE APPROPRIATENESS OF ADVICE-GIVING AMONG IRANIAN EFL ...Yasser Al-Shboul
 
The Appropriateness in Advice-Giving From a Cross-Cultural Perspective
The Appropriateness in Advice-Giving From a Cross-Cultural PerspectiveThe Appropriateness in Advice-Giving From a Cross-Cultural Perspective
The Appropriateness in Advice-Giving From a Cross-Cultural PerspectiveYasser Al-Shboul
 
Aptitude As In Individual Difference In Sla 2
Aptitude As In Individual Difference In Sla 2Aptitude As In Individual Difference In Sla 2
Aptitude As In Individual Difference In Sla 2Dr. Cupid Lucid
 
Language needs of computer learners
Language needs of computer learnersLanguage needs of computer learners
Language needs of computer learnersAlexander Decker
 
Presentación2.ppt input and interaction
Presentación2.ppt input and interactionPresentación2.ppt input and interaction
Presentación2.ppt input and interactionJoel Acosta
 
Communicative Language Teaching
Communicative Language TeachingCommunicative Language Teaching
Communicative Language TeachingRajabul Gufron
 

What's hot (20)

2014-Article 7 ISC
2014-Article 7 ISC2014-Article 7 ISC
2014-Article 7 ISC
 
2014-Article 10 ISC
2014-Article 10 ISC2014-Article 10 ISC
2014-Article 10 ISC
 
CODE SWITCHING IN EFL CLASSROOM: TEACHER'S ATTITUDE
CODE SWITCHING IN EFL CLASSROOM: TEACHER'S ATTITUDECODE SWITCHING IN EFL CLASSROOM: TEACHER'S ATTITUDE
CODE SWITCHING IN EFL CLASSROOM: TEACHER'S ATTITUDE
 
Innovation Of Arranged Input In Foreign Language Acquisition At Indonesian Pe...
Innovation Of Arranged Input In Foreign Language Acquisition At Indonesian Pe...Innovation Of Arranged Input In Foreign Language Acquisition At Indonesian Pe...
Innovation Of Arranged Input In Foreign Language Acquisition At Indonesian Pe...
 
Investigating the Integration of Culture Teaching in Foreign Language Classro...
Investigating the Integration of Culture Teaching in Foreign Language Classro...Investigating the Integration of Culture Teaching in Foreign Language Classro...
Investigating the Integration of Culture Teaching in Foreign Language Classro...
 
Speech Ability of Grade 10 Students
Speech Ability of Grade 10 StudentsSpeech Ability of Grade 10 Students
Speech Ability of Grade 10 Students
 
ELTLT CONFERENCE
ELTLT CONFERENCEELTLT CONFERENCE
ELTLT CONFERENCE
 
Aptitude
AptitudeAptitude
Aptitude
 
Investigating the Integration of Culture Teaching in Foreign Language Classro...
Investigating the Integration of Culture Teaching in Foreign Language Classro...Investigating the Integration of Culture Teaching in Foreign Language Classro...
Investigating the Integration of Culture Teaching in Foreign Language Classro...
 
Applied Linguistics & Language Teaching
               Applied Linguistics & Language Teaching               Applied Linguistics & Language Teaching
Applied Linguistics & Language Teaching
 
communicative languag
communicative languagcommunicative languag
communicative languag
 
Embarking The Six Thinking Hats in EFL Students’ Dissertation Writing at Said...
Embarking The Six Thinking Hats in EFL Students’ Dissertation Writing at Said...Embarking The Six Thinking Hats in EFL Students’ Dissertation Writing at Said...
Embarking The Six Thinking Hats in EFL Students’ Dissertation Writing at Said...
 
GENDER DIFFERENCES IN THE APPROPRIATENESS OF ADVICE-GIVING AMONG IRANIAN EFL ...
GENDER DIFFERENCES IN THE APPROPRIATENESS OF ADVICE-GIVING AMONG IRANIAN EFL ...GENDER DIFFERENCES IN THE APPROPRIATENESS OF ADVICE-GIVING AMONG IRANIAN EFL ...
GENDER DIFFERENCES IN THE APPROPRIATENESS OF ADVICE-GIVING AMONG IRANIAN EFL ...
 
PSYCHOLOGY AND LANGUAGE LEARNING
PSYCHOLOGY AND LANGUAGE LEARNINGPSYCHOLOGY AND LANGUAGE LEARNING
PSYCHOLOGY AND LANGUAGE LEARNING
 
The Appropriateness in Advice-Giving From a Cross-Cultural Perspective
The Appropriateness in Advice-Giving From a Cross-Cultural PerspectiveThe Appropriateness in Advice-Giving From a Cross-Cultural Perspective
The Appropriateness in Advice-Giving From a Cross-Cultural Perspective
 
Alavi glossary m
Alavi glossary mAlavi glossary m
Alavi glossary m
 
Aptitude As In Individual Difference In Sla 2
Aptitude As In Individual Difference In Sla 2Aptitude As In Individual Difference In Sla 2
Aptitude As In Individual Difference In Sla 2
 
Language needs of computer learners
Language needs of computer learnersLanguage needs of computer learners
Language needs of computer learners
 
Presentación2.ppt input and interaction
Presentación2.ppt input and interactionPresentación2.ppt input and interaction
Presentación2.ppt input and interaction
 
Communicative Language Teaching
Communicative Language TeachingCommunicative Language Teaching
Communicative Language Teaching
 

Viewers also liked

Enhanced Clustering Algorithm for Processing Online Data
Enhanced Clustering Algorithm for Processing Online DataEnhanced Clustering Algorithm for Processing Online Data
Enhanced Clustering Algorithm for Processing Online DataIOSR Journals
 
Improving search time for contentment based image retrieval via, LSH, MTRee, ...
Improving search time for contentment based image retrieval via, LSH, MTRee, ...Improving search time for contentment based image retrieval via, LSH, MTRee, ...
Improving search time for contentment based image retrieval via, LSH, MTRee, ...IOSR Journals
 
The prediction of moisture through the use of neural networks MLP type
The prediction of moisture through the use of neural networks MLP typeThe prediction of moisture through the use of neural networks MLP type
The prediction of moisture through the use of neural networks MLP typeIOSR Journals
 
Tools for Ontology Building from Texts: Analysis and Improvement of the Resul...
Tools for Ontology Building from Texts: Analysis and Improvement of the Resul...Tools for Ontology Building from Texts: Analysis and Improvement of the Resul...
Tools for Ontology Building from Texts: Analysis and Improvement of the Resul...IOSR Journals
 
Test preparation for the fifth grade
Test preparation for the fifth grade Test preparation for the fifth grade
Test preparation for the fifth grade Aleksandra Nikolic
 
Using Social Media to Promote your Toastmasters Club
Using Social Media to Promote your Toastmasters ClubUsing Social Media to Promote your Toastmasters Club
Using Social Media to Promote your Toastmasters ClubStar Bradshaw, UXC
 
Text Extraction of Colour Images using Mathematical Morphology & HAAR Transform
Text Extraction of Colour Images using Mathematical Morphology & HAAR TransformText Extraction of Colour Images using Mathematical Morphology & HAAR Transform
Text Extraction of Colour Images using Mathematical Morphology & HAAR TransformIOSR Journals
 

Viewers also liked (20)

H0944649
H0944649H0944649
H0944649
 
Enhanced Clustering Algorithm for Processing Online Data
Enhanced Clustering Algorithm for Processing Online DataEnhanced Clustering Algorithm for Processing Online Data
Enhanced Clustering Algorithm for Processing Online Data
 
Improving search time for contentment based image retrieval via, LSH, MTRee, ...
Improving search time for contentment based image retrieval via, LSH, MTRee, ...Improving search time for contentment based image retrieval via, LSH, MTRee, ...
Improving search time for contentment based image retrieval via, LSH, MTRee, ...
 
The prediction of moisture through the use of neural networks MLP type
The prediction of moisture through the use of neural networks MLP typeThe prediction of moisture through the use of neural networks MLP type
The prediction of moisture through the use of neural networks MLP type
 
Manual maestro construcor
Manual maestro construcorManual maestro construcor
Manual maestro construcor
 
Tools for Ontology Building from Texts: Analysis and Improvement of the Resul...
Tools for Ontology Building from Texts: Analysis and Improvement of the Resul...Tools for Ontology Building from Texts: Analysis and Improvement of the Resul...
Tools for Ontology Building from Texts: Analysis and Improvement of the Resul...
 
Test preparation for the fifth grade
Test preparation for the fifth grade Test preparation for the fifth grade
Test preparation for the fifth grade
 
A0150106
A0150106A0150106
A0150106
 
I0935053
I0935053I0935053
I0935053
 
B0150711
B0150711B0150711
B0150711
 
A0610105
A0610105A0610105
A0610105
 
H0154448
H0154448H0154448
H0154448
 
Bahasa indonesia
Bahasa indonesiaBahasa indonesia
Bahasa indonesia
 
Using Social Media to Promote your Toastmasters Club
Using Social Media to Promote your Toastmasters ClubUsing Social Media to Promote your Toastmasters Club
Using Social Media to Promote your Toastmasters Club
 
Thanhham
ThanhhamThanhham
Thanhham
 
Text Extraction of Colour Images using Mathematical Morphology & HAAR Transform
Text Extraction of Colour Images using Mathematical Morphology & HAAR TransformText Extraction of Colour Images using Mathematical Morphology & HAAR Transform
Text Extraction of Colour Images using Mathematical Morphology & HAAR Transform
 
G0364045
G0364045G0364045
G0364045
 
R094108112
R094108112R094108112
R094108112
 
C0951520
C0951520C0951520
C0951520
 
Q6
Q6Q6
Q6
 

Similar to Improving word accuracy of Telugu ASR with substitution error analysis

Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorWaqas Tariq
 
Investigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and DogriInvestigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and Dogrikevig
 
Transliteration by orthography or phonology for hindi and marathi to english ...
Transliteration by orthography or phonology for hindi and marathi to english ...Transliteration by orthography or phonology for hindi and marathi to english ...
Transliteration by orthography or phonology for hindi and marathi to english ...ijnlc
 
TRANSLITERATION BY ORTHOGRAPHY OR PHONOLOGY FOR HINDI AND MARATHI TO ENGLISH:...
TRANSLITERATION BY ORTHOGRAPHY OR PHONOLOGY FOR HINDI AND MARATHI TO ENGLISH:...TRANSLITERATION BY ORTHOGRAPHY OR PHONOLOGY FOR HINDI AND MARATHI TO ENGLISH:...
TRANSLITERATION BY ORTHOGRAPHY OR PHONOLOGY FOR HINDI AND MARATHI TO ENGLISH:...kevig
 
Hidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala languageHidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala languageijnlc
 
Automatic Speech Recognition of Malayalam Language Nasal Class Phonemes
Automatic Speech Recognition of Malayalam Language Nasal Class PhonemesAutomatic Speech Recognition of Malayalam Language Nasal Class Phonemes
Automatic Speech Recognition of Malayalam Language Nasal Class PhonemesEditor IJCATR
 
An Unsupervised Approach to Develop Stemmer
An Unsupervised Approach to Develop StemmerAn Unsupervised Approach to Develop Stemmer
An Unsupervised Approach to Develop Stemmerkevig
 
Paper id 25201466
Paper id 25201466Paper id 25201466
Paper id 25201466IJRAT
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
Kannada Phonemes to Speech Dictionary: Statistical Approach
Kannada Phonemes to Speech Dictionary: Statistical ApproachKannada Phonemes to Speech Dictionary: Statistical Approach
Kannada Phonemes to Speech Dictionary: Statistical ApproachIJERA Editor
 
Phonetic Dictionary for Natural Language Processing: Kannada
Phonetic Dictionary for Natural Language Processing: KannadaPhonetic Dictionary for Natural Language Processing: Kannada
Phonetic Dictionary for Natural Language Processing: KannadaIJERA Editor
 
B047006011
B047006011B047006011
B047006011inventy
 
B047006011
B047006011B047006011
B047006011inventy
 
An implementation of apertium based assamese morphological analyzer
An implementation of apertium based assamese morphological analyzerAn implementation of apertium based assamese morphological analyzer
An implementation of apertium based assamese morphological analyzerijnlc
 
Marathi Text-To-Speech Synthesis using Natural Language Processing
Marathi Text-To-Speech Synthesis using Natural Language ProcessingMarathi Text-To-Speech Synthesis using Natural Language Processing
Marathi Text-To-Speech Synthesis using Natural Language Processingiosrjce
 
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...sipij
 
Role of language engineering to preserve endangered languages
Role of language engineering to preserve endangered languagesRole of language engineering to preserve endangered languages
Role of language engineering to preserve endangered languagesDr. Amit Kumar Jha
 

Similar to Improving word accuracy of Telugu ASR with substitution error analysis (20)

B0340710
B0340710B0340710
B0340710
 
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
 
Investigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and DogriInvestigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and Dogri
 
Transliteration by orthography or phonology for hindi and marathi to english ...
Transliteration by orthography or phonology for hindi and marathi to english ...Transliteration by orthography or phonology for hindi and marathi to english ...
Transliteration by orthography or phonology for hindi and marathi to english ...
 
TRANSLITERATION BY ORTHOGRAPHY OR PHONOLOGY FOR HINDI AND MARATHI TO ENGLISH:...
TRANSLITERATION BY ORTHOGRAPHY OR PHONOLOGY FOR HINDI AND MARATHI TO ENGLISH:...TRANSLITERATION BY ORTHOGRAPHY OR PHONOLOGY FOR HINDI AND MARATHI TO ENGLISH:...
TRANSLITERATION BY ORTHOGRAPHY OR PHONOLOGY FOR HINDI AND MARATHI TO ENGLISH:...
 
Hidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala languageHidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala language
 
Automatic Speech Recognition of Malayalam Language Nasal Class Phonemes
Automatic Speech Recognition of Malayalam Language Nasal Class PhonemesAutomatic Speech Recognition of Malayalam Language Nasal Class Phonemes
Automatic Speech Recognition of Malayalam Language Nasal Class Phonemes
 
An Unsupervised Approach to Develop Stemmer
An Unsupervised Approach to Develop StemmerAn Unsupervised Approach to Develop Stemmer
An Unsupervised Approach to Develop Stemmer
 
Paper id 25201466
Paper id 25201466Paper id 25201466
Paper id 25201466
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
 
Kannada Phonemes to Speech Dictionary: Statistical Approach
Kannada Phonemes to Speech Dictionary: Statistical ApproachKannada Phonemes to Speech Dictionary: Statistical Approach
Kannada Phonemes to Speech Dictionary: Statistical Approach
 
Phonetic Dictionary for Natural Language Processing: Kannada
Phonetic Dictionary for Natural Language Processing: KannadaPhonetic Dictionary for Natural Language Processing: Kannada
Phonetic Dictionary for Natural Language Processing: Kannada
 
B047006011
B047006011B047006011
B047006011
 
B047006011
B047006011B047006011
B047006011
 
ReseachPaper
ReseachPaperReseachPaper
ReseachPaper
 
An implementation of apertium based assamese morphological analyzer
An implementation of apertium based assamese morphological analyzerAn implementation of apertium based assamese morphological analyzer
An implementation of apertium based assamese morphological analyzer
 
Marathi Text-To-Speech Synthesis using Natural Language Processing
Marathi Text-To-Speech Synthesis using Natural Language ProcessingMarathi Text-To-Speech Synthesis using Natural Language Processing
Marathi Text-To-Speech Synthesis using Natural Language Processing
 
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...
 
551 466-472
551 466-472551 466-472
551 466-472
 
Role of language engineering to preserve endangered languages
Role of language engineering to preserve endangered languagesRole of language engineering to preserve endangered languages
Role of language engineering to preserve endangered languages
 

More from IOSR Journals (20)

A011140104
A011140104A011140104
A011140104
 
M0111397100
M0111397100M0111397100
M0111397100
 
L011138596
L011138596L011138596
L011138596
 
K011138084
K011138084K011138084
K011138084
 
J011137479
J011137479J011137479
J011137479
 
I011136673
I011136673I011136673
I011136673
 
G011134454
G011134454G011134454
G011134454
 
H011135565
H011135565H011135565
H011135565
 
F011134043
F011134043F011134043
F011134043
 
E011133639
E011133639E011133639
E011133639
 
D011132635
D011132635D011132635
D011132635
 
C011131925
C011131925C011131925
C011131925
 
B011130918
B011130918B011130918
B011130918
 
A011130108
A011130108A011130108
A011130108
 
I011125160
I011125160I011125160
I011125160
 
H011124050
H011124050H011124050
H011124050
 
G011123539
G011123539G011123539
G011123539
 
F011123134
F011123134F011123134
F011123134
 
E011122530
E011122530E011122530
E011122530
 
D011121524
D011121524D011121524
D011121524
 

Recently uploaded

Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkPixlogix Infotech
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024TopCSSGallery
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 

Recently uploaded (20)

Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App Framework
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 

Improving word accuracy of Telugu ASR with substitution error analysis

  • 1. IOSR Journal of Computer Engineering (IOSRJCE) ISSN: 2278-0661 Volume 3, Issue 4 (July-Aug. 2012), PP 07-10 www.iosrjournals.org www.iosrjournals.org 7 | Page Substitution Error Analysis for Improving the Word Accuracy in Telugu Language Automatic Speech Recognition System M. Nagamani, P. N. Girija University of Hyderabad, Hyderabad. Abstract: Use of natural languages for the computer communication is one of the current research topics. The speech recognition plays a central role in communicating the computer by means of speech. Speech Recognition is the process of converting analog signal into the symbolic gesture form known as text. An Automatic Speech Recognition (ASR) is the process of converting input speech signal given to the system, into text. This input signal is any human spoken word. Though the last Four decades, research is going on bringing the system perception near to the human being, in recognition word accuracy. Many domains play a role in degrading the of system performance in which language and its pronunciation variants caused by different reasons like accent, gender, dialects of the speech are in general factors. In specific, system environment, articulatory phonetics, acoustic phonetics, (acoustic system), lexical model (pronunciation dictionary) and language model including the mood of the speaker. Acoustic modeling can be done in most of the cases by using signal processing, where as lexicon model require a sufficient human intervention as, it is based on the language and human perception. Once sufficient primary data is built then automatic processing can be done using different modeling techniques to derive more data for proper training of the speech recognition system. Here the linguistics are play a major role in building the robust system. In lexical model design language plays a major role. Each language has its own rhythm in speech and language aspects. Based on language rhythm worldly languages are classified as stress timed and syllable timed rhythm. Most of ASR systems use the lexical model that is built for stressed timed languages. All Indian languages are syllable timed rhythmic languages one such is Telugu Language. In this paper analysis of the decoding results of ASR system using two different lexical model environments. One is CMU lexicon which is based on stress timed language as the tool is used American accent English phonemes and another UOH lexicon which is handcraft lexicon for Telugu language which is also a syllable timed language.. Further studied are the gender and accents (pronunciation variant factors) effecting the Substitutional errors in ASR system. The confusion matrix for vowel and consonants alone analyzed for both cases and also for isolated word recognition where the confusion matrix gives the most common phonemes substituted. In all the cases the UOH lexicon based ASR system gives the improvement of word accuracy around 20 to 30%. Speech is a process used to communicate from a speaker to listener. Pronunciation relates to speech, and humans have an intuitive feel for pronunciation. For instance, people chuckle when words are mispronounced and notice when foreign accent colors a speaker’s pronunciations.[1]. If the words were always pronounced in the same way, ASR would be relatively easy. However, for various reasons words are almost always pronounced differently and varied from one speaker to another and from once situation to another. The variability is due to co-articulation, reasonal accents, speaking rate, speaking style etc. I. Language need in ASR study: There are 6912 languages are there the purpose of communication between human being around the world. The need of the language is to computerization of many human need domains, Ubiquitous information access, phone-based information access, mobile devices which demand speech as modality to interact, globalization in cross-cultural human-human interaction, multilingual communities, Humanitarian needs like disaster, healthcare, military applications to communicate with local people and main focus on Human machine interaction where people expect speech-driven applications in their mother tongue will specifically demand the speech recognition research to work in regional language . such one application here we like to develop a speech recognition system working in Telugu Language. II. Telugu Language and ASR According to the 1997 senses total 69,600,000 population in India who are speaking Telugu. Population tatal all countries around 69,758, 890 are the Telugu language speaking people. The language coverage regions are Andhra Pradesh and neighboring states. Also in Bahrain, Canada, Fiji, Malaysia(Peninsular), Mauritius, Singapore, South Africa, united Arab Emirates and United States. This Telugu language also called with alternate names as Andhra, Gentoo, Tailangi, Telangire, Telegu, Telgi, Tengu,
  • 2. Substitution error analysis for improving the word Accuracy in Telugu Language Automatic Speech www.iosrjournals.org 8 | Page Terangi, Tolangan. The dialects of this language are Berad, Dasari, Dommara, Golari, Kamathi, Komtao, Konda-Reddi, Salewari, Telangana, Telugu, Vadaga, Srikakula, Vishakhapatnam, East Godaveri, Rayalseema, Nellore, Guntur, Vadari, Yanadi. This also classified as Dravidian, South-central and Telugu. There are 5,000,000 second language speakers. Fully developed Bible from 1854-2002. That is importance of Telugu language. Telugu is one of the oldest language in the world. The old form of Telugu dates back to 1000BC. Approximately 74 million people speak Telugu as their native language. It is the third most widely spoken language in India. It is a language primary spoken in south India. It belongs to Dravidian group of language. Including non native speakers there are nearly 90 million people speaking Telugu in Andhra Pradesh alone. Also Telugu speakers available in USA, UK, Malaysia, Fiji Islands, Mauritius and South Africa. Nature of Telugu language is Syllabic as similar to the languages of India. Each symbol in Telugu script represent a complete syllable. There is a very little scope for confusion and speaking problems contest in Telugu unthinkable, since everyone will score 100%. In that sense it is a WYSIWYG script. This form of script is considered to be most scientific by the linguists. This syllabic script has been achieved by the use of a set of basic symbols, a set of modifier symbols and rules for modification. Officially there are eighteen Vowels, thirty six consonants and three dual symbols. Only thirteen vowels, thirty five consonants are in common usage. Telugu script has the capacity to represent almost the entire phonetic spectrum of all Indian (and most world) languages. Telugu stands as one of the best script in the world while maintaining an extensive sound base. The Telugu basic symbol pronunciation table along with the Romanization of Telugu sounds i.e RIT form of Telugu pronunciation using Telugu Lipi standards is shown in Table 1. Telugu Vuccharana pattika(Pronunciation Table) III. Effect of speaker variant speech: The performance of an ASR system tested by different speakers’ variant tends to markedly degrade as gender and non-native speakers make pronunciation variants as compared to native speakers. In this section the effect of non-native and gender speech on the performance of an ASR system constructed from native speech is discussed. In particular, Telugu is selected the target language for this paper and non-native speaker is from Hindi. To complete this work, a baseline Telugu ASR system is first constructed with CMU phonemes mapped with the Telugu Phonemes are constructed by considering all 51 akshara’s of basic Telugu glyphs, and its performance is then evaluated using the Telugu words uttered by the Hindi speakers and Telugu speakers with different region people. Baseline Telugu ASR A set of Telugu words are considered(Nagamani, 2010) Telugu wordlist of 665 words is used as the training set for the native-Telugu ASR system. Here after these Telugu word list called UoH word list0. UWL0 is a 665 words of 10 speakers data with regional and nativity variant speech covering syllables of Telugu language word list.
  • 3. Substitution error analysis for improving the word Accuracy in Telugu Language Automatic Speech www.iosrjournals.org 9 | Page Phonetically rich text corpora To cover all the symbols or akshara are considered for define the phone list for the Telugu Language. Total 51 akshara are considered hence base form it required 51 symbols of alphabets to represent these phones. First step all the CMU based phones are tried to represent the 51 aksharas. The selected symbols are the derived phone list from the CMU phone list. Human languages are mapped to the Computer language by means of ASCII code. CMU phone list represent 39 phones which are not sufficient and also they are build based on American English pronunciation. Phonetically rich data can be selected from a medium corpus of text. A set of such words were derived by processing Telugu on-line tutors available in internet and also from primary school syllables of Telugu Language subject to build corpus[3-1 ]. This process involved translating the Telugu word corpus printed in English(RIT) form to generating set of words that are phonetically rich(i.e., the words contains most phonemes of the language). Here we considered on text based words around 600 which covered most syllables of morphemes and also speaker variability purpose male and female variation study. Analysis of the pronunciation variability Pronunciation lexicon for Telugu Speech Recognition system using UOH defined Phonelist. Refine the pronunciation lexicon by adding new pronunciation for the same utterance based on acoustic signal, add or deleted phonemes in the phonelist, refine the acoustic signal or wave file for single speaker. Vowels:
  • 4. Substitution error analysis for improving the word Accuracy in Telugu Language Automatic Speech www.iosrjournals.org 10 | Page Consonents: Table of Confusion pair words in error analysis: IV. Result analysis The ASR system recognition is performed by taking the Vowel sounds, consonant sounds and Isolated words of different size of letters present in it. Each experiment is carried out by correcting the Substitutional errors. Substitution errors are caused by Insertion and deletion errors will be corrected by adjusting the phone set and acoustic signal and finally reached to 100% word accuracy in individual and combination of Vowel and consonant test data. The confusion matrix for vowel and consonants shown in Figure 4.1 and Figure 4.2 will give the substitution of phones list in lower and upper triangles shown in the matrix. The diagonal will represent the recognition accuracy. 1.1.1 Confusion matrix for Vowels Figure 4.1. confustion matrix drawn for Vowels in 10 experiments 1.1.1.1 Confustion matrix for Consonents Figure 4.2 Confusion matrix for Telugu Consonents. References: [1] “Automatic Speech Understanding” http://ewh.ieee.org/r10/bombay//news6/AutoSpeechRecog/ASR.htm [2] http://www.research.ibm.com/thinkresearch/pages/2002/20020918_speech.shtml [2] http://cslu.cse.ogi.edu/asr/ [3] http://www-4.ibm.com/software/speech/ [4] A. Gallardo-Antolín*, J. Ferreiros, J. Macías-Guarasa, R. de Córdoba and J. M. Pardo “Incorporating multiple-hmm acoustic modeling in a Modular large vocabulary speech recognition system in Telephone environment” ICSIP2000. [5] Yoo Rhee Oh, Jae Sam Yoon, Hong Kook Kim “Acoustic model adaptation based on pronunciation variability analysis for non- native speech recognition” Speech Communication, Volume 49, Issue 1, January 2007, Pages 59-70