SlideShare une entreprise Scribd logo
1  sur  27
PRESENTATION ON
SPEECH RECOGNITION USING
NEURAL NETWORK
Prepared by-
Kamonasish Hore (100103003)
CSE , Dept. of IT,
IST, Gauhati University
Outlines:
 Introduction
 Objective
 Benefits of Speech recognition
 Literature Survey
 Hardware and Software Requirement Specifications
 Proposed Work
 Phases of the Project
 Conclusion
 Future Scope
 Bibliography
Introduction:
What is Speech?
 The faculty or act of expressing or describing thoughts, feelings, or
perceptions by the articulation of words.
 A form of communication in spoken language, made by a speaker before
an audience.
Speech Recognition:
 Speech Recognition (SR) is the ability to translate a dictation or spoken
word to text.
 Also known as “automatic speech recognition” (ASR), “computer
speech recognition”, or “speech to text” (STT)
Where it can be used ?
 Dictation
 System control/navigation
 Commercial/Industrial applications
 Personal Computers
 Health Care
 Telephony - Smart-phones
- Customer Helpline Services
Artificial Neural Networks:
 An artificial neural network is a computer program, which attempt to
emulate the biological functions of the Human brain.
 They are an excellent classification systems, and have been effective
with noisy, patterned, variable data streams containing multiple,
overlapping, interacting and incomplete cues.
 Neural networks do not require the complete specification of a problem,
learning instead through exposure to large amount of example data.
 Neural networks comprise of an input layer, one or more hidden layers,
and one output layer. The way in which the nodes and layers of a
network are organised is called the networks architecture.
Objective:
 The project is started with a sole aim in mind that the
design should be able to recognize the voice of a person
by analyzing the speech signal.
Benefits of Speech Recognition:
There are many pros of speech recognition out which few are listed
below:
 Faster than “hand-writing”.
 Allows for better spelling, whether it be in text or documents.
 Helpful for people with a mental or physical disability.
 Hands-free capability.
Literature Survey:
 In the journal [1] entitled "Speech Recognition in the Electronic Health Record"
whose authors are Sherry Doggett, Julie A. Dooling (RHIA), Susan Lucci (RHIT,
CHPS, CMT, AHDI-F ) have done work on ‘Speech Recognition in the Electronic
Health Record (EHR)’ using Front-end speech recognition (FESR) and back-end
speech recognition (BESR) technologies help in the production of legible and
comprehensive document(s). It also serves as a productivity tool to help lower costs
and increase productivity, especially when compared to the manual labor required by
traditional dictation and transcription in the field of healthcare.
 In the paper [2] entitled “ Literature Review on Automatic Speech Recognition”
whose authors are Wiqas Ghai, Khalsa College (ASR) of Technology & Business
Studies, Mohali, Punjab and Navdeep Singh, Mata Gujri College, Fatehgarh Sahib,
Punjab have done work in the field of ‘Automatic Speech Recognition(ASR)’ for
developing an effective ASR for different languages and to show technological
perspective of ASR in different countries They have used artificial neural networks
(ANNs), mathematical models of the low-level circuits in the human brain, to improve
speech-recognition performance, through a model known as the ANN-Hidden Markov
Model (ANN-HMM) which have shown improvements in large-vocabulary speech
recognition systems.
Hardware and Software
Requirement specifications:
 Microphone.
 Software for converting the voice signal into .wav file format.
 Laptop/Desktop
 Matlab (R2011a and above)
Proposed Work : System Block Diagram
 Phase 1: Creating a specific data set (0 - 9).
 Phase 2: Recording the data.
 Phase 3: Feature Extraction.
 Phase 4: Analyzing and evaluating to a specific value/
Training.
 Phase 5: Evaluating the system against the real time data/
Testing.
 Phase 6: Analyzing the identification results.
Phases of the Project:
Phase 1: Creating a specific data set (0 - 9).
 A data set is build containing the numerical words (0-9) of English
language. We had selected to use the numerical words as it would save
our time and it would help us to make extreme feature extraction of
speaker’s voice.
Data set for numeric words
Zero (0)
One (1)
Two (2)
Three (3)
Four (4)
Five (5)
Six (6)
Seven (7)
Eight (8)
Nine (9)
Phase 2: Recording the data.
 Now, the next part of the project, the recording part, which requires a
headphone connected to our working system (laptop) where our
proposed system is being implemented and tested.
 We had selected the Numerical words set for this purpose and recorded
the voice using the in-built Matlab function “Wave record”, which records
sound using PC-based audio input device.
 The recording for each sample is done for 2 sec. each with standard
sampling rate of 44100 Hz which is also approved by International
Phonetic Association as Standard sample rate for Speech processing.
Phase 3: Feature Extraction.
 We had given the numerical set to around 50 speech (30 male and 20
female) to speak out their voice at different expressions and environment.
 After recording we select all the voices and convert the vector into matrix
form.
 As given as vector input, it creates a matrix one column at a time also
‘vec2mat’ places extra entries in the output matrix if necessary.
Phase 4: Analyzing and evaluating to a specific value/
Training.
 This phase contributes to the decision logic part of our project.
Phase 5: Evaluating the system against the real time data/
Testing.
 This phase is the last and designated indispensible part of our project.
 In the training phase we had trained the neuron with 500 different
samples of speech containing from both male and female in different
environments and expressions.
 During testing phase our program was tested with 10 male and 10
female different samples and found the samples to be recognized, and
also resulted that with the increase of number of samples accuracy rate of
recognition increases.
 The testing environment should be soundproof so that we shall get the
result with more accuracy.
Phase 6: Analyzing the identification results.
 In this phase the result of different voice sample is discussed.
 After taking the voice sample i.e. numeric (0-9) from both male and
female, we calculate the recognition percentage of the samples in
different sets of samples as taking 10,20,30,40 and 50 voices per numeric
word in analytical part of our program.
Phase 6: Analyzing the identification results.
 Table for 50 voice samples:
For 50 voice samples Recognition Rate (%) Un-Recognized Rate (%)
Zero(0) 82 18
One(1) 88 12
Two(2) 80 20
Three(3) 88 12
Four(4) 82 18
Five(5) 80.8 19.2
Six(6) 82.6 17.4
Seven(7) 83 17
Eight(8) 86 14
Nine(9) 86 14
 Recognition graph for 50 voice samples.
 Variation graph between No. of Voices (0-9) Vs Recognition Rate.
 Calculating Recognition Rate:
 Recognition rate = Voice (0+1+2+3+4+5+6+7+8+9) X 100%
10
 Table of final Recognition Rate achieved by different voice samples.
No. of voice samples Recognition Rate
10 75.1
20 75.35
30 77.15
40 80.08
50 83.84
 The final out-put of the project i.e. speech recognition is
shown below:- (Program runned in Matlab)
Conclusion:
 The main objective of this project was to identify speech of a person using
neural network.
 In order to meet this objective we had taken the numerical digits from
Zero to Nine (0-9) as the data set to be fed to the system.
 The system is tested against the voice signal of around 50 persons and
system gives approximately 82% accuracy.
 Lastly on completion, we can conclude our project with words that with
the increase of number of samples, the recognition rate increases
gradually.
Future Scope:
 Accuracy will become better and better.
 Dictation speech recognition will gradually become accepted.
 Small hand-held writing tablets for computer speech recognition dictation
and data entry will be developed, as faster processors and more memory
become available.
 Microphone and sound systems will be designed to adapt more quickly to
changing background noise levels, different environments, with better
recognition of extraneous material to be discarded.
Bibliography:
 [1] Russell, Ingrid. "Neural Networks Module". Retrieved 2012
 [2] "Speech Recognition in the Electronic Health Record (Updated)." Journal of
AHIMA 84, no.9 (Sept 2013).
 [3] International Journal of Computer Applications (0975 – 8887), Volume 41– No.8,
March 2012.
 [4] International Journal of Engineering Trends and Technology-
Volume4Issue2- 2013.
 [5] International Journal of Computer Applications (0975 – 8887), Volume 41– No.8,
March 2012.
THANK YOU

Contenu connexe

Tendances

A seminar report on speech recognition technology
A seminar report on speech recognition technologyA seminar report on speech recognition technology
A seminar report on speech recognition technologySrijanKumar18
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognitionRichie
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition TechnologySeminar Links
 
Speech recognition an overview
Speech recognition   an overviewSpeech recognition   an overview
Speech recognition an overviewVarun Jain
 
Voice Recognition
Voice RecognitionVoice Recognition
Voice RecognitionAmrita More
 
Sign Language Recognition based on Hands symbols Classification
Sign Language Recognition based on Hands symbols ClassificationSign Language Recognition based on Hands symbols Classification
Sign Language Recognition based on Hands symbols ClassificationTriloki Gupta
 
Sign language translator ieee power point
Sign language translator ieee power pointSign language translator ieee power point
Sign language translator ieee power pointMadhuri Yellapu
 
Speech emotion recognition
Speech emotion recognitionSpeech emotion recognition
Speech emotion recognitionsaniya shaikh
 
Speaker recognition using MFCC
Speaker recognition using MFCCSpeaker recognition using MFCC
Speaker recognition using MFCCHira Shaukat
 
Hand Written Character Recognition Using Neural Networks
Hand Written Character Recognition Using Neural Networks Hand Written Character Recognition Using Neural Networks
Hand Written Character Recognition Using Neural Networks Chiranjeevi Adi
 
Unit 1 speech processing
Unit 1 speech processingUnit 1 speech processing
Unit 1 speech processingazhagujaisudhan
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversionankit_saluja
 
TEXT-SPEECH PPT.pptx
TEXT-SPEECH PPT.pptxTEXT-SPEECH PPT.pptx
TEXT-SPEECH PPT.pptxNsaroj kumar
 
Silent sound-technology ppt final
Silent sound-technology ppt finalSilent sound-technology ppt final
Silent sound-technology ppt finalLohit Dalal
 
Natural language processing
Natural language processingNatural language processing
Natural language processingYogendra Tamang
 

Tendances (20)

Sign language recognizer
Sign language recognizerSign language recognizer
Sign language recognizer
 
A seminar report on speech recognition technology
A seminar report on speech recognition technologyA seminar report on speech recognition technology
A seminar report on speech recognition technology
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Handwritten Character Recognition
Handwritten Character RecognitionHandwritten Character Recognition
Handwritten Character Recognition
 
Speech recognition an overview
Speech recognition   an overviewSpeech recognition   an overview
Speech recognition an overview
 
Voice Recognition
Voice RecognitionVoice Recognition
Voice Recognition
 
Speech processing
Speech processingSpeech processing
Speech processing
 
Sign Language Recognition based on Hands symbols Classification
Sign Language Recognition based on Hands symbols ClassificationSign Language Recognition based on Hands symbols Classification
Sign Language Recognition based on Hands symbols Classification
 
Sign language translator ieee power point
Sign language translator ieee power pointSign language translator ieee power point
Sign language translator ieee power point
 
Speech emotion recognition
Speech emotion recognitionSpeech emotion recognition
Speech emotion recognition
 
Hand Gesture Recognition
Hand Gesture RecognitionHand Gesture Recognition
Hand Gesture Recognition
 
Speaker recognition using MFCC
Speaker recognition using MFCCSpeaker recognition using MFCC
Speaker recognition using MFCC
 
Hand Written Character Recognition Using Neural Networks
Hand Written Character Recognition Using Neural Networks Hand Written Character Recognition Using Neural Networks
Hand Written Character Recognition Using Neural Networks
 
Unit 1 speech processing
Unit 1 speech processingUnit 1 speech processing
Unit 1 speech processing
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversion
 
TEXT-SPEECH PPT.pptx
TEXT-SPEECH PPT.pptxTEXT-SPEECH PPT.pptx
TEXT-SPEECH PPT.pptx
 
Automatic Speech Recognition
Automatic Speech RecognitionAutomatic Speech Recognition
Automatic Speech Recognition
 
Silent sound-technology ppt final
Silent sound-technology ppt finalSilent sound-technology ppt final
Silent sound-technology ppt final
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 

Similaire à SPEECH RECOGNITION USING NEURAL NETWORK

NLP BASED INTERVIEW ASSESSMENT SYSTEM
NLP BASED INTERVIEW ASSESSMENT SYSTEMNLP BASED INTERVIEW ASSESSMENT SYSTEM
NLP BASED INTERVIEW ASSESSMENT SYSTEMvivatechijri
 
Speech Emotion Recognition Using Machine Learning
Speech Emotion Recognition Using Machine LearningSpeech Emotion Recognition Using Machine Learning
Speech Emotion Recognition Using Machine LearningIRJET Journal
 
AI for voice recognition.pptx
AI for voice recognition.pptxAI for voice recognition.pptx
AI for voice recognition.pptxJhalakDashora
 
MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK
 MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK
MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORKijitcs
 
Voice Recognition System using Template Matching
Voice Recognition System using Template MatchingVoice Recognition System using Template Matching
Voice Recognition System using Template MatchingIJORCS
 
Utterance Based Speaker Identification Using ANN
Utterance Based Speaker Identification Using ANNUtterance Based Speaker Identification Using ANN
Utterance Based Speaker Identification Using ANNIJCSEA Journal
 
Utterance Based Speaker Identification Using ANN
Utterance Based Speaker Identification Using ANNUtterance Based Speaker Identification Using ANN
Utterance Based Speaker Identification Using ANNIJCSEA Journal
 
A Voice Based Assistant Using Google Dialogflow And Machine Learning
A Voice Based Assistant Using Google Dialogflow And Machine LearningA Voice Based Assistant Using Google Dialogflow And Machine Learning
A Voice Based Assistant Using Google Dialogflow And Machine LearningEmily Smith
 
Emotion Recognition through Speech Analysis using various Deep Learning Algor...
Emotion Recognition through Speech Analysis using various Deep Learning Algor...Emotion Recognition through Speech Analysis using various Deep Learning Algor...
Emotion Recognition through Speech Analysis using various Deep Learning Algor...IRJET Journal
 
COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...
COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...
COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...csandit
 
Combined feature extraction techniques and naive bayes classifier for speech ...
Combined feature extraction techniques and naive bayes classifier for speech ...Combined feature extraction techniques and naive bayes classifier for speech ...
Combined feature extraction techniques and naive bayes classifier for speech ...csandit
 
COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...
COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...
COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...cscpconf
 
On the use of voice activity detection in speech emotion recognition
On the use of voice activity detection in speech emotion recognitionOn the use of voice activity detection in speech emotion recognition
On the use of voice activity detection in speech emotion recognitionjournalBEEI
 
Real Time Speaker Identification System – Design, Implementation and Validation
Real Time Speaker Identification System – Design, Implementation and ValidationReal Time Speaker Identification System – Design, Implementation and Validation
Real Time Speaker Identification System – Design, Implementation and ValidationIDES Editor
 
A survey on Enhancements in Speech Recognition
A survey on Enhancements in Speech RecognitionA survey on Enhancements in Speech Recognition
A survey on Enhancements in Speech RecognitionIRJET Journal
 
Parameters Optimization for Improving ASR Performance in Adverse Real World N...
Parameters Optimization for Improving ASR Performance in Adverse Real World N...Parameters Optimization for Improving ASR Performance in Adverse Real World N...
Parameters Optimization for Improving ASR Performance in Adverse Real World N...Waqas Tariq
 

Similaire à SPEECH RECOGNITION USING NEURAL NETWORK (20)

NLP BASED INTERVIEW ASSESSMENT SYSTEM
NLP BASED INTERVIEW ASSESSMENT SYSTEMNLP BASED INTERVIEW ASSESSMENT SYSTEM
NLP BASED INTERVIEW ASSESSMENT SYSTEM
 
Speech Emotion Recognition Using Machine Learning
Speech Emotion Recognition Using Machine LearningSpeech Emotion Recognition Using Machine Learning
Speech Emotion Recognition Using Machine Learning
 
50120140502007
5012014050200750120140502007
50120140502007
 
AI for voice recognition.pptx
AI for voice recognition.pptxAI for voice recognition.pptx
AI for voice recognition.pptx
 
MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK
 MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK
MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK
 
Voice Recognition System using Template Matching
Voice Recognition System using Template MatchingVoice Recognition System using Template Matching
Voice Recognition System using Template Matching
 
K010416167
K010416167K010416167
K010416167
 
10
1010
10
 
Utterance Based Speaker Identification Using ANN
Utterance Based Speaker Identification Using ANNUtterance Based Speaker Identification Using ANN
Utterance Based Speaker Identification Using ANN
 
Utterance Based Speaker Identification Using ANN
Utterance Based Speaker Identification Using ANNUtterance Based Speaker Identification Using ANN
Utterance Based Speaker Identification Using ANN
 
A Voice Based Assistant Using Google Dialogflow And Machine Learning
A Voice Based Assistant Using Google Dialogflow And Machine LearningA Voice Based Assistant Using Google Dialogflow And Machine Learning
A Voice Based Assistant Using Google Dialogflow And Machine Learning
 
Emotion Recognition through Speech Analysis using various Deep Learning Algor...
Emotion Recognition through Speech Analysis using various Deep Learning Algor...Emotion Recognition through Speech Analysis using various Deep Learning Algor...
Emotion Recognition through Speech Analysis using various Deep Learning Algor...
 
COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...
COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...
COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...
 
Combined feature extraction techniques and naive bayes classifier for speech ...
Combined feature extraction techniques and naive bayes classifier for speech ...Combined feature extraction techniques and naive bayes classifier for speech ...
Combined feature extraction techniques and naive bayes classifier for speech ...
 
COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...
COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...
COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...
 
On the use of voice activity detection in speech emotion recognition
On the use of voice activity detection in speech emotion recognitionOn the use of voice activity detection in speech emotion recognition
On the use of voice activity detection in speech emotion recognition
 
SRIKANTH RAMASWAMY POSTER
SRIKANTH RAMASWAMY POSTERSRIKANTH RAMASWAMY POSTER
SRIKANTH RAMASWAMY POSTER
 
Real Time Speaker Identification System – Design, Implementation and Validation
Real Time Speaker Identification System – Design, Implementation and ValidationReal Time Speaker Identification System – Design, Implementation and Validation
Real Time Speaker Identification System – Design, Implementation and Validation
 
A survey on Enhancements in Speech Recognition
A survey on Enhancements in Speech RecognitionA survey on Enhancements in Speech Recognition
A survey on Enhancements in Speech Recognition
 
Parameters Optimization for Improving ASR Performance in Adverse Real World N...
Parameters Optimization for Improving ASR Performance in Adverse Real World N...Parameters Optimization for Improving ASR Performance in Adverse Real World N...
Parameters Optimization for Improving ASR Performance in Adverse Real World N...
 

Dernier

Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
fundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyfundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyDrAnita Sharma
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 

Dernier (20)

The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
fundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyfundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomology
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 

SPEECH RECOGNITION USING NEURAL NETWORK

  • 1. PRESENTATION ON SPEECH RECOGNITION USING NEURAL NETWORK Prepared by- Kamonasish Hore (100103003) CSE , Dept. of IT, IST, Gauhati University
  • 2. Outlines:  Introduction  Objective  Benefits of Speech recognition  Literature Survey  Hardware and Software Requirement Specifications  Proposed Work  Phases of the Project  Conclusion  Future Scope  Bibliography
  • 3. Introduction: What is Speech?  The faculty or act of expressing or describing thoughts, feelings, or perceptions by the articulation of words.  A form of communication in spoken language, made by a speaker before an audience.
  • 4. Speech Recognition:  Speech Recognition (SR) is the ability to translate a dictation or spoken word to text.  Also known as “automatic speech recognition” (ASR), “computer speech recognition”, or “speech to text” (STT)
  • 5. Where it can be used ?  Dictation  System control/navigation  Commercial/Industrial applications  Personal Computers  Health Care  Telephony - Smart-phones - Customer Helpline Services
  • 6. Artificial Neural Networks:  An artificial neural network is a computer program, which attempt to emulate the biological functions of the Human brain.  They are an excellent classification systems, and have been effective with noisy, patterned, variable data streams containing multiple, overlapping, interacting and incomplete cues.  Neural networks do not require the complete specification of a problem, learning instead through exposure to large amount of example data.  Neural networks comprise of an input layer, one or more hidden layers, and one output layer. The way in which the nodes and layers of a network are organised is called the networks architecture.
  • 7. Objective:  The project is started with a sole aim in mind that the design should be able to recognize the voice of a person by analyzing the speech signal.
  • 8. Benefits of Speech Recognition: There are many pros of speech recognition out which few are listed below:  Faster than “hand-writing”.  Allows for better spelling, whether it be in text or documents.  Helpful for people with a mental or physical disability.  Hands-free capability.
  • 9. Literature Survey:  In the journal [1] entitled "Speech Recognition in the Electronic Health Record" whose authors are Sherry Doggett, Julie A. Dooling (RHIA), Susan Lucci (RHIT, CHPS, CMT, AHDI-F ) have done work on ‘Speech Recognition in the Electronic Health Record (EHR)’ using Front-end speech recognition (FESR) and back-end speech recognition (BESR) technologies help in the production of legible and comprehensive document(s). It also serves as a productivity tool to help lower costs and increase productivity, especially when compared to the manual labor required by traditional dictation and transcription in the field of healthcare.  In the paper [2] entitled “ Literature Review on Automatic Speech Recognition” whose authors are Wiqas Ghai, Khalsa College (ASR) of Technology & Business Studies, Mohali, Punjab and Navdeep Singh, Mata Gujri College, Fatehgarh Sahib, Punjab have done work in the field of ‘Automatic Speech Recognition(ASR)’ for developing an effective ASR for different languages and to show technological perspective of ASR in different countries They have used artificial neural networks (ANNs), mathematical models of the low-level circuits in the human brain, to improve speech-recognition performance, through a model known as the ANN-Hidden Markov Model (ANN-HMM) which have shown improvements in large-vocabulary speech recognition systems.
  • 10. Hardware and Software Requirement specifications:  Microphone.  Software for converting the voice signal into .wav file format.  Laptop/Desktop  Matlab (R2011a and above)
  • 11. Proposed Work : System Block Diagram
  • 12.  Phase 1: Creating a specific data set (0 - 9).  Phase 2: Recording the data.  Phase 3: Feature Extraction.  Phase 4: Analyzing and evaluating to a specific value/ Training.  Phase 5: Evaluating the system against the real time data/ Testing.  Phase 6: Analyzing the identification results. Phases of the Project:
  • 13. Phase 1: Creating a specific data set (0 - 9).  A data set is build containing the numerical words (0-9) of English language. We had selected to use the numerical words as it would save our time and it would help us to make extreme feature extraction of speaker’s voice. Data set for numeric words Zero (0) One (1) Two (2) Three (3) Four (4) Five (5) Six (6) Seven (7) Eight (8) Nine (9)
  • 14. Phase 2: Recording the data.  Now, the next part of the project, the recording part, which requires a headphone connected to our working system (laptop) where our proposed system is being implemented and tested.  We had selected the Numerical words set for this purpose and recorded the voice using the in-built Matlab function “Wave record”, which records sound using PC-based audio input device.  The recording for each sample is done for 2 sec. each with standard sampling rate of 44100 Hz which is also approved by International Phonetic Association as Standard sample rate for Speech processing.
  • 15. Phase 3: Feature Extraction.  We had given the numerical set to around 50 speech (30 male and 20 female) to speak out their voice at different expressions and environment.  After recording we select all the voices and convert the vector into matrix form.  As given as vector input, it creates a matrix one column at a time also ‘vec2mat’ places extra entries in the output matrix if necessary.
  • 16. Phase 4: Analyzing and evaluating to a specific value/ Training.  This phase contributes to the decision logic part of our project.
  • 17. Phase 5: Evaluating the system against the real time data/ Testing.  This phase is the last and designated indispensible part of our project.  In the training phase we had trained the neuron with 500 different samples of speech containing from both male and female in different environments and expressions.  During testing phase our program was tested with 10 male and 10 female different samples and found the samples to be recognized, and also resulted that with the increase of number of samples accuracy rate of recognition increases.  The testing environment should be soundproof so that we shall get the result with more accuracy.
  • 18. Phase 6: Analyzing the identification results.  In this phase the result of different voice sample is discussed.  After taking the voice sample i.e. numeric (0-9) from both male and female, we calculate the recognition percentage of the samples in different sets of samples as taking 10,20,30,40 and 50 voices per numeric word in analytical part of our program.
  • 19. Phase 6: Analyzing the identification results.  Table for 50 voice samples: For 50 voice samples Recognition Rate (%) Un-Recognized Rate (%) Zero(0) 82 18 One(1) 88 12 Two(2) 80 20 Three(3) 88 12 Four(4) 82 18 Five(5) 80.8 19.2 Six(6) 82.6 17.4 Seven(7) 83 17 Eight(8) 86 14 Nine(9) 86 14
  • 20.  Recognition graph for 50 voice samples.
  • 21.  Variation graph between No. of Voices (0-9) Vs Recognition Rate.
  • 22.  Calculating Recognition Rate:  Recognition rate = Voice (0+1+2+3+4+5+6+7+8+9) X 100% 10  Table of final Recognition Rate achieved by different voice samples. No. of voice samples Recognition Rate 10 75.1 20 75.35 30 77.15 40 80.08 50 83.84
  • 23.  The final out-put of the project i.e. speech recognition is shown below:- (Program runned in Matlab)
  • 24. Conclusion:  The main objective of this project was to identify speech of a person using neural network.  In order to meet this objective we had taken the numerical digits from Zero to Nine (0-9) as the data set to be fed to the system.  The system is tested against the voice signal of around 50 persons and system gives approximately 82% accuracy.  Lastly on completion, we can conclude our project with words that with the increase of number of samples, the recognition rate increases gradually.
  • 25. Future Scope:  Accuracy will become better and better.  Dictation speech recognition will gradually become accepted.  Small hand-held writing tablets for computer speech recognition dictation and data entry will be developed, as faster processors and more memory become available.  Microphone and sound systems will be designed to adapt more quickly to changing background noise levels, different environments, with better recognition of extraneous material to be discarded.
  • 26. Bibliography:  [1] Russell, Ingrid. "Neural Networks Module". Retrieved 2012  [2] "Speech Recognition in the Electronic Health Record (Updated)." Journal of AHIMA 84, no.9 (Sept 2013).  [3] International Journal of Computer Applications (0975 – 8887), Volume 41– No.8, March 2012.  [4] International Journal of Engineering Trends and Technology- Volume4Issue2- 2013.  [5] International Journal of Computer Applications (0975 – 8887), Volume 41– No.8, March 2012.