SlideShare une entreprise Scribd logo
1  sur  34
SPEAKER RECOGNITION 
SYSTEMS 
BY 
NAMRATHA D’CRUZ
Sub areas of speaker recognition 
• Speaker verification system 
• Speaker identification system
Speaker recognition problem 
Signal 
processor 
Comparison 
distance 
measurement 
Decision 
logic 
Reference 
patterns 
s(n) x D 
Pattern Distance identification 
vector 
General representation of the speaker recognition problem
 A representation of the speech signal is obtained 
using digital speech processing techniques 
 which preserve the features of the speech signal 
that are relevant to speaker identity. 
 The resulting pattern is compared to previously 
prepared reference patterns. 
 Decision logic is used to make a choice among 
available alternatives
 For speaker verification system if we denote the PDF 
for the measurement vector x for the ith speaker as pi(x) 
then the decision rule is given by 
 Where ci is a constant for the ith speaker and pav(x) is 
the average PDF for the measurement vector x 
 For speaker identification system the decision rule is 
given by
Speaker verification system 
Computer verification of speakers 
Block diagram of a speaker verification system
 Online digital speaker verification system was 
developed by Rosenberg and others. 
 The person wishing to be verified first enters his 
claimed identity. 
 On request from verification system utters his 
verification phrase, and requests some transaction to 
be made in the event he is verified. 
 The spoken utterance is processed to obtain a 
pattern which is compared to the stored reference 
patterns for the claimed identity.
 On the basis of the transaction requested the error mix 
constant (Ci) is determined . 
 Based on error mix constant decision to accept or reject 
is made.
Accept 
Reject 
Signal processing aspects of the speaker verification system
Signal Processing Parts Of The Speaker 
Verification System 
End point detection system: the sample 
utterances which occurs somewhere within a pre 
selected time interval is located. 
Pitch detector : it is used to measure the pitch 
contour of the utterance. 
Energy measurements: short-time energy 
measurements is made to give energy contours.
Signal Processing Parts Of The Speaker 
Verification System 
LPC analysis: is used to give predictor parameter 
contours. 
 LPC is a tool used for representing the spectral 
envelope of a digital signal of speech 
in compressed form, using the information of a linear 
predictive model. 
 Autocorrelation formulation method is used. 
Formant analysis: estimates of the formant 
locations is made. 
LPF: 16hz low pass is used
Measurement contours for the test utterance “we 
were away a year ago” 
 Data are estimated at 100 times per second 
 Smoothened by 16hz LPF, linear phase, FIR 
digital filter.
Pitch period and intensity contours of an utterance used in speaker 
verification
Plot of first 3 formants ,pitch and intensity for a speaker 
verification utterance
Plots of the first 8 LPC coefficients for a speaker verification 
utterance
 After the desired parametric representation has been 
computed it is compared with the corresponding 
reference patterns for the speaker whose identity is 
claimed. 
 Speaker is generally not able to speak at precisely the 
same rate for different repetitions of the verification 
phase. 
 As a solution to this problem non linear time warping of 
the input patterns is done to obtain the best possible 
registration between stored pattern and the measured 
patterns for speakers sample utterance.
Time warping 
 The time scale t of a reference utterance is warped so 
that significant events in some measurement contour a(t) 
line up with the same significant events in the reference 
contour r(t). 
 The warping function is assumed to be 
τ=α t+q(t) 
Where 
q(t) - is the non linear time warp function 
α – average slope of the time warp function
Time warping 
 Boundary condition s are imposed to ensure that the 
beginning and ending points of both the sample and 
reference utterances line up properly. 
 The boundary conditions are: 
τ1=α t1+q(t1) 
τ2=α t2+q(t2) 
 Function q(t) and constant α have to be chosen so as to 
best align the measured contours. 
 Simpler and faster solution is to utilize the method of 
dynamic programming to optimally choose a constrained 
warping function.
Illustration of time warping
Time warping 
 Consider time warping for a pair of contours which are 
sampled at a discrete set of points . 
 Let the points be in the measured contour be labeled 
n=1,2,…,N. 
 Let the points in the reference contour be labeled 
m=1,2,…,M. 
 Time warping function w is chosen as 
m=w(n)
Time warping 
 The boundary on w(n) conditions are: 
w(1) = 1 beginning points 
w(N) = M ending points 
 To limit the degree of non linearity of the warping 
function mild continuity condition is imposed 
 That the warping function w cannot change by more 
than 2 grid points at any index n 
w(n+1)-w(n) = 0,1,2 if w(n) != w(n-1) 
= 1,2 if w(n) = w(n-1) 
 Thus slope of warping function is either 0,1 or 2
Time warping 
 To determine which of the conditions of equation to use 
at grid index n requires the use of similarity measure 
between the reference data measured at grid index n and 
the test data measured at grid index m. 
 The similarity measure is used to determine the path of 
the warping function which minimizes the max total 
distance ,subject to constraints of continuity equation.
An example of a typical time warping
Time warping 
 Figure shows the possible grid coordinates (n,m) and a 
warping function w(n). 
 Consider N = 20 reference and M = 15 test utterance. 
 Because of continuity constraints the warping function 
must lie within the parallelogram. 
 The final step is to compute overall distance measures 
and then compare the distance to an appropriately 
chosen threshold. 
 The simplest distance contour measure is a normalized 
sum of squares .
Distance measure 
 For the jth measurement contour ,the distance dj would 
be of the form 
 Where ajs (i) is the value of the jth measurement contour 
at time i 
 ajr (i) ) is the value of the jth reference contour at time 
i, and σaj(i) is the standard deviation of the jth 
measurement at time i
Distance measure 
 The distance function is given by 
 Where wj is the jth weight chosen on the basis of the 
effectiveness of the jth measurement in verifying the 
speaker.
SPEAKER IDENTIFICATION 
SYSTEMS 
 Almost similar to the speaker verification systems 
 Main difference is choice of parameters to make 
distance measurements. 
 N distance measurements have to be made rather than 1. 
 Final decision is to choose the speaker whose reference 
patterns are closest in distance to the sample patterns.
SPEAKER IDENTIFICATION 
SYSTEMS 
 More sophisticated and robust distance measure is used. 
 Let x be an L- dimensional column vector representing 
input pattern , in which the kth component of x is the kth 
measurement. 
 It is assumed that joint PDF of the measurements for the 
ith speaker is a multi dimensional Gaussian distribution 
with mean mi and covariance matrix wi. Thus ,the L-dimensional 
Gaussian density function for x is given by
SPEAKER IDENTIFICATION SYSTEMS 
 Where is the inverse of the matrix (assuming is 
non singular),| | is the determinant of , and the t 
denotes the transpose of a vector. The decision rule 
which minimizes the probability of error states that the 
measurement vector X should be assigned to class i if 
 Where pi is the priori probability that belongs to the ith 
class. Since ln y is a monotonically increasing function 
of its argument y, the decision rule can be simplified as 
Decide class i if
SPEAKER IDENTIFICATION SYSTEMS 
 The bias term does not provide any advantage over the 
decision rule . Thus the distance measure is defined as 
 The mean and covariance vector is defined as
Examples of some measured parameters
Speaker identification accuracy
Speaker identification accuracy(using 
cepstrum parameters)
Speaker recognition systems

Contenu connexe

Tendances

COLEA : A MATLAB Tool for Speech Analysis
COLEA : A MATLAB Tool for Speech AnalysisCOLEA : A MATLAB Tool for Speech Analysis
COLEA : A MATLAB Tool for Speech AnalysisRushin Shah
 
Text Prompted Remote Speaker Authentication : Joint Speech and Speaker Recogn...
Text Prompted Remote Speaker Authentication : Joint Speech and Speaker Recogn...Text Prompted Remote Speaker Authentication : Joint Speech and Speaker Recogn...
Text Prompted Remote Speaker Authentication : Joint Speech and Speaker Recogn...gt_ebuddy
 
Automatic Speaker Recognition system using MFCC and VQ approach
Automatic Speaker Recognition system using MFCC and VQ approachAutomatic Speaker Recognition system using MFCC and VQ approach
Automatic Speaker Recognition system using MFCC and VQ approachAbdullah al Mamun
 
Speech based password authentication system on FPGA
Speech based password authentication system on FPGASpeech based password authentication system on FPGA
Speech based password authentication system on FPGARajesh Roshan
 
1 AUDIO SIGNAL PROCESSING
1 AUDIO SIGNAL PROCESSING1 AUDIO SIGNAL PROCESSING
1 AUDIO SIGNAL PROCESSINGmukesh bhardwaj
 
SPEKER RECOGNITION UNDER LIMITED DATA CODITION
SPEKER RECOGNITION UNDER LIMITED DATA CODITIONSPEKER RECOGNITION UNDER LIMITED DATA CODITION
SPEKER RECOGNITION UNDER LIMITED DATA CODITIONniranjan kumar
 
Voice biometric recognition
Voice biometric recognitionVoice biometric recognition
Voice biometric recognitionphyuhsan
 
Digital modeling of speech signal
Digital modeling of speech signalDigital modeling of speech signal
Digital modeling of speech signalVinodhini
 
LPC for Speech Recognition
LPC for Speech RecognitionLPC for Speech Recognition
LPC for Speech RecognitionDr. Uday Saikia
 
Automatic speech emotion and speaker recognition based on hybrid gmm and ffbnn
Automatic speech emotion and speaker recognition based on hybrid gmm and ffbnnAutomatic speech emotion and speaker recognition based on hybrid gmm and ffbnn
Automatic speech emotion and speaker recognition based on hybrid gmm and ffbnnijcsa
 
A New Method for Pitch Tracking and Voicing Decision Based on Spectral Multi-...
A New Method for Pitch Tracking and Voicing Decision Based on Spectral Multi-...A New Method for Pitch Tracking and Voicing Decision Based on Spectral Multi-...
A New Method for Pitch Tracking and Voicing Decision Based on Spectral Multi-...CSCJournals
 
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...ijceronline
 

Tendances (18)

SPEAKER VERIFICATION
SPEAKER VERIFICATIONSPEAKER VERIFICATION
SPEAKER VERIFICATION
 
COLEA : A MATLAB Tool for Speech Analysis
COLEA : A MATLAB Tool for Speech AnalysisCOLEA : A MATLAB Tool for Speech Analysis
COLEA : A MATLAB Tool for Speech Analysis
 
Text Prompted Remote Speaker Authentication : Joint Speech and Speaker Recogn...
Text Prompted Remote Speaker Authentication : Joint Speech and Speaker Recogn...Text Prompted Remote Speaker Authentication : Joint Speech and Speaker Recogn...
Text Prompted Remote Speaker Authentication : Joint Speech and Speaker Recogn...
 
Automatic Speaker Recognition system using MFCC and VQ approach
Automatic Speaker Recognition system using MFCC and VQ approachAutomatic Speaker Recognition system using MFCC and VQ approach
Automatic Speaker Recognition system using MFCC and VQ approach
 
Speech based password authentication system on FPGA
Speech based password authentication system on FPGASpeech based password authentication system on FPGA
Speech based password authentication system on FPGA
 
1 AUDIO SIGNAL PROCESSING
1 AUDIO SIGNAL PROCESSING1 AUDIO SIGNAL PROCESSING
1 AUDIO SIGNAL PROCESSING
 
SPEKER RECOGNITION UNDER LIMITED DATA CODITION
SPEKER RECOGNITION UNDER LIMITED DATA CODITIONSPEKER RECOGNITION UNDER LIMITED DATA CODITION
SPEKER RECOGNITION UNDER LIMITED DATA CODITION
 
Voice biometric recognition
Voice biometric recognitionVoice biometric recognition
Voice biometric recognition
 
Digital modeling of speech signal
Digital modeling of speech signalDigital modeling of speech signal
Digital modeling of speech signal
 
LPC for Speech Recognition
LPC for Speech RecognitionLPC for Speech Recognition
LPC for Speech Recognition
 
Automatic speech emotion and speaker recognition based on hybrid gmm and ffbnn
Automatic speech emotion and speaker recognition based on hybrid gmm and ffbnnAutomatic speech emotion and speaker recognition based on hybrid gmm and ffbnn
Automatic speech emotion and speaker recognition based on hybrid gmm and ffbnn
 
Subband Coding
Subband CodingSubband Coding
Subband Coding
 
A New Method for Pitch Tracking and Voicing Decision Based on Spectral Multi-...
A New Method for Pitch Tracking and Voicing Decision Based on Spectral Multi-...A New Method for Pitch Tracking and Voicing Decision Based on Spectral Multi-...
A New Method for Pitch Tracking and Voicing Decision Based on Spectral Multi-...
 
Final ppt
Final pptFinal ppt
Final ppt
 
V041203124126
V041203124126V041203124126
V041203124126
 
Matlab: Speech Signal Analysis
Matlab: Speech Signal AnalysisMatlab: Speech Signal Analysis
Matlab: Speech Signal Analysis
 
A017410108
A017410108A017410108
A017410108
 
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
 

En vedette

homeworkpaperhouse
homeworkpaperhousehomeworkpaperhouse
homeworkpaperhouseFatih Taylı
 
Top 10 Most Popular Torrent Sites of 2015
Top 10 Most Popular Torrent Sites of 2015Top 10 Most Popular Torrent Sites of 2015
Top 10 Most Popular Torrent Sites of 2015succinctattorne69
 
Проект "Образование"
Проект "Образование"Проект "Образование"
Проект "Образование"Роман Ра
 
Evaluation of my music magazine
Evaluation of my music magazineEvaluation of my music magazine
Evaluation of my music magazineemaanadnan
 
Wind Energy in Transportation
Wind Energy in TransportationWind Energy in Transportation
Wind Energy in TransportationFatih Taylı
 
01.voici l'écrivain 01
01.voici l'écrivain 0101.voici l'écrivain 01
01.voici l'écrivain 01swatijulie84
 
Conteht apageedf
Conteht apageedfConteht apageedf
Conteht apageedfemaanadnan
 
Daniel's presentation from AIU I-9
Daniel's presentation from AIU I-9Daniel's presentation from AIU I-9
Daniel's presentation from AIU I-9Fatih Taylı
 
Evaluation of my music magazine
Evaluation of my music magazineEvaluation of my music magazine
Evaluation of my music magazineemaanadnan
 
Practica blog
Practica blogPractica blog
Practica blogfcortijoh
 
შეფასების გზამკვლევი
შეფასების გზამკვლევიშეფასების გზამკვლევი
შეფასების გზამკვლევიMaia Metreveli
 
ხეო, უფლისაო...
ხეო, უფლისაო...ხეო, უფლისაო...
ხეო, უფლისაო...Maia Metreveli
 

En vedette (20)

AB'16:26
AB'16:26AB'16:26
AB'16:26
 
Figen
FigenFigen
Figen
 
Final project
Final projectFinal project
Final project
 
homeworkpaperhouse
homeworkpaperhousehomeworkpaperhouse
homeworkpaperhouse
 
Top 10 Most Popular Torrent Sites of 2015
Top 10 Most Popular Torrent Sites of 2015Top 10 Most Popular Torrent Sites of 2015
Top 10 Most Popular Torrent Sites of 2015
 
Practica blog
Practica blogPractica blog
Practica blog
 
Проект "Образование"
Проект "Образование"Проект "Образование"
Проект "Образование"
 
Evaluation of my music magazine
Evaluation of my music magazineEvaluation of my music magazine
Evaluation of my music magazine
 
Wind Energy in Transportation
Wind Energy in TransportationWind Energy in Transportation
Wind Energy in Transportation
 
Double spread
Double spreadDouble spread
Double spread
 
Banking (1)
Banking (1)Banking (1)
Banking (1)
 
01.voici l'écrivain 01
01.voici l'écrivain 0101.voici l'écrivain 01
01.voici l'écrivain 01
 
Conteht apageedf
Conteht apageedfConteht apageedf
Conteht apageedf
 
Maga
MagaMaga
Maga
 
Daniel's presentation from AIU I-9
Daniel's presentation from AIU I-9Daniel's presentation from AIU I-9
Daniel's presentation from AIU I-9
 
Mayúsculas pdf
Mayúsculas pdfMayúsculas pdf
Mayúsculas pdf
 
Evaluation of my music magazine
Evaluation of my music magazineEvaluation of my music magazine
Evaluation of my music magazine
 
Practica blog
Practica blogPractica blog
Practica blog
 
შეფასების გზამკვლევი
შეფასების გზამკვლევიშეფასების გზამკვლევი
შეფასების გზამკვლევი
 
ხეო, უფლისაო...
ხეო, უფლისაო...ხეო, უფლისაო...
ხეო, უფლისაო...
 

Similaire à Speaker recognition systems

Pulse code modulation and Quantization
Pulse code modulation and QuantizationPulse code modulation and Quantization
Pulse code modulation and QuantizationMuhamamd Awaissaleem
 
Optimum Receiver corrupted by AWGN Channel
Optimum Receiver corrupted by AWGN ChannelOptimum Receiver corrupted by AWGN Channel
Optimum Receiver corrupted by AWGN ChannelAWANISHKUMAR84
 
Method for Converter Synchronization with RF Injection
Method for Converter Synchronization with RF InjectionMethod for Converter Synchronization with RF Injection
Method for Converter Synchronization with RF InjectionCSCJournals
 
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...Tanjarul Islam Mishu
 
Distributed_Array_Algos.pptx
Distributed_Array_Algos.pptxDistributed_Array_Algos.pptx
Distributed_Array_Algos.pptxf20170649g
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
Wiener Filter Hardware Realization
Wiener Filter Hardware RealizationWiener Filter Hardware Realization
Wiener Filter Hardware RealizationSayan Chaudhuri
 
Projected Barzilai-Borwein Methods Applied to Distributed Compressive Spectru...
Projected Barzilai-Borwein Methods Applied to Distributed Compressive Spectru...Projected Barzilai-Borwein Methods Applied to Distributed Compressive Spectru...
Projected Barzilai-Borwein Methods Applied to Distributed Compressive Spectru...Polytechnique Montreal
 
Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...
Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...
Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...Venkata Sudhir Vedurla
 
An Algorithm For Vector Quantizer Design
An Algorithm For Vector Quantizer DesignAn Algorithm For Vector Quantizer Design
An Algorithm For Vector Quantizer DesignAngie Miller
 
Investigation of repeated blasts at Aitik mine using waveform cross correlation
Investigation of repeated blasts at Aitik mine using waveform cross correlationInvestigation of repeated blasts at Aitik mine using waveform cross correlation
Investigation of repeated blasts at Aitik mine using waveform cross correlationIvan Kitov
 
Channel and clipping level estimation for ofdm in io t –based networks a review
Channel and clipping level estimation for ofdm in io t –based networks a reviewChannel and clipping level estimation for ofdm in io t –based networks a review
Channel and clipping level estimation for ofdm in io t –based networks a reviewIJARIIT
 

Similaire à Speaker recognition systems (20)

dsp
dspdsp
dsp
 
Digital communication unit II
Digital communication unit IIDigital communication unit II
Digital communication unit II
 
Pulse code modulation and Quantization
Pulse code modulation and QuantizationPulse code modulation and Quantization
Pulse code modulation and Quantization
 
Optimum Receiver corrupted by AWGN Channel
Optimum Receiver corrupted by AWGN ChannelOptimum Receiver corrupted by AWGN Channel
Optimum Receiver corrupted by AWGN Channel
 
Method for Converter Synchronization with RF Injection
Method for Converter Synchronization with RF InjectionMethod for Converter Synchronization with RF Injection
Method for Converter Synchronization with RF Injection
 
Adaptive Filtering.ppt
Adaptive Filtering.pptAdaptive Filtering.ppt
Adaptive Filtering.ppt
 
Where Next
Where NextWhere Next
Where Next
 
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...
 
Digitization
DigitizationDigitization
Digitization
 
Distributed_Array_Algos.pptx
Distributed_Array_Algos.pptxDistributed_Array_Algos.pptx
Distributed_Array_Algos.pptx
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
Unit-1.pptx
Unit-1.pptxUnit-1.pptx
Unit-1.pptx
 
Wiener Filter Hardware Realization
Wiener Filter Hardware RealizationWiener Filter Hardware Realization
Wiener Filter Hardware Realization
 
Technical details
Technical detailsTechnical details
Technical details
 
Channel equalization
Channel equalizationChannel equalization
Channel equalization
 
Projected Barzilai-Borwein Methods Applied to Distributed Compressive Spectru...
Projected Barzilai-Borwein Methods Applied to Distributed Compressive Spectru...Projected Barzilai-Borwein Methods Applied to Distributed Compressive Spectru...
Projected Barzilai-Borwein Methods Applied to Distributed Compressive Spectru...
 
Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...
Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...
Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...
 
An Algorithm For Vector Quantizer Design
An Algorithm For Vector Quantizer DesignAn Algorithm For Vector Quantizer Design
An Algorithm For Vector Quantizer Design
 
Investigation of repeated blasts at Aitik mine using waveform cross correlation
Investigation of repeated blasts at Aitik mine using waveform cross correlationInvestigation of repeated blasts at Aitik mine using waveform cross correlation
Investigation of repeated blasts at Aitik mine using waveform cross correlation
 
Channel and clipping level estimation for ofdm in io t –based networks a review
Channel and clipping level estimation for ofdm in io t –based networks a reviewChannel and clipping level estimation for ofdm in io t –based networks a review
Channel and clipping level estimation for ofdm in io t –based networks a review
 

Dernier

XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXssuser89054b
 
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxHOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxSCMS School of Architecture
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . pptDineshKumar4165
 
Moment Distribution Method For Btech Civil
Moment Distribution Method For Btech CivilMoment Distribution Method For Btech Civil
Moment Distribution Method For Btech CivilVinayVitekari
 
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptxA CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptxmaisarahman1
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdfKamal Acharya
 
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptxOrlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptxMuhammadAsimMuhammad6
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapRishantSharmaFr
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptxJIT KUMAR GUPTA
 
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLEGEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLEselvakumar948
 
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best ServiceTamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Servicemeghakumariji156
 
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxSCMS School of Architecture
 
PE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiesPE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiessarkmank1
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startQuintin Balsdon
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptNANDHAKUMARA10
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdfKamal Acharya
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptDineshKumar4165
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueBhangaleSonal
 

Dernier (20)

Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxHOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
 
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
Moment Distribution Method For Btech Civil
Moment Distribution Method For Btech CivilMoment Distribution Method For Btech Civil
Moment Distribution Method For Btech Civil
 
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptxA CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdf
 
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptxOrlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
 
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLEGEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
 
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best ServiceTamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
 
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
 
PE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiesPE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and properties
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 

Speaker recognition systems

  • 1. SPEAKER RECOGNITION SYSTEMS BY NAMRATHA D’CRUZ
  • 2. Sub areas of speaker recognition • Speaker verification system • Speaker identification system
  • 3. Speaker recognition problem Signal processor Comparison distance measurement Decision logic Reference patterns s(n) x D Pattern Distance identification vector General representation of the speaker recognition problem
  • 4.  A representation of the speech signal is obtained using digital speech processing techniques  which preserve the features of the speech signal that are relevant to speaker identity.  The resulting pattern is compared to previously prepared reference patterns.  Decision logic is used to make a choice among available alternatives
  • 5.  For speaker verification system if we denote the PDF for the measurement vector x for the ith speaker as pi(x) then the decision rule is given by  Where ci is a constant for the ith speaker and pav(x) is the average PDF for the measurement vector x  For speaker identification system the decision rule is given by
  • 6. Speaker verification system Computer verification of speakers Block diagram of a speaker verification system
  • 7.  Online digital speaker verification system was developed by Rosenberg and others.  The person wishing to be verified first enters his claimed identity.  On request from verification system utters his verification phrase, and requests some transaction to be made in the event he is verified.  The spoken utterance is processed to obtain a pattern which is compared to the stored reference patterns for the claimed identity.
  • 8.  On the basis of the transaction requested the error mix constant (Ci) is determined .  Based on error mix constant decision to accept or reject is made.
  • 9. Accept Reject Signal processing aspects of the speaker verification system
  • 10. Signal Processing Parts Of The Speaker Verification System End point detection system: the sample utterances which occurs somewhere within a pre selected time interval is located. Pitch detector : it is used to measure the pitch contour of the utterance. Energy measurements: short-time energy measurements is made to give energy contours.
  • 11. Signal Processing Parts Of The Speaker Verification System LPC analysis: is used to give predictor parameter contours.  LPC is a tool used for representing the spectral envelope of a digital signal of speech in compressed form, using the information of a linear predictive model.  Autocorrelation formulation method is used. Formant analysis: estimates of the formant locations is made. LPF: 16hz low pass is used
  • 12. Measurement contours for the test utterance “we were away a year ago”  Data are estimated at 100 times per second  Smoothened by 16hz LPF, linear phase, FIR digital filter.
  • 13. Pitch period and intensity contours of an utterance used in speaker verification
  • 14. Plot of first 3 formants ,pitch and intensity for a speaker verification utterance
  • 15. Plots of the first 8 LPC coefficients for a speaker verification utterance
  • 16.  After the desired parametric representation has been computed it is compared with the corresponding reference patterns for the speaker whose identity is claimed.  Speaker is generally not able to speak at precisely the same rate for different repetitions of the verification phase.  As a solution to this problem non linear time warping of the input patterns is done to obtain the best possible registration between stored pattern and the measured patterns for speakers sample utterance.
  • 17. Time warping  The time scale t of a reference utterance is warped so that significant events in some measurement contour a(t) line up with the same significant events in the reference contour r(t).  The warping function is assumed to be τ=α t+q(t) Where q(t) - is the non linear time warp function α – average slope of the time warp function
  • 18. Time warping  Boundary condition s are imposed to ensure that the beginning and ending points of both the sample and reference utterances line up properly.  The boundary conditions are: τ1=α t1+q(t1) τ2=α t2+q(t2)  Function q(t) and constant α have to be chosen so as to best align the measured contours.  Simpler and faster solution is to utilize the method of dynamic programming to optimally choose a constrained warping function.
  • 20. Time warping  Consider time warping for a pair of contours which are sampled at a discrete set of points .  Let the points be in the measured contour be labeled n=1,2,…,N.  Let the points in the reference contour be labeled m=1,2,…,M.  Time warping function w is chosen as m=w(n)
  • 21. Time warping  The boundary on w(n) conditions are: w(1) = 1 beginning points w(N) = M ending points  To limit the degree of non linearity of the warping function mild continuity condition is imposed  That the warping function w cannot change by more than 2 grid points at any index n w(n+1)-w(n) = 0,1,2 if w(n) != w(n-1) = 1,2 if w(n) = w(n-1)  Thus slope of warping function is either 0,1 or 2
  • 22. Time warping  To determine which of the conditions of equation to use at grid index n requires the use of similarity measure between the reference data measured at grid index n and the test data measured at grid index m.  The similarity measure is used to determine the path of the warping function which minimizes the max total distance ,subject to constraints of continuity equation.
  • 23. An example of a typical time warping
  • 24. Time warping  Figure shows the possible grid coordinates (n,m) and a warping function w(n).  Consider N = 20 reference and M = 15 test utterance.  Because of continuity constraints the warping function must lie within the parallelogram.  The final step is to compute overall distance measures and then compare the distance to an appropriately chosen threshold.  The simplest distance contour measure is a normalized sum of squares .
  • 25. Distance measure  For the jth measurement contour ,the distance dj would be of the form  Where ajs (i) is the value of the jth measurement contour at time i  ajr (i) ) is the value of the jth reference contour at time i, and σaj(i) is the standard deviation of the jth measurement at time i
  • 26. Distance measure  The distance function is given by  Where wj is the jth weight chosen on the basis of the effectiveness of the jth measurement in verifying the speaker.
  • 27. SPEAKER IDENTIFICATION SYSTEMS  Almost similar to the speaker verification systems  Main difference is choice of parameters to make distance measurements.  N distance measurements have to be made rather than 1.  Final decision is to choose the speaker whose reference patterns are closest in distance to the sample patterns.
  • 28. SPEAKER IDENTIFICATION SYSTEMS  More sophisticated and robust distance measure is used.  Let x be an L- dimensional column vector representing input pattern , in which the kth component of x is the kth measurement.  It is assumed that joint PDF of the measurements for the ith speaker is a multi dimensional Gaussian distribution with mean mi and covariance matrix wi. Thus ,the L-dimensional Gaussian density function for x is given by
  • 29. SPEAKER IDENTIFICATION SYSTEMS  Where is the inverse of the matrix (assuming is non singular),| | is the determinant of , and the t denotes the transpose of a vector. The decision rule which minimizes the probability of error states that the measurement vector X should be assigned to class i if  Where pi is the priori probability that belongs to the ith class. Since ln y is a monotonically increasing function of its argument y, the decision rule can be simplified as Decide class i if
  • 30. SPEAKER IDENTIFICATION SYSTEMS  The bias term does not provide any advantage over the decision rule . Thus the distance measure is defined as  The mean and covariance vector is defined as
  • 31. Examples of some measured parameters