SlideShare une entreprise Scribd logo
1  sur  18
LINEAR PREDICTIVE CODING
-Compiled by
-Shruti Dasgupta
Introduction
 Linear Predictive Coding (LPC) is one of the most

powerful speech analysis techniques, and one of the
most useful methods for encoding good quality
speech at a low bit rate. It provides extremely
accurate estimates of speech parameters, and is
relatively efficient for computation.
 The most important aspect of LPC is the linear
predictive filter which allows the value of the next
sample to be determined by a linear combination of
previous samples.
 This provides a rate of 64000 bits/second. Linear
predictive coding reduces this to 2400 bits/second. At
this reduced rate the speech has a distinctive
synthetic sound and there is a noticeable loss of
quality. However, the speech is still audible and it can
still be easily understood. Since there is information
loss in linear predictive coding, it is a lossy form of
compression.
 LPC starts with the assumption that the speech signal is

produced by a buzzer at the end of a tube. The glottis (the space
between the vocal cords) produces the buzz, which is
characterized by its intensity (loudness) and frequency (pitch).
The vocal tract (the throat and mouth) forms the tube, which is
characterized by its resonances, which are called formants.
 LPC analyzes the speech signal by estimating the
formants, removing their effects from the speech signal, and
estimating the intensity and frequency of the remaining buzz.
The process of removing the formants is called inverse
filtering, and the remaining signal is called the residue.
 The numbers which describe the formants and the residue can
be stored or transmitted somewhere else. LPC synthesizes the
speech signal by reversing the process: use the residue to
create a source signal, use the formants to create a filter (which
represents the tube), and run the source through the
filter, resulting in speech.
 Because speech signals vary with time, this process is done on
short chunks of the speech signal, which are called frames.
Usually 30 to 50 frames per second give intelligible speech with
good compression.
LPC: Vocoder
 It has two key components: analysis or encoding and

synthesis or decoding. The analysis part of LPC involves
examining the speech signal and breaking it down into
segments or blocks. Each segment is than examined
further to find the answers to several key questions:
• Is the segment voiced or unvoiced?
• What is the pitch of the segment?
• What parameters are needed to build a filter that models
the vocal tract for the current segment?
 LPC analysis is usually conducted by a sender who
answers these questions and usually transmits these
answers onto a receiver.
 The receiver performs LPC synthesis by using the answers
received to build a filter that when provided the correct
input source will be able to accurately reproduce the
original speech signal. Essentially, LPC synthesis tries to
imitate human speech production.
LPC : Vocoder Figure
Human vs LPC Speech
Production
LPC-10
 According to government standard 1014, also

known as LPC-10, the input signal is sampled at
a rate of 8000 samples per second.
 This input signal is then broken up into segments
or blocks which are each analysed and
transmitted to the receiver.
 The 8000 samples in each second of speech
signal are broken into 180 sample segments. This
means that each segment represents 22.5
milliseconds of the input speech signal.
LPC : Excitation
 Before a speech segment is determined as being voiced or









unvoiced it is first passed through a low-pass filter with a
bandwidth of 1 kHz.
Determining if a segment is voiced or unvoiced is important
because voiced sounds have a different waveform then unvoiced
sounds. The differences in the two waveforms creates a need for
the use of two different input signals for the LPC filter in the
synthesis or decoding.
One input signal is for voiced sounds and the other is for
unvoiced. The LPC encoder notifies the decoder if a signal
segment is voiced or unvoiced by sending a single bit.
There are two steps in the process of determining if a speech
segment is voiced or unvoiced. The first step is to look at the
amplitude of the signal, also known as the energy in the
segment. If the amplitude levels are large then the segment is
classified as voiced and if they are small then the segment is
considered unvoiced. This
determination requires a
preconceived notion about the range of amplitude values and
energy levels associated with the two types of sound.
Voiced speech segments have large amplitudes, unvoiced
Spectrum of Voiced and Unvoiced
Sounds

Voiced Sound
‘E’

Unvoiced Sound ‘S’
 Determining if a segment is a voiced or unvoiced sound is not all









of the information that is needed by the LPC decoder to
accurately reproduce a speech signal. In order to produce an
input signal for the LPC filter the decoder also needs another
attribute of the current speech segment known as the pitch
period.
The period for any wave, including speech signals, can be
defined as the time required for one wave cycle to completely
pass a fixed position. For speech signals, the pitch period can be
thought of as the period of the vocal cord vibration that occurs
during the production of voiced speech.
Therefore, the pitch period is only needed for the decoding of
voiced segments and is not required for unvoiced segments
since they are produced by turbulent air flow not vocal cord
vibrations.
It is very computationally intensive to determine the pitch period
for a given segment of speech. There are several different types
of algorithms that could be used.
LPC-10 does not use an algorithm with autocorrelation, instead it
uses an algorithm called average magnitude difference function
 An advantage of the AMDF function is that it can be used to

determine if a sample is voiced or unvoiced. When the AMDF
function is applied to an unvoiced signal, the difference
between the minimum and the average values is very small
compared to voiced signals.
 For unvoiced segments the AMDF function we also have a
minimum when P equals the pitch period however, any
additional minimums that are obtained will be very close to the
average value. This means that these minimums will not be
very deep

AMDF for ‘E’

AMDF for ‘S’
LPC: Articulation
 The filter that is used by the decoder to recreate the original input signal

is created based on a set of coefficients. These coefficients are
extracted from the original signal during encoding and are transmitted to
the receiver for use in decoding. Each speech segment has different
filter coefficients or parameters that it uses to recreate the original
sound. Not only are the parameters themselves different from segment
to segment, but the number of parameters differ from voiced to unvoiced
segment. Voiced segments use 10 parameters to build the filter while
unvoiced sounds use only 4 parameters.
 A filter with n parameters is referred to as an nth order filter. In order to
find the filter coefficients that best match the current segment being
analysed the encoder attempts to minimize the mean squared error. The
mean squared error is expressed as:

 where {yn} is the set of speech samples for the current segment and {ai}

is the set of coefficients. In order to provide the most accurate
coefficients, {ai} is chosen to minimize the average value of en² for all
samples in the segment.
 In order to solve for the filter coefficients E[yn-iyn-j] has to be estimate.
 In an uncompressed form, speech is usually transmitted at

64,000 bits/second using 8 bits/sample and a rate of 8 kHz for
sampling. LPC reduces this rate to 2,400 bits/second by
breaking the speech into segments and then sending the
voiced/unvoiced information, the pitch period, and the
coefficients for the filter that represents the vocal tract for each
segment.
 The input signal used by the filter on the receiver end is
determined by the classification of the speech segment as
voiced or unvoiced and by the pitch period of the segment. The
encoder sends a single bit to tell if the current segment is voiced
or unvoiced. The pitch period is quantized using a logcompanded quantizer to one of 60 possible values. 6 bits are
required to represent the pitch period.
 If the segment contains voiced speech than a 10th order filter is
used. This means that 11 values are needed: 10 reflection
coefficients and the gain.
 If the segment contains unvoiced speech than a 4th order filter is
used. This means that 5 values are needed: 4 reflection
coefficients and the gain. The reflection coefficients are denote
kn where 1 # n # 10 for voiced speech filters and 1 # n # 4 for
The 54-bit Frame












1 bit voiced/unvoiced
6 bits pitch period (60 values)
10 bits k1 and k2 (5 each)
10 bits k3 and k4 (5 each)
16 bits k5, k6, k7, k8 (4 each)
3 bits k9
2 bits k10
5 bits gain G
1 bit synchronization
54 bits TOTAL BITS PER FRAME
Where k1…..k10 are the 10 reflection coefficients
of the filter
Verification of Bit rate for LPC
 Sample rate = 8000 samples/second
 Samples per segment = 180 samples/segment

 Segment rate = Sample Rate/ Samples per








Segment
= (8000 samples/second)/(180 samples/second)
= 44.444444.... segments/second
Segment size = 54 bits/segment
Bit rate = Segment size * Segment rate
= (54 bits/segment) * (44.44 segments/second)
= 2400 bits/second
LPC Synthesis/Decoding
 The process of decoding a sequence of speech segments is the









reverse of the encoding process. Each segment is decoded
individually and the sequence of reproduced sound segments is
joined together to represent the entire input speech signal. The
decoding or synthesis of a speech segment is based on the 54
bits of information that are transmitted from the encoder.
The speech signal is declared voiced or unvoiced based on the
voiced/unvoiced determination bit. The decoder needs to know
what type of signal the segment contains in order to determine
what type of excitement signal will be given to the LPC filter.
For voiced segments a pulse is used as the excitement signal.
This pulse consists of 40 samples and is locally stored by the
decoder.
For unvoiced segments white noise produced by a
pseudorandom number generator is used as the input for the
filter.
The pitch period for voiced segments is then used to determine
whether the 40 sample pulse needs to be truncated or extended.
If the pulse needs to be extended it is padded with zeros since
the definition of a pulse said that it travels through an
 Each segment of speech has a different LPC filter that is

eventually produced using the reflection coefficients and the
gain that are received from the encoder.
 10 reflection coefficients are used for voiced segment filters
and 4 reflection coefficients are used for unvoiced segments.
These reflection coefficients are used to generate the vocal
tract coefficients or parameters which are used to create the
filter.
 The final step of decoding a segment of speech is to pass the
excitement signal through the filter to produce the synthesized
speech signal.

LPC Decoder
Thank You

Contenu connexe

Tendances

Adjacent channel interference in wireless .
Adjacent channel interference in wireless .Adjacent channel interference in wireless .
Adjacent channel interference in wireless .Mahmood Showrav
 
Path Loss and Shadowing
Path Loss and ShadowingPath Loss and Shadowing
Path Loss and ShadowingYash Gupta
 
Sonet (synchronous optical networking )
Sonet (synchronous optical networking )Sonet (synchronous optical networking )
Sonet (synchronous optical networking )Hamza Sajjad
 
Eye diagram in Communication
Eye diagram in CommunicationEye diagram in Communication
Eye diagram in CommunicationSivanesh M
 
Power delay profile,delay spread and doppler spread
Power delay profile,delay spread and doppler spreadPower delay profile,delay spread and doppler spread
Power delay profile,delay spread and doppler spreadManish Srivastava
 
Ec 2401 wireless communication unit 4
Ec 2401 wireless communication   unit 4Ec 2401 wireless communication   unit 4
Ec 2401 wireless communication unit 4JAIGANESH SEKAR
 
Digital modeling of speech signal
Digital modeling of speech signalDigital modeling of speech signal
Digital modeling of speech signalVinodhini
 
Diversity Techniques in Wireless Communication
Diversity Techniques in Wireless CommunicationDiversity Techniques in Wireless Communication
Diversity Techniques in Wireless CommunicationSahar Foroughi
 

Tendances (20)

Adaptive filter
Adaptive filterAdaptive filter
Adaptive filter
 
Adjacent channel interference in wireless .
Adjacent channel interference in wireless .Adjacent channel interference in wireless .
Adjacent channel interference in wireless .
 
Equalization
EqualizationEqualization
Equalization
 
Spread spectrum
Spread spectrumSpread spectrum
Spread spectrum
 
Introduction to Adaptive filters
Introduction to Adaptive filtersIntroduction to Adaptive filters
Introduction to Adaptive filters
 
SPEECH CODING
SPEECH CODINGSPEECH CODING
SPEECH CODING
 
Line coding
Line codingLine coding
Line coding
 
Path Loss and Shadowing
Path Loss and ShadowingPath Loss and Shadowing
Path Loss and Shadowing
 
Fading Seminar
Fading SeminarFading Seminar
Fading Seminar
 
Design of Filters PPT
Design of Filters PPTDesign of Filters PPT
Design of Filters PPT
 
Sonet (synchronous optical networking )
Sonet (synchronous optical networking )Sonet (synchronous optical networking )
Sonet (synchronous optical networking )
 
Adaptive equalization
Adaptive equalizationAdaptive equalization
Adaptive equalization
 
Chebyshev filter
Chebyshev filterChebyshev filter
Chebyshev filter
 
Eye diagram in Communication
Eye diagram in CommunicationEye diagram in Communication
Eye diagram in Communication
 
Power delay profile,delay spread and doppler spread
Power delay profile,delay spread and doppler spreadPower delay profile,delay spread and doppler spread
Power delay profile,delay spread and doppler spread
 
Ec 2401 wireless communication unit 4
Ec 2401 wireless communication   unit 4Ec 2401 wireless communication   unit 4
Ec 2401 wireless communication unit 4
 
Digital modeling of speech signal
Digital modeling of speech signalDigital modeling of speech signal
Digital modeling of speech signal
 
Diversity Techniques in Wireless Communication
Diversity Techniques in Wireless CommunicationDiversity Techniques in Wireless Communication
Diversity Techniques in Wireless Communication
 
Final ppt
Final pptFinal ppt
Final ppt
 
Multirate DSP
Multirate DSPMultirate DSP
Multirate DSP
 

En vedette

LPC for Speech Recognition
LPC for Speech RecognitionLPC for Speech Recognition
LPC for Speech RecognitionDr. Uday Saikia
 
Linear predictive coding documentation
Linear predictive coding  documentationLinear predictive coding  documentation
Linear predictive coding documentationchakravarthy Gopi
 
Linear prediction
Linear predictionLinear prediction
Linear predictionUma Rajaram
 
Dpcm ( Differential Pulse Code Modulation )
Dpcm ( Differential Pulse Code Modulation )Dpcm ( Differential Pulse Code Modulation )
Dpcm ( Differential Pulse Code Modulation )Mohammed Abdullah
 
Samen geestelijk gezond jan van speybroeck
Samen geestelijk gezond   jan van speybroeckSamen geestelijk gezond   jan van speybroeck
Samen geestelijk gezond jan van speybroeckUPoliteia
 
Samen geestelijk gezond joris housen - samenwerking shm en ggz in netwerk s...
Samen geestelijk gezond   joris housen - samenwerking shm en ggz in netwerk s...Samen geestelijk gezond   joris housen - samenwerking shm en ggz in netwerk s...
Samen geestelijk gezond joris housen - samenwerking shm en ggz in netwerk s...UPoliteia
 
Bart van Steenberghe - Getting things done & Outlook
Bart van Steenberghe - Getting things done & OutlookBart van Steenberghe - Getting things done & Outlook
Bart van Steenberghe - Getting things done & OutlookUPoliteia
 
Businessman with Heart: Visual Resume
Businessman with Heart: Visual ResumeBusinessman with Heart: Visual Resume
Businessman with Heart: Visual ResumeJoshua Green
 
Samen geestelijk gezond katrien schryvers - vermaatschappelijking van zorg
Samen geestelijk gezond   katrien schryvers - vermaatschappelijking van zorgSamen geestelijk gezond   katrien schryvers - vermaatschappelijking van zorg
Samen geestelijk gezond katrien schryvers - vermaatschappelijking van zorgUPoliteia
 
1 paul putteman - algemeen kader
1   paul putteman - algemeen kader1   paul putteman - algemeen kader
1 paul putteman - algemeen kaderUPoliteia
 
Survey magazine
Survey magazineSurvey magazine
Survey magazinelduah
 
Laurelle prelim1
Laurelle prelim1Laurelle prelim1
Laurelle prelim1lduah
 
Question 3
Question 3Question 3
Question 3lduah
 
4 carla verbandt - dreigingsanalyse
4   carla verbandt - dreigingsanalyse4   carla verbandt - dreigingsanalyse
4 carla verbandt - dreigingsanalyseUPoliteia
 
IMGENT - Marc van Daele en Lie Lauwers
IMGENT - Marc van Daele en Lie LauwersIMGENT - Marc van Daele en Lie Lauwers
IMGENT - Marc van Daele en Lie LauwersUPoliteia
 
Port of Antwerp services for Brazil - Presentation @ Belgian Day in Minais Ge...
Port of Antwerp services for Brazil - Presentation @ Belgian Day in Minais Ge...Port of Antwerp services for Brazil - Presentation @ Belgian Day in Minais Ge...
Port of Antwerp services for Brazil - Presentation @ Belgian Day in Minais Ge...Port of Antwerp
 
Dinobusters - Sssst, hier werkt men boeklancering
Dinobusters -  Sssst, hier werkt men boeklanceringDinobusters -  Sssst, hier werkt men boeklancering
Dinobusters - Sssst, hier werkt men boeklanceringUPoliteia
 
Question 1- Laurelle Duah-Nantwi
Question 1- Laurelle Duah-NantwiQuestion 1- Laurelle Duah-Nantwi
Question 1- Laurelle Duah-Nantwilduah
 
Samen geestelijk gezond marlien de coen - mensen met psychische problemen a...
Samen geestelijk gezond   marlien de coen - mensen met psychische problemen a...Samen geestelijk gezond   marlien de coen - mensen met psychische problemen a...
Samen geestelijk gezond marlien de coen - mensen met psychische problemen a...UPoliteia
 
Samen geestelijk gezond hans meganck - depressief leuven
Samen geestelijk gezond   hans meganck - depressief leuvenSamen geestelijk gezond   hans meganck - depressief leuven
Samen geestelijk gezond hans meganck - depressief leuvenUPoliteia
 

En vedette (20)

LPC for Speech Recognition
LPC for Speech RecognitionLPC for Speech Recognition
LPC for Speech Recognition
 
Linear predictive coding documentation
Linear predictive coding  documentationLinear predictive coding  documentation
Linear predictive coding documentation
 
Linear prediction
Linear predictionLinear prediction
Linear prediction
 
Dpcm ( Differential Pulse Code Modulation )
Dpcm ( Differential Pulse Code Modulation )Dpcm ( Differential Pulse Code Modulation )
Dpcm ( Differential Pulse Code Modulation )
 
Samen geestelijk gezond jan van speybroeck
Samen geestelijk gezond   jan van speybroeckSamen geestelijk gezond   jan van speybroeck
Samen geestelijk gezond jan van speybroeck
 
Samen geestelijk gezond joris housen - samenwerking shm en ggz in netwerk s...
Samen geestelijk gezond   joris housen - samenwerking shm en ggz in netwerk s...Samen geestelijk gezond   joris housen - samenwerking shm en ggz in netwerk s...
Samen geestelijk gezond joris housen - samenwerking shm en ggz in netwerk s...
 
Bart van Steenberghe - Getting things done & Outlook
Bart van Steenberghe - Getting things done & OutlookBart van Steenberghe - Getting things done & Outlook
Bart van Steenberghe - Getting things done & Outlook
 
Businessman with Heart: Visual Resume
Businessman with Heart: Visual ResumeBusinessman with Heart: Visual Resume
Businessman with Heart: Visual Resume
 
Samen geestelijk gezond katrien schryvers - vermaatschappelijking van zorg
Samen geestelijk gezond   katrien schryvers - vermaatschappelijking van zorgSamen geestelijk gezond   katrien schryvers - vermaatschappelijking van zorg
Samen geestelijk gezond katrien schryvers - vermaatschappelijking van zorg
 
1 paul putteman - algemeen kader
1   paul putteman - algemeen kader1   paul putteman - algemeen kader
1 paul putteman - algemeen kader
 
Survey magazine
Survey magazineSurvey magazine
Survey magazine
 
Laurelle prelim1
Laurelle prelim1Laurelle prelim1
Laurelle prelim1
 
Question 3
Question 3Question 3
Question 3
 
4 carla verbandt - dreigingsanalyse
4   carla verbandt - dreigingsanalyse4   carla verbandt - dreigingsanalyse
4 carla verbandt - dreigingsanalyse
 
IMGENT - Marc van Daele en Lie Lauwers
IMGENT - Marc van Daele en Lie LauwersIMGENT - Marc van Daele en Lie Lauwers
IMGENT - Marc van Daele en Lie Lauwers
 
Port of Antwerp services for Brazil - Presentation @ Belgian Day in Minais Ge...
Port of Antwerp services for Brazil - Presentation @ Belgian Day in Minais Ge...Port of Antwerp services for Brazil - Presentation @ Belgian Day in Minais Ge...
Port of Antwerp services for Brazil - Presentation @ Belgian Day in Minais Ge...
 
Dinobusters - Sssst, hier werkt men boeklancering
Dinobusters -  Sssst, hier werkt men boeklanceringDinobusters -  Sssst, hier werkt men boeklancering
Dinobusters - Sssst, hier werkt men boeklancering
 
Question 1- Laurelle Duah-Nantwi
Question 1- Laurelle Duah-NantwiQuestion 1- Laurelle Duah-Nantwi
Question 1- Laurelle Duah-Nantwi
 
Samen geestelijk gezond marlien de coen - mensen met psychische problemen a...
Samen geestelijk gezond   marlien de coen - mensen met psychische problemen a...Samen geestelijk gezond   marlien de coen - mensen met psychische problemen a...
Samen geestelijk gezond marlien de coen - mensen met psychische problemen a...
 
Samen geestelijk gezond hans meganck - depressief leuven
Samen geestelijk gezond   hans meganck - depressief leuvenSamen geestelijk gezond   hans meganck - depressief leuven
Samen geestelijk gezond hans meganck - depressief leuven
 

Similaire à Linear Predictive Coding

Speech Compression using LPC
Speech Compression using LPCSpeech Compression using LPC
Speech Compression using LPCDisha Modi
 
DDSP_2018_FOEHU - Lec 10 - Digital Signal Processing Applications
DDSP_2018_FOEHU - Lec 10 - Digital Signal Processing ApplicationsDDSP_2018_FOEHU - Lec 10 - Digital Signal Processing Applications
DDSP_2018_FOEHU - Lec 10 - Digital Signal Processing ApplicationsAmr E. Mohamed
 
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications IDSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications IAmr E. Mohamed
 
Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderIJTET Journal
 
Speech Compression using LPC
Speech Compression using LPCSpeech Compression using LPC
Speech Compression using LPCDisha Modi
 
Analysis of PEAQ Model using Wavelet Decomposition Techniques
Analysis of PEAQ Model using Wavelet Decomposition TechniquesAnalysis of PEAQ Model using Wavelet Decomposition Techniques
Analysis of PEAQ Model using Wavelet Decomposition Techniquesidescitation
 
Audio and video compression
Audio and video compressionAudio and video compression
Audio and video compressionneeraj9217
 
IRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time DomainIRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time DomainIRJET Journal
 
Lpc vocoder implemented by using matlab
Lpc vocoder implemented by using matlabLpc vocoder implemented by using matlab
Lpc vocoder implemented by using matlabchakravarthy Gopi
 
NTSC Software Decoding Presentation
NTSC Software Decoding PresentationNTSC Software Decoding Presentation
NTSC Software Decoding PresentationPrateek Dayal
 
Voice morphing document
Voice morphing documentVoice morphing document
Voice morphing documenthimadrigupta
 
Multimedia Compression and Communication
Multimedia Compression and CommunicationMultimedia Compression and Communication
Multimedia Compression and CommunicationBenesh Selvanesan
 
A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...
A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...
A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...IRJET Journal
 
Audio compression 1
Audio compression 1Audio compression 1
Audio compression 1Rajat Kumar
 

Similaire à Linear Predictive Coding (20)

G010424248
G010424248G010424248
G010424248
 
Speech Compression using LPC
Speech Compression using LPCSpeech Compression using LPC
Speech Compression using LPC
 
DDSP_2018_FOEHU - Lec 10 - Digital Signal Processing Applications
DDSP_2018_FOEHU - Lec 10 - Digital Signal Processing ApplicationsDDSP_2018_FOEHU - Lec 10 - Digital Signal Processing Applications
DDSP_2018_FOEHU - Lec 10 - Digital Signal Processing Applications
 
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications IDSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
 
Speech encoding techniques
Speech encoding techniquesSpeech encoding techniques
Speech encoding techniques
 
Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using Vocoder
 
H0814247
H0814247H0814247
H0814247
 
Speech coding std
Speech coding stdSpeech coding std
Speech coding std
 
An Introduction To Speech Recognition
An Introduction To Speech RecognitionAn Introduction To Speech Recognition
An Introduction To Speech Recognition
 
Speech Compression using LPC
Speech Compression using LPCSpeech Compression using LPC
Speech Compression using LPC
 
Analysis of PEAQ Model using Wavelet Decomposition Techniques
Analysis of PEAQ Model using Wavelet Decomposition TechniquesAnalysis of PEAQ Model using Wavelet Decomposition Techniques
Analysis of PEAQ Model using Wavelet Decomposition Techniques
 
Audio and video compression
Audio and video compressionAudio and video compression
Audio and video compression
 
IRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time DomainIRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time Domain
 
Data Compression
Data CompressionData Compression
Data Compression
 
Lpc vocoder implemented by using matlab
Lpc vocoder implemented by using matlabLpc vocoder implemented by using matlab
Lpc vocoder implemented by using matlab
 
NTSC Software Decoding Presentation
NTSC Software Decoding PresentationNTSC Software Decoding Presentation
NTSC Software Decoding Presentation
 
Voice morphing document
Voice morphing documentVoice morphing document
Voice morphing document
 
Multimedia Compression and Communication
Multimedia Compression and CommunicationMultimedia Compression and Communication
Multimedia Compression and Communication
 
A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...
A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...
A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...
 
Audio compression 1
Audio compression 1Audio compression 1
Audio compression 1
 

Plus de Shruti Bhatnagar Dasgupta (12)

Linear equations in Two variables
Linear equations in Two variablesLinear equations in Two variables
Linear equations in Two variables
 
Forces and its types
Forces and its typesForces and its types
Forces and its types
 
Plants and their classification
Plants and their classificationPlants and their classification
Plants and their classification
 
HCF and LCM
HCF and LCMHCF and LCM
HCF and LCM
 
Noise cancellation and supression
Noise cancellation and supressionNoise cancellation and supression
Noise cancellation and supression
 
Smart phone
Smart phoneSmart phone
Smart phone
 
PLC LADDER DIAGRAM
PLC LADDER DIAGRAMPLC LADDER DIAGRAM
PLC LADDER DIAGRAM
 
Circuit theory basics
Circuit theory basicsCircuit theory basics
Circuit theory basics
 
Programmable logic controller
Programmable logic controllerProgrammable logic controller
Programmable logic controller
 
Green radio (final)
Green radio (final)Green radio (final)
Green radio (final)
 
Knowledge management (2)
Knowledge management (2)Knowledge management (2)
Knowledge management (2)
 
4 g technology
4 g technology4 g technology
4 g technology
 

Dernier

High Voltage Engineering- OVER VOLTAGES IN ELECTRICAL POWER SYSTEMS
High Voltage Engineering- OVER VOLTAGES IN ELECTRICAL POWER SYSTEMSHigh Voltage Engineering- OVER VOLTAGES IN ELECTRICAL POWER SYSTEMS
High Voltage Engineering- OVER VOLTAGES IN ELECTRICAL POWER SYSTEMSsandhya757531
 
Energy Awareness training ppt for manufacturing process.pptx
Energy Awareness training ppt for manufacturing process.pptxEnergy Awareness training ppt for manufacturing process.pptx
Energy Awareness training ppt for manufacturing process.pptxsiddharthjain2303
 
Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...
Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...
Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...Sumanth A
 
Engineering Drawing section of solid
Engineering Drawing     section of solidEngineering Drawing     section of solid
Engineering Drawing section of solidnamansinghjarodiya
 
TEST CASE GENERATION GENERATION BLOCK BOX APPROACH
TEST CASE GENERATION GENERATION BLOCK BOX APPROACHTEST CASE GENERATION GENERATION BLOCK BOX APPROACH
TEST CASE GENERATION GENERATION BLOCK BOX APPROACHSneha Padhiar
 
Main Memory Management in Operating System
Main Memory Management in Operating SystemMain Memory Management in Operating System
Main Memory Management in Operating SystemRashmi Bhat
 
Immutable Image-Based Operating Systems - EW2024.pdf
Immutable Image-Based Operating Systems - EW2024.pdfImmutable Image-Based Operating Systems - EW2024.pdf
Immutable Image-Based Operating Systems - EW2024.pdfDrew Moseley
 
Ch10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdfCh10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdfChristianCDAM
 
Robotics Group 10 (Control Schemes) cse.pdf
Robotics Group 10  (Control Schemes) cse.pdfRobotics Group 10  (Control Schemes) cse.pdf
Robotics Group 10 (Control Schemes) cse.pdfsahilsajad201
 
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTIONTHE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTIONjhunlian
 
Novel 3D-Printed Soft Linear and Bending Actuators
Novel 3D-Printed Soft Linear and Bending ActuatorsNovel 3D-Printed Soft Linear and Bending Actuators
Novel 3D-Printed Soft Linear and Bending ActuatorsResearcher Researcher
 
signals in triangulation .. ...Surveying
signals in triangulation .. ...Surveyingsignals in triangulation .. ...Surveying
signals in triangulation .. ...Surveyingsapna80328
 
FUNCTIONAL AND NON FUNCTIONAL REQUIREMENT
FUNCTIONAL AND NON FUNCTIONAL REQUIREMENTFUNCTIONAL AND NON FUNCTIONAL REQUIREMENT
FUNCTIONAL AND NON FUNCTIONAL REQUIREMENTSneha Padhiar
 
Turn leadership mistakes into a better future.pptx
Turn leadership mistakes into a better future.pptxTurn leadership mistakes into a better future.pptx
Turn leadership mistakes into a better future.pptxStephen Sitton
 
multiple access in wireless communication
multiple access in wireless communicationmultiple access in wireless communication
multiple access in wireless communicationpanditadesh123
 
『澳洲文凭』买麦考瑞大学毕业证书成绩单办理澳洲Macquarie文凭学位证书
『澳洲文凭』买麦考瑞大学毕业证书成绩单办理澳洲Macquarie文凭学位证书『澳洲文凭』买麦考瑞大学毕业证书成绩单办理澳洲Macquarie文凭学位证书
『澳洲文凭』买麦考瑞大学毕业证书成绩单办理澳洲Macquarie文凭学位证书rnrncn29
 
CS 3251 Programming in c all unit notes pdf
CS 3251 Programming in c all unit notes pdfCS 3251 Programming in c all unit notes pdf
CS 3251 Programming in c all unit notes pdfBalamuruganV28
 
SOFTWARE ESTIMATION COCOMO AND FP CALCULATION
SOFTWARE ESTIMATION COCOMO AND FP CALCULATIONSOFTWARE ESTIMATION COCOMO AND FP CALCULATION
SOFTWARE ESTIMATION COCOMO AND FP CALCULATIONSneha Padhiar
 
Input Output Management in Operating System
Input Output Management in Operating SystemInput Output Management in Operating System
Input Output Management in Operating SystemRashmi Bhat
 
Artificial Intelligence in Power System overview
Artificial Intelligence in Power System overviewArtificial Intelligence in Power System overview
Artificial Intelligence in Power System overviewsandhya757531
 

Dernier (20)

High Voltage Engineering- OVER VOLTAGES IN ELECTRICAL POWER SYSTEMS
High Voltage Engineering- OVER VOLTAGES IN ELECTRICAL POWER SYSTEMSHigh Voltage Engineering- OVER VOLTAGES IN ELECTRICAL POWER SYSTEMS
High Voltage Engineering- OVER VOLTAGES IN ELECTRICAL POWER SYSTEMS
 
Energy Awareness training ppt for manufacturing process.pptx
Energy Awareness training ppt for manufacturing process.pptxEnergy Awareness training ppt for manufacturing process.pptx
Energy Awareness training ppt for manufacturing process.pptx
 
Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...
Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...
Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...
 
Engineering Drawing section of solid
Engineering Drawing     section of solidEngineering Drawing     section of solid
Engineering Drawing section of solid
 
TEST CASE GENERATION GENERATION BLOCK BOX APPROACH
TEST CASE GENERATION GENERATION BLOCK BOX APPROACHTEST CASE GENERATION GENERATION BLOCK BOX APPROACH
TEST CASE GENERATION GENERATION BLOCK BOX APPROACH
 
Main Memory Management in Operating System
Main Memory Management in Operating SystemMain Memory Management in Operating System
Main Memory Management in Operating System
 
Immutable Image-Based Operating Systems - EW2024.pdf
Immutable Image-Based Operating Systems - EW2024.pdfImmutable Image-Based Operating Systems - EW2024.pdf
Immutable Image-Based Operating Systems - EW2024.pdf
 
Ch10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdfCh10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdf
 
Robotics Group 10 (Control Schemes) cse.pdf
Robotics Group 10  (Control Schemes) cse.pdfRobotics Group 10  (Control Schemes) cse.pdf
Robotics Group 10 (Control Schemes) cse.pdf
 
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTIONTHE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
 
Novel 3D-Printed Soft Linear and Bending Actuators
Novel 3D-Printed Soft Linear and Bending ActuatorsNovel 3D-Printed Soft Linear and Bending Actuators
Novel 3D-Printed Soft Linear and Bending Actuators
 
signals in triangulation .. ...Surveying
signals in triangulation .. ...Surveyingsignals in triangulation .. ...Surveying
signals in triangulation .. ...Surveying
 
FUNCTIONAL AND NON FUNCTIONAL REQUIREMENT
FUNCTIONAL AND NON FUNCTIONAL REQUIREMENTFUNCTIONAL AND NON FUNCTIONAL REQUIREMENT
FUNCTIONAL AND NON FUNCTIONAL REQUIREMENT
 
Turn leadership mistakes into a better future.pptx
Turn leadership mistakes into a better future.pptxTurn leadership mistakes into a better future.pptx
Turn leadership mistakes into a better future.pptx
 
multiple access in wireless communication
multiple access in wireless communicationmultiple access in wireless communication
multiple access in wireless communication
 
『澳洲文凭』买麦考瑞大学毕业证书成绩单办理澳洲Macquarie文凭学位证书
『澳洲文凭』买麦考瑞大学毕业证书成绩单办理澳洲Macquarie文凭学位证书『澳洲文凭』买麦考瑞大学毕业证书成绩单办理澳洲Macquarie文凭学位证书
『澳洲文凭』买麦考瑞大学毕业证书成绩单办理澳洲Macquarie文凭学位证书
 
CS 3251 Programming in c all unit notes pdf
CS 3251 Programming in c all unit notes pdfCS 3251 Programming in c all unit notes pdf
CS 3251 Programming in c all unit notes pdf
 
SOFTWARE ESTIMATION COCOMO AND FP CALCULATION
SOFTWARE ESTIMATION COCOMO AND FP CALCULATIONSOFTWARE ESTIMATION COCOMO AND FP CALCULATION
SOFTWARE ESTIMATION COCOMO AND FP CALCULATION
 
Input Output Management in Operating System
Input Output Management in Operating SystemInput Output Management in Operating System
Input Output Management in Operating System
 
Artificial Intelligence in Power System overview
Artificial Intelligence in Power System overviewArtificial Intelligence in Power System overview
Artificial Intelligence in Power System overview
 

Linear Predictive Coding

  • 2. Introduction  Linear Predictive Coding (LPC) is one of the most powerful speech analysis techniques, and one of the most useful methods for encoding good quality speech at a low bit rate. It provides extremely accurate estimates of speech parameters, and is relatively efficient for computation.  The most important aspect of LPC is the linear predictive filter which allows the value of the next sample to be determined by a linear combination of previous samples.  This provides a rate of 64000 bits/second. Linear predictive coding reduces this to 2400 bits/second. At this reduced rate the speech has a distinctive synthetic sound and there is a noticeable loss of quality. However, the speech is still audible and it can still be easily understood. Since there is information loss in linear predictive coding, it is a lossy form of compression.
  • 3.  LPC starts with the assumption that the speech signal is produced by a buzzer at the end of a tube. The glottis (the space between the vocal cords) produces the buzz, which is characterized by its intensity (loudness) and frequency (pitch). The vocal tract (the throat and mouth) forms the tube, which is characterized by its resonances, which are called formants.  LPC analyzes the speech signal by estimating the formants, removing their effects from the speech signal, and estimating the intensity and frequency of the remaining buzz. The process of removing the formants is called inverse filtering, and the remaining signal is called the residue.  The numbers which describe the formants and the residue can be stored or transmitted somewhere else. LPC synthesizes the speech signal by reversing the process: use the residue to create a source signal, use the formants to create a filter (which represents the tube), and run the source through the filter, resulting in speech.  Because speech signals vary with time, this process is done on short chunks of the speech signal, which are called frames. Usually 30 to 50 frames per second give intelligible speech with good compression.
  • 4. LPC: Vocoder  It has two key components: analysis or encoding and synthesis or decoding. The analysis part of LPC involves examining the speech signal and breaking it down into segments or blocks. Each segment is than examined further to find the answers to several key questions: • Is the segment voiced or unvoiced? • What is the pitch of the segment? • What parameters are needed to build a filter that models the vocal tract for the current segment?  LPC analysis is usually conducted by a sender who answers these questions and usually transmits these answers onto a receiver.  The receiver performs LPC synthesis by using the answers received to build a filter that when provided the correct input source will be able to accurately reproduce the original speech signal. Essentially, LPC synthesis tries to imitate human speech production.
  • 5. LPC : Vocoder Figure
  • 6. Human vs LPC Speech Production
  • 7. LPC-10  According to government standard 1014, also known as LPC-10, the input signal is sampled at a rate of 8000 samples per second.  This input signal is then broken up into segments or blocks which are each analysed and transmitted to the receiver.  The 8000 samples in each second of speech signal are broken into 180 sample segments. This means that each segment represents 22.5 milliseconds of the input speech signal.
  • 8. LPC : Excitation  Before a speech segment is determined as being voiced or     unvoiced it is first passed through a low-pass filter with a bandwidth of 1 kHz. Determining if a segment is voiced or unvoiced is important because voiced sounds have a different waveform then unvoiced sounds. The differences in the two waveforms creates a need for the use of two different input signals for the LPC filter in the synthesis or decoding. One input signal is for voiced sounds and the other is for unvoiced. The LPC encoder notifies the decoder if a signal segment is voiced or unvoiced by sending a single bit. There are two steps in the process of determining if a speech segment is voiced or unvoiced. The first step is to look at the amplitude of the signal, also known as the energy in the segment. If the amplitude levels are large then the segment is classified as voiced and if they are small then the segment is considered unvoiced. This determination requires a preconceived notion about the range of amplitude values and energy levels associated with the two types of sound. Voiced speech segments have large amplitudes, unvoiced
  • 9. Spectrum of Voiced and Unvoiced Sounds Voiced Sound ‘E’ Unvoiced Sound ‘S’
  • 10.  Determining if a segment is a voiced or unvoiced sound is not all     of the information that is needed by the LPC decoder to accurately reproduce a speech signal. In order to produce an input signal for the LPC filter the decoder also needs another attribute of the current speech segment known as the pitch period. The period for any wave, including speech signals, can be defined as the time required for one wave cycle to completely pass a fixed position. For speech signals, the pitch period can be thought of as the period of the vocal cord vibration that occurs during the production of voiced speech. Therefore, the pitch period is only needed for the decoding of voiced segments and is not required for unvoiced segments since they are produced by turbulent air flow not vocal cord vibrations. It is very computationally intensive to determine the pitch period for a given segment of speech. There are several different types of algorithms that could be used. LPC-10 does not use an algorithm with autocorrelation, instead it uses an algorithm called average magnitude difference function
  • 11.  An advantage of the AMDF function is that it can be used to determine if a sample is voiced or unvoiced. When the AMDF function is applied to an unvoiced signal, the difference between the minimum and the average values is very small compared to voiced signals.  For unvoiced segments the AMDF function we also have a minimum when P equals the pitch period however, any additional minimums that are obtained will be very close to the average value. This means that these minimums will not be very deep AMDF for ‘E’ AMDF for ‘S’
  • 12. LPC: Articulation  The filter that is used by the decoder to recreate the original input signal is created based on a set of coefficients. These coefficients are extracted from the original signal during encoding and are transmitted to the receiver for use in decoding. Each speech segment has different filter coefficients or parameters that it uses to recreate the original sound. Not only are the parameters themselves different from segment to segment, but the number of parameters differ from voiced to unvoiced segment. Voiced segments use 10 parameters to build the filter while unvoiced sounds use only 4 parameters.  A filter with n parameters is referred to as an nth order filter. In order to find the filter coefficients that best match the current segment being analysed the encoder attempts to minimize the mean squared error. The mean squared error is expressed as:  where {yn} is the set of speech samples for the current segment and {ai} is the set of coefficients. In order to provide the most accurate coefficients, {ai} is chosen to minimize the average value of en² for all samples in the segment.  In order to solve for the filter coefficients E[yn-iyn-j] has to be estimate.
  • 13.  In an uncompressed form, speech is usually transmitted at 64,000 bits/second using 8 bits/sample and a rate of 8 kHz for sampling. LPC reduces this rate to 2,400 bits/second by breaking the speech into segments and then sending the voiced/unvoiced information, the pitch period, and the coefficients for the filter that represents the vocal tract for each segment.  The input signal used by the filter on the receiver end is determined by the classification of the speech segment as voiced or unvoiced and by the pitch period of the segment. The encoder sends a single bit to tell if the current segment is voiced or unvoiced. The pitch period is quantized using a logcompanded quantizer to one of 60 possible values. 6 bits are required to represent the pitch period.  If the segment contains voiced speech than a 10th order filter is used. This means that 11 values are needed: 10 reflection coefficients and the gain.  If the segment contains unvoiced speech than a 4th order filter is used. This means that 5 values are needed: 4 reflection coefficients and the gain. The reflection coefficients are denote kn where 1 # n # 10 for voiced speech filters and 1 # n # 4 for
  • 14. The 54-bit Frame            1 bit voiced/unvoiced 6 bits pitch period (60 values) 10 bits k1 and k2 (5 each) 10 bits k3 and k4 (5 each) 16 bits k5, k6, k7, k8 (4 each) 3 bits k9 2 bits k10 5 bits gain G 1 bit synchronization 54 bits TOTAL BITS PER FRAME Where k1…..k10 are the 10 reflection coefficients of the filter
  • 15. Verification of Bit rate for LPC  Sample rate = 8000 samples/second  Samples per segment = 180 samples/segment  Segment rate = Sample Rate/ Samples per       Segment = (8000 samples/second)/(180 samples/second) = 44.444444.... segments/second Segment size = 54 bits/segment Bit rate = Segment size * Segment rate = (54 bits/segment) * (44.44 segments/second) = 2400 bits/second
  • 16. LPC Synthesis/Decoding  The process of decoding a sequence of speech segments is the     reverse of the encoding process. Each segment is decoded individually and the sequence of reproduced sound segments is joined together to represent the entire input speech signal. The decoding or synthesis of a speech segment is based on the 54 bits of information that are transmitted from the encoder. The speech signal is declared voiced or unvoiced based on the voiced/unvoiced determination bit. The decoder needs to know what type of signal the segment contains in order to determine what type of excitement signal will be given to the LPC filter. For voiced segments a pulse is used as the excitement signal. This pulse consists of 40 samples and is locally stored by the decoder. For unvoiced segments white noise produced by a pseudorandom number generator is used as the input for the filter. The pitch period for voiced segments is then used to determine whether the 40 sample pulse needs to be truncated or extended. If the pulse needs to be extended it is padded with zeros since the definition of a pulse said that it travels through an
  • 17.  Each segment of speech has a different LPC filter that is eventually produced using the reflection coefficients and the gain that are received from the encoder.  10 reflection coefficients are used for voiced segment filters and 4 reflection coefficients are used for unvoiced segments. These reflection coefficients are used to generate the vocal tract coefficients or parameters which are used to create the filter.  The final step of decoding a segment of speech is to pass the excitement signal through the filter to produce the synthesized speech signal. LPC Decoder