SlideShare une entreprise Scribd logo
1  sur  25
Methods and algorithms of speech recognition
                   course

             Nikolay V. Karpov

              nkarpov(а)hse.ru

                 Lection 2
   This lecture is concerned with the spectrogram as a
    representation of how the frequency components within a
    signal vary with time.
   Define what we mean by normalised time and frequency
   Define the short-term discrete Fourier transformaton
   Look at the effect of different window lengths on time and
    frequency resolution
   Derive a quantitative description of the frequency
    resolution of the short term DFT and compare the
    performance of common windows
   Derive a quantitative description of the time resolution of
    the short term DFT
   Explain the uncertainty principle and illustrate its effects
    with some examples
   Review some of the properties of the DFT
Sampling
theorem says
that when the
analog signal
x(t) is band-
restricted
between 0 and
W[Hz] and when
x(t) is sampled
at every T =
1/2W, the
original signal
can be
completely
reproduced by
For those signals the
frequency bandwidths of
which are not known, a
low-pass filter is used to
restrict the bandwidths
before sampling. When a
signal is sampled
contrary to the sampling
theorem, aliasing
distortion occurs, which
distorts the high-
frequency components of
the signal, as shown
Designate
           sin(2 wt )
  g (t )
              2 wt



                    i          i         i       i
  x(t )          x( ) g (t       )    x( )     x( )   x(iT )   x(i )
           i 1     2w         2w        2w       Fs

Simple continuous signal               x(t )
                                   A cos(2 ft )
                  i                        k
 x(t ) A cos(2 f ( ) ) A cos( i ) A cos(2     i )
                  Fs                       N
                                             1    k
Normalised radian frequency 0          2 f( ) 2     (1)                0.5
                                            Fs    N
                             Normalised time     i
With sampled data systems it is customary to change
the time scale and work in units of the sample period.
 In normalised units the sampling frequency (fs) and
  period (T) both equal 1. They can therefore be
  omitted from equations: any such omissions can be
  deduced using dimensional consistency arguments.
 The Nyquist frequency is ½ Hz = πradians/second
 To convert back to real units:
 ◦ any time quantity must be multiplied by the real sample
   period (or divided by the real sample frequency)
 ◦ any frequency or angular frequency quantity must be
   multiplied by the real sample frequency (or divided by the
   real sample period)
   The spectrogram shows the energy in a signal at each
      frequency and at each time. We calculate this by
      evaluating the short-term discrete Fourier transform
   Z- tramsform
                          n
X ( z)         x ( n) z
         n

          1
X ( z)             X ( z ) z n 1dz
         2 j   c
The Fourier transform of x(n) can be computed from the z-
transform as:
                                                           i n
    X( )          X ( z)      z e   i          x ( n )e
                                        n

The Fourier transform may be viewed as the z-transform evaluated
around the unit circle.
The Discrete Fourier Transform (DFT) is defined as a sampled
version of the Fourier shown above:
         N 1                                         N 1
                          i 2 kn / N
X (k )         x ( n )e                     x ( n)         X ( k )e i 2   kn / N

         n 0                                         n 0


The Fast Fourier Transform (FFT) is simply an efficient computation
of the DFT.
   The Discrete Cosine Transform (DCT) is simply a
    DFT of a signal that is assumed to be real and
    even (real and even signals have real spectra!)
                      N 1
    X (k )   x(0) 2         x(n) cos(2 kn / N ); k   0,1,..., N 1
                      n 0

    Note that these are not the only transforms used in
    speech processing
     Wavelets;
     Fractals;
     Wigner distributions;
     etc.
   We often want to estimate the “power
    spectrum” of a non-stationary signal at a
    particular instant of time.
   Multiply by a finite length window and take
    the DFT.
   For a window of length N ending on sample m
    we have:
                  N 1
                                       i 2 k (m n) / N
     X ( k ; m)         w(n) x(m n)e
                  n 0
                                 k-frequency, m-time
2
   X (k , m)       X (k , m) X * (k , m) gives the power at a frequency of
    k/N Hz (normalized) for a window centred at m-0.5(N-1).
   the (m-i) term in the exponent means that the phase origin
    remains consistent by cancelling out the linear phase shift
    introduced by a delay of m samples.
   the window samples are numbered backwards in time (for
    convenience later) hence the summation is performed backwards
    in time.
   The values X(k;m) are based on the N signal values from m–N+1
    to m.
   the frequency resolution is 1/N Hz (normalised units)
   the spectrum is periodic and (since w and x are real) conjugate
    symmetric:
     X (k )     X (N    k)   X *( k)
   there are only 0.5N+1independent frequency values
Sound «is»




Hamming window of length
401 samples centred on 400
Line spectrum from larynx oscillation (at about 0.01
normalised Hz) is superimposed on vocal tract resonances
   Wide window (N=400)
   Independent frequency
    values=N/2+1=201


   Short window eliminates the fine
    detail (N=30)
   N/2+1=16



   Zero padded window gives more
    spectrum points and an illusion of
    more detail (N=480=30×16). This is
    the normal case for a spectrogram
   N/2+1=241
N 1
X ( k ; m)         w(n) x(m n) exp   i 2 k ( m n) / N
             n 0
By setting r= N–1–i we can rewrite this as:
                  2 j (m N 1) N 1
X ( k ; m)   exp(               ) * ym (r ) exp 2 jkr / N
                        N          r 0

                        ym (r ) w( N 1 r ) x(m N 1 r )
This is a standard DFT multiplied by a phase-shift term that
is proportional to k: this compensates for the starting time
of the window: m–N+1

y(r) is a product of two signals so its DFT is the convolution
of the DFT’s of w(N–1–r) and x(m–N+1+r)
Rectangle window
w(n) 1
Slidelobe -13.3 dB
Mainlobe width(-3dB) 0.027
Leakage factor 9.14%
a=1.21, b=2


                       N
ck    cos(2k (n          ) / N)
                       2
Hamming window
w(n) 0.54 046 c1
Slidelobe -42.5 dB
Mainlobe width 0.039
Linkage factor 0.03%
a=1.81, b=4
Bartlett(L)
Blackman(L)
Rectwin(L)
Chebwin(L,R)
Hamming(L)
Hann(L)
Kaiser(L,beta)
taylorwin(L)
triang(L)
Спектр сигнала при использовании окна Блэкмана   Спектр сигнала при использовании окна Блэкмана -
                                                 Наттала
BW = 45 Hz
NT = 44 ms




BW = 300 Hz
NT = 7 ms
/aI/ from “my” with 45Hz and 300Hz bandwidth spectrograms




       45Hz, 44ms                 300Hz, 7ms
 Vertical slice through
  spectrogram:
 mT= 0.1s.
45 Hz gives finer frequency
resolution



 Horizontal slice through
  spectrogram:
 k/NT= 3.5 kHz
300 Hz gives finer time
resolution
You cannot get good time resolution and good
frequency resolution from the same spectrogram.
 Duration of window = N/fs=N*T.
 Frequency resolution = fs×a/N
    ◦ Equal amplitude frequency components with this separation
      will give distinct peaks
    ◦ Most windows have a≈2
   Time resolution = 2N/(a*fs)
    ◦ Amplitude variations with this period will be attenuated by
      6 dB
   For all window functions, the product of the time and
    frequency resolutions is equal to 2.
   Linguistic analysis typically uses a window length of
    10–20 ms. The transfer function of the vocal tract
    does not change significantly in this time.
   To keep all the information about time variation
    of spectral components, you need only sample
    the spectrum twice as fast as the spectral
    magnitudes are varying. Using normalised
    frequencies:
   –∞ dB bandwidth for an N-point window = b/N
    ◦ b = 4 for a Hamming window
   Significant variation of spectral components
    occurs at frequencies below 2b/N
   we must sample the spectrum at a frequency of
    b/N
    ◦ the separation between spectral samples is the window
      width divided by b
   Exact line-spectrum of a periodic signal {xm}
   Sampled continuous spectrum of zero-extended {xm}
   Sampled continuous spectrum of infinite {xm} convolved with spectrum of
    rectangular window
   FFT is an algorithm for calculating DFT

   Energy Conservation (Parseval’s theorem)

   Symmetries {xm} ⇒ {Xk}
   Discrete⇒Periodic:Xk+N=Xk
   Real⇒Hermitian: X–k= X*k
   Periodic:xm+N/r = x
   m ⇒Discrete:Xk= 0for k ≠ir
   Skew Periodic:xm+N/2r = –xm ⇒Odd Harmonics:
   Even:xm= x N–m ⇒Real
   Odd:xm= –x N–m ⇒Purely Imaginary
   Real & Even⇒Real & Even
   Real & Odd⇒Purely Imaginary and Odd

Contenu connexe

Tendances

Applications of digital signal processing
Applications of digital signal processing Applications of digital signal processing
Applications of digital signal processing Rajeev Piyare
 
Noise 2.0
Noise 2.0Noise 2.0
Noise 2.0bhavyaw
 
Introduction to Information Theory and Coding.pdf
Introduction to Information Theory and Coding.pdfIntroduction to Information Theory and Coding.pdf
Introduction to Information Theory and Coding.pdfJimma University
 
Wide band frequency modulation wbfm | Communication Systems
Wide band frequency modulation wbfm | Communication SystemsWide band frequency modulation wbfm | Communication Systems
Wide band frequency modulation wbfm | Communication SystemsLearn By Watch
 
Types of the Signals- Signals and Systems
Types of the Signals- Signals and SystemsTypes of the Signals- Signals and Systems
Types of the Signals- Signals and SystemsGhansyam Rathod
 
Basics of digital filters
Basics of digital filtersBasics of digital filters
Basics of digital filtersSmile Hossain
 
parametric method of power spectrum Estimation
parametric method of power spectrum Estimationparametric method of power spectrum Estimation
parametric method of power spectrum Estimationjunjer
 
NOISE IN Analog Communication Part-2 AM SYSTEMS.ppt
NOISE IN Analog Communication  Part-2 AM SYSTEMS.pptNOISE IN Analog Communication  Part-2 AM SYSTEMS.ppt
NOISE IN Analog Communication Part-2 AM SYSTEMS.pptAshishChandrakar12
 
Digital communications
Digital communicationsDigital communications
Digital communicationsAllanki Rao
 
Differential pulse code modulation
Differential pulse code modulationDifferential pulse code modulation
Differential pulse code modulationRamraj Bhadu
 
Digital Communication 1
Digital Communication 1Digital Communication 1
Digital Communication 1admercano101
 

Tendances (20)

Applications of digital signal processing
Applications of digital signal processing Applications of digital signal processing
Applications of digital signal processing
 
Noise 2.0
Noise 2.0Noise 2.0
Noise 2.0
 
Matched filter
Matched filterMatched filter
Matched filter
 
digital filter design
digital filter designdigital filter design
digital filter design
 
Introduction to Information Theory and Coding.pdf
Introduction to Information Theory and Coding.pdfIntroduction to Information Theory and Coding.pdf
Introduction to Information Theory and Coding.pdf
 
Digital signal processing part1
Digital signal processing part1Digital signal processing part1
Digital signal processing part1
 
Wide band frequency modulation wbfm | Communication Systems
Wide band frequency modulation wbfm | Communication SystemsWide band frequency modulation wbfm | Communication Systems
Wide band frequency modulation wbfm | Communication Systems
 
Noise cancellation and supression
Noise cancellation and supressionNoise cancellation and supression
Noise cancellation and supression
 
Types of the Signals- Signals and Systems
Types of the Signals- Signals and SystemsTypes of the Signals- Signals and Systems
Types of the Signals- Signals and Systems
 
Digital communication unit II
Digital communication unit IIDigital communication unit II
Digital communication unit II
 
Speech Signal Processing
Speech Signal ProcessingSpeech Signal Processing
Speech Signal Processing
 
carbon microphone ppt
carbon microphone pptcarbon microphone ppt
carbon microphone ppt
 
Basics of digital filters
Basics of digital filtersBasics of digital filters
Basics of digital filters
 
Information theory
Information theoryInformation theory
Information theory
 
Information theory
Information theoryInformation theory
Information theory
 
parametric method of power spectrum Estimation
parametric method of power spectrum Estimationparametric method of power spectrum Estimation
parametric method of power spectrum Estimation
 
NOISE IN Analog Communication Part-2 AM SYSTEMS.ppt
NOISE IN Analog Communication  Part-2 AM SYSTEMS.pptNOISE IN Analog Communication  Part-2 AM SYSTEMS.ppt
NOISE IN Analog Communication Part-2 AM SYSTEMS.ppt
 
Digital communications
Digital communicationsDigital communications
Digital communications
 
Differential pulse code modulation
Differential pulse code modulationDifferential pulse code modulation
Differential pulse code modulation
 
Digital Communication 1
Digital Communication 1Digital Communication 1
Digital Communication 1
 

En vedette

Time-Frequency Representation of Microseismic Signals using the SST
Time-Frequency Representation of Microseismic Signals using the SSTTime-Frequency Representation of Microseismic Signals using the SST
Time-Frequency Representation of Microseismic Signals using the SSTUT Technology
 
Comprehensive_Slides_Ashikur_4_13
Comprehensive_Slides_Ashikur_4_13Comprehensive_Slides_Ashikur_4_13
Comprehensive_Slides_Ashikur_4_13Ashikur Rahman
 
A review of time­‐frequency methods
A review of time­‐frequency methodsA review of time­‐frequency methods
A review of time­‐frequency methodsUT Technology
 
Using time frequency and wavelet analysis to assess turbulence-rotor interact...
Using time frequency and wavelet analysis to assess turbulence-rotor interact...Using time frequency and wavelet analysis to assess turbulence-rotor interact...
Using time frequency and wavelet analysis to assess turbulence-rotor interact...ndkelley
 
ppt on Time Domain and Frequency Domain Analysis
ppt on Time Domain and Frequency Domain Analysisppt on Time Domain and Frequency Domain Analysis
ppt on Time Domain and Frequency Domain Analysissagar_kamble
 
Decimation in time and frequency
Decimation in time and frequencyDecimation in time and frequency
Decimation in time and frequencySARITHA REDDY
 

En vedette (6)

Time-Frequency Representation of Microseismic Signals using the SST
Time-Frequency Representation of Microseismic Signals using the SSTTime-Frequency Representation of Microseismic Signals using the SST
Time-Frequency Representation of Microseismic Signals using the SST
 
Comprehensive_Slides_Ashikur_4_13
Comprehensive_Slides_Ashikur_4_13Comprehensive_Slides_Ashikur_4_13
Comprehensive_Slides_Ashikur_4_13
 
A review of time­‐frequency methods
A review of time­‐frequency methodsA review of time­‐frequency methods
A review of time­‐frequency methods
 
Using time frequency and wavelet analysis to assess turbulence-rotor interact...
Using time frequency and wavelet analysis to assess turbulence-rotor interact...Using time frequency and wavelet analysis to assess turbulence-rotor interact...
Using time frequency and wavelet analysis to assess turbulence-rotor interact...
 
ppt on Time Domain and Frequency Domain Analysis
ppt on Time Domain and Frequency Domain Analysisppt on Time Domain and Frequency Domain Analysis
ppt on Time Domain and Frequency Domain Analysis
 
Decimation in time and frequency
Decimation in time and frequencyDecimation in time and frequency
Decimation in time and frequency
 

Similaire à Speech signal time frequency representation

DSP_2018_FOEHU - Lec 08 - The Discrete Fourier Transform
DSP_2018_FOEHU - Lec 08 - The Discrete Fourier TransformDSP_2018_FOEHU - Lec 08 - The Discrete Fourier Transform
DSP_2018_FOEHU - Lec 08 - The Discrete Fourier TransformAmr E. Mohamed
 
Digital Signal Processing[ECEG-3171]-Ch1_L05
Digital Signal Processing[ECEG-3171]-Ch1_L05Digital Signal Processing[ECEG-3171]-Ch1_L05
Digital Signal Processing[ECEG-3171]-Ch1_L05Rediet Moges
 
UNIT-2 DSP.ppt
UNIT-2 DSP.pptUNIT-2 DSP.ppt
UNIT-2 DSP.pptshailaja68
 
EC8553 Discrete time signal processing
EC8553 Discrete time signal processing EC8553 Discrete time signal processing
EC8553 Discrete time signal processing ssuser2797e4
 
Chapter7 circuits
Chapter7 circuitsChapter7 circuits
Chapter7 circuitsVin Voro
 
Ff tand matlab-wanjun huang
Ff tand matlab-wanjun huangFf tand matlab-wanjun huang
Ff tand matlab-wanjun huangjhonce
 
Ff tand matlab-wanjun huang
Ff tand matlab-wanjun huangFf tand matlab-wanjun huang
Ff tand matlab-wanjun huangSagar Ahir
 
DSP_FOEHU - Lec 08 - The Discrete Fourier Transform
DSP_FOEHU - Lec 08 - The Discrete Fourier TransformDSP_FOEHU - Lec 08 - The Discrete Fourier Transform
DSP_FOEHU - Lec 08 - The Discrete Fourier TransformAmr E. Mohamed
 
Fourier analysis of signals and systems
Fourier analysis of signals and systemsFourier analysis of signals and systems
Fourier analysis of signals and systemsBabul Islam
 
DONY Simple and Practical Algorithm sft.pptx
DONY Simple and Practical Algorithm sft.pptxDONY Simple and Practical Algorithm sft.pptx
DONY Simple and Practical Algorithm sft.pptxDonyMa
 
Pres Simple and Practical Algorithm sft.pptx
Pres Simple and Practical Algorithm sft.pptxPres Simple and Practical Algorithm sft.pptx
Pres Simple and Practical Algorithm sft.pptxDonyMa
 
signal homework
signal homeworksignal homework
signal homeworksokok22867
 

Similaire à Speech signal time frequency representation (20)

DSP_2018_FOEHU - Lec 08 - The Discrete Fourier Transform
DSP_2018_FOEHU - Lec 08 - The Discrete Fourier TransformDSP_2018_FOEHU - Lec 08 - The Discrete Fourier Transform
DSP_2018_FOEHU - Lec 08 - The Discrete Fourier Transform
 
Dft
DftDft
Dft
 
Digital Signal Processing[ECEG-3171]-Ch1_L05
Digital Signal Processing[ECEG-3171]-Ch1_L05Digital Signal Processing[ECEG-3171]-Ch1_L05
Digital Signal Processing[ECEG-3171]-Ch1_L05
 
UNIT-2 DSP.ppt
UNIT-2 DSP.pptUNIT-2 DSP.ppt
UNIT-2 DSP.ppt
 
Doppler radar
Doppler radarDoppler radar
Doppler radar
 
Basic concepts
Basic conceptsBasic concepts
Basic concepts
 
Dsp Lab Record
Dsp Lab RecordDsp Lab Record
Dsp Lab Record
 
lecture_16.ppt
lecture_16.pptlecture_16.ppt
lecture_16.ppt
 
DSP Lab 1-6.pdf
DSP Lab 1-6.pdfDSP Lab 1-6.pdf
DSP Lab 1-6.pdf
 
EC8553 Discrete time signal processing
EC8553 Discrete time signal processing EC8553 Discrete time signal processing
EC8553 Discrete time signal processing
 
Kanal wireless dan propagasi
Kanal wireless dan propagasiKanal wireless dan propagasi
Kanal wireless dan propagasi
 
Chapter7 circuits
Chapter7 circuitsChapter7 circuits
Chapter7 circuits
 
Ff tand matlab-wanjun huang
Ff tand matlab-wanjun huangFf tand matlab-wanjun huang
Ff tand matlab-wanjun huang
 
Ff tand matlab-wanjun huang
Ff tand matlab-wanjun huangFf tand matlab-wanjun huang
Ff tand matlab-wanjun huang
 
DSP_FOEHU - Lec 08 - The Discrete Fourier Transform
DSP_FOEHU - Lec 08 - The Discrete Fourier TransformDSP_FOEHU - Lec 08 - The Discrete Fourier Transform
DSP_FOEHU - Lec 08 - The Discrete Fourier Transform
 
Fourier analysis of signals and systems
Fourier analysis of signals and systemsFourier analysis of signals and systems
Fourier analysis of signals and systems
 
unit 4,5 (1).docx
unit 4,5 (1).docxunit 4,5 (1).docx
unit 4,5 (1).docx
 
DONY Simple and Practical Algorithm sft.pptx
DONY Simple and Practical Algorithm sft.pptxDONY Simple and Practical Algorithm sft.pptx
DONY Simple and Practical Algorithm sft.pptx
 
Pres Simple and Practical Algorithm sft.pptx
Pres Simple and Practical Algorithm sft.pptxPres Simple and Practical Algorithm sft.pptx
Pres Simple and Practical Algorithm sft.pptx
 
signal homework
signal homeworksignal homework
signal homework
 

Plus de Nikolay Karpov

Principal characteristics of speech
Principal characteristics of speechPrincipal characteristics of speech
Principal characteristics of speechNikolay Karpov
 
Идентификация уровня сложности текста и его адаптация
Идентификация уровня сложности текста и его адаптацияИдентификация уровня сложности текста и его адаптация
Идентификация уровня сложности текста и его адаптацияNikolay Karpov
 
Идентификация уровня ложности текста и его адаптация
Идентификация уровня ложности текста и его адаптацияИдентификация уровня ложности текста и его адаптация
Идентификация уровня ложности текста и его адаптацияNikolay Karpov
 
Теория и практика обработки естественного языка
Теория и практика обработки естественного языкаТеория и практика обработки естественного языка
Теория и практика обработки естественного языкаNikolay Karpov
 
Speech waves in tube and filters
Speech waves in tube and filtersSpeech waves in tube and filters
Speech waves in tube and filtersNikolay Karpov
 
Principal characteristics of speech
Principal characteristics of speechPrincipal characteristics of speech
Principal characteristics of speechNikolay Karpov
 

Plus de Nikolay Karpov (9)

Principal characteristics of speech
Principal characteristics of speechPrincipal characteristics of speech
Principal characteristics of speech
 
Идентификация уровня сложности текста и его адаптация
Идентификация уровня сложности текста и его адаптацияИдентификация уровня сложности текста и его адаптация
Идентификация уровня сложности текста и его адаптация
 
Идентификация уровня ложности текста и его адаптация
Идентификация уровня ложности текста и его адаптацияИдентификация уровня ложности текста и его адаптация
Идентификация уровня ложности текста и его адаптация
 
Cepstral coefficients
Cepstral coefficientsCepstral coefficients
Cepstral coefficients
 
Теория и практика обработки естественного языка
Теория и практика обработки естественного языкаТеория и практика обработки естественного языка
Теория и практика обработки естественного языка
 
Linear prediction
Linear predictionLinear prediction
Linear prediction
 
Speech waves in tube and filters
Speech waves in tube and filtersSpeech waves in tube and filters
Speech waves in tube and filters
 
Principal characteristics of speech
Principal characteristics of speechPrincipal characteristics of speech
Principal characteristics of speech
 
Tagger numbers
Tagger numbersTagger numbers
Tagger numbers
 

Dernier

Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONHumphrey A Beña
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
Dust Of Snow By Robert Frost Class-X English CBSE
Dust Of Snow By Robert Frost Class-X English CBSEDust Of Snow By Robert Frost Class-X English CBSE
Dust Of Snow By Robert Frost Class-X English CBSEaurabinda banchhor
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfErwinPantujan2
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...JojoEDelaCruz
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmStan Meyer
 
Integumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptIntegumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptshraddhaparab530
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Measures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataMeasures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataBabyAnnMotar
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...Postal Advocate Inc.
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxlancelewisportillo
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
EmpTech Lesson 18 - ICT Project for Website Traffic Statistics and Performanc...
EmpTech Lesson 18 - ICT Project for Website Traffic Statistics and Performanc...EmpTech Lesson 18 - ICT Project for Website Traffic Statistics and Performanc...
EmpTech Lesson 18 - ICT Project for Website Traffic Statistics and Performanc...liera silvan
 
Textual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSTextual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSMae Pangan
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management SystemChristalin Nelson
 

Dernier (20)

YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 
Dust Of Snow By Robert Frost Class-X English CBSE
Dust Of Snow By Robert Frost Class-X English CBSEDust Of Snow By Robert Frost Class-X English CBSE
Dust Of Snow By Robert Frost Class-X English CBSE
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and Film
 
Integumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptIntegumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.ppt
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
Measures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataMeasures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped data
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
EmpTech Lesson 18 - ICT Project for Website Traffic Statistics and Performanc...
EmpTech Lesson 18 - ICT Project for Website Traffic Statistics and Performanc...EmpTech Lesson 18 - ICT Project for Website Traffic Statistics and Performanc...
EmpTech Lesson 18 - ICT Project for Website Traffic Statistics and Performanc...
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
Textual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSTextual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHS
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 

Speech signal time frequency representation

  • 1. Methods and algorithms of speech recognition course Nikolay V. Karpov nkarpov(а)hse.ru Lection 2
  • 2. This lecture is concerned with the spectrogram as a representation of how the frequency components within a signal vary with time.  Define what we mean by normalised time and frequency  Define the short-term discrete Fourier transformaton  Look at the effect of different window lengths on time and frequency resolution  Derive a quantitative description of the frequency resolution of the short term DFT and compare the performance of common windows  Derive a quantitative description of the time resolution of the short term DFT  Explain the uncertainty principle and illustrate its effects with some examples  Review some of the properties of the DFT
  • 3. Sampling theorem says that when the analog signal x(t) is band- restricted between 0 and W[Hz] and when x(t) is sampled at every T = 1/2W, the original signal can be completely reproduced by
  • 4. For those signals the frequency bandwidths of which are not known, a low-pass filter is used to restrict the bandwidths before sampling. When a signal is sampled contrary to the sampling theorem, aliasing distortion occurs, which distorts the high- frequency components of the signal, as shown
  • 5. Designate sin(2 wt ) g (t ) 2 wt i i i i x(t ) x( ) g (t ) x( ) x( ) x(iT ) x(i ) i 1 2w 2w 2w Fs Simple continuous signal x(t ) A cos(2 ft ) i k x(t ) A cos(2 f ( ) ) A cos( i ) A cos(2 i ) Fs N 1 k Normalised radian frequency 0 2 f( ) 2 (1) 0.5 Fs N Normalised time i
  • 6. With sampled data systems it is customary to change the time scale and work in units of the sample period.  In normalised units the sampling frequency (fs) and period (T) both equal 1. They can therefore be omitted from equations: any such omissions can be deduced using dimensional consistency arguments.  The Nyquist frequency is ½ Hz = πradians/second  To convert back to real units: ◦ any time quantity must be multiplied by the real sample period (or divided by the real sample frequency) ◦ any frequency or angular frequency quantity must be multiplied by the real sample frequency (or divided by the real sample period)
  • 7. The spectrogram shows the energy in a signal at each frequency and at each time. We calculate this by evaluating the short-term discrete Fourier transform Z- tramsform n X ( z) x ( n) z n 1 X ( z) X ( z ) z n 1dz 2 j c
  • 8. The Fourier transform of x(n) can be computed from the z- transform as: i n X( ) X ( z) z e i x ( n )e n The Fourier transform may be viewed as the z-transform evaluated around the unit circle. The Discrete Fourier Transform (DFT) is defined as a sampled version of the Fourier shown above: N 1 N 1 i 2 kn / N X (k ) x ( n )e x ( n) X ( k )e i 2 kn / N n 0 n 0 The Fast Fourier Transform (FFT) is simply an efficient computation of the DFT.
  • 9. The Discrete Cosine Transform (DCT) is simply a DFT of a signal that is assumed to be real and even (real and even signals have real spectra!) N 1 X (k ) x(0) 2 x(n) cos(2 kn / N ); k 0,1,..., N 1 n 0 Note that these are not the only transforms used in speech processing  Wavelets;  Fractals;  Wigner distributions;  etc.
  • 10. We often want to estimate the “power spectrum” of a non-stationary signal at a particular instant of time.  Multiply by a finite length window and take the DFT.  For a window of length N ending on sample m we have: N 1 i 2 k (m n) / N X ( k ; m) w(n) x(m n)e n 0 k-frequency, m-time
  • 11. 2  X (k , m) X (k , m) X * (k , m) gives the power at a frequency of k/N Hz (normalized) for a window centred at m-0.5(N-1).  the (m-i) term in the exponent means that the phase origin remains consistent by cancelling out the linear phase shift introduced by a delay of m samples.  the window samples are numbered backwards in time (for convenience later) hence the summation is performed backwards in time.  The values X(k;m) are based on the N signal values from m–N+1 to m.  the frequency resolution is 1/N Hz (normalised units)  the spectrum is periodic and (since w and x are real) conjugate symmetric: X (k ) X (N k) X *( k)  there are only 0.5N+1independent frequency values
  • 12. Sound «is» Hamming window of length 401 samples centred on 400
  • 13. Line spectrum from larynx oscillation (at about 0.01 normalised Hz) is superimposed on vocal tract resonances
  • 14. Wide window (N=400)  Independent frequency values=N/2+1=201  Short window eliminates the fine detail (N=30)  N/2+1=16  Zero padded window gives more spectrum points and an illusion of more detail (N=480=30×16). This is the normal case for a spectrogram  N/2+1=241
  • 15. N 1 X ( k ; m) w(n) x(m n) exp i 2 k ( m n) / N n 0 By setting r= N–1–i we can rewrite this as: 2 j (m N 1) N 1 X ( k ; m) exp( ) * ym (r ) exp 2 jkr / N N r 0 ym (r ) w( N 1 r ) x(m N 1 r ) This is a standard DFT multiplied by a phase-shift term that is proportional to k: this compensates for the starting time of the window: m–N+1 y(r) is a product of two signals so its DFT is the convolution of the DFT’s of w(N–1–r) and x(m–N+1+r)
  • 16.
  • 17. Rectangle window w(n) 1 Slidelobe -13.3 dB Mainlobe width(-3dB) 0.027 Leakage factor 9.14% a=1.21, b=2 N ck cos(2k (n ) / N) 2 Hamming window w(n) 0.54 046 c1 Slidelobe -42.5 dB Mainlobe width 0.039 Linkage factor 0.03% a=1.81, b=4
  • 19. Спектр сигнала при использовании окна Блэкмана Спектр сигнала при использовании окна Блэкмана - Наттала
  • 20. BW = 45 Hz NT = 44 ms BW = 300 Hz NT = 7 ms
  • 21. /aI/ from “my” with 45Hz and 300Hz bandwidth spectrograms 45Hz, 44ms 300Hz, 7ms
  • 22.  Vertical slice through spectrogram:  mT= 0.1s. 45 Hz gives finer frequency resolution  Horizontal slice through spectrogram:  k/NT= 3.5 kHz 300 Hz gives finer time resolution
  • 23. You cannot get good time resolution and good frequency resolution from the same spectrogram.  Duration of window = N/fs=N*T.  Frequency resolution = fs×a/N ◦ Equal amplitude frequency components with this separation will give distinct peaks ◦ Most windows have a≈2  Time resolution = 2N/(a*fs) ◦ Amplitude variations with this period will be attenuated by 6 dB  For all window functions, the product of the time and frequency resolutions is equal to 2.  Linguistic analysis typically uses a window length of 10–20 ms. The transfer function of the vocal tract does not change significantly in this time.
  • 24. To keep all the information about time variation of spectral components, you need only sample the spectrum twice as fast as the spectral magnitudes are varying. Using normalised frequencies:  –∞ dB bandwidth for an N-point window = b/N ◦ b = 4 for a Hamming window  Significant variation of spectral components occurs at frequencies below 2b/N  we must sample the spectrum at a frequency of b/N ◦ the separation between spectral samples is the window width divided by b
  • 25. Exact line-spectrum of a periodic signal {xm}  Sampled continuous spectrum of zero-extended {xm}  Sampled continuous spectrum of infinite {xm} convolved with spectrum of rectangular window  FFT is an algorithm for calculating DFT  Energy Conservation (Parseval’s theorem)  Symmetries {xm} ⇒ {Xk}  Discrete⇒Periodic:Xk+N=Xk  Real⇒Hermitian: X–k= X*k  Periodic:xm+N/r = x  m ⇒Discrete:Xk= 0for k ≠ir  Skew Periodic:xm+N/2r = –xm ⇒Odd Harmonics:  Even:xm= x N–m ⇒Real  Odd:xm= –x N–m ⇒Purely Imaginary  Real & Even⇒Real & Even  Real & Odd⇒Purely Imaginary and Odd