SlideShare une entreprise Scribd logo
1  sur  8
International Journal of Engineering Research and Development
e-ISSN: 2278-067X, p-ISSN: 2278-800X, www.ijerd.com
Volume 7, Issue 5 (June 2013), PP. 23-30
23
An Auto-Correlation Based Speech Enhancement
Algorithm
Lalchhandami1
, Maninder Pal 2
1,2
Department of Electronics & Communication Engineering
Maharishi Markandeshwar University, Mullana (Ambala), INDIA
Abstract:- Enhancement of speech signals recorded using signal channel devices such as mobile
phones is of prime interest. It is because for these devices, it is not possible to record noise signals
separately, and the surrounding background noises are picked up by their microphone simultaneously
with the speech signal. This may even completely fade-in the speech signal, depending upon the
signal-to-noise ratio (SNR). Therefore to address this problem, number of algorithms and techniques
has been developed. However, the existing methods are not able to perform homogenously across all
noise types. The auto-correlation function of a noisy speech signal is usually confined to lower time
lag and is very small or zero for higher time lag. Therefore, the higher-lag auto-correlation coefficients
are relatively robust to additive noise distortion. This paper is focused on enhancing the noisy speech
signal from single channel devices by using only the higher-lag auto-correlation coefficients. The
efficiency of the algorithm is evaluated in terms of energy, zero crossings and intelligibility of speech
signal.
Keywords:- Single channel speech enhancement; speech processing; spectral subtraction auto-
correlation and Speech signals SNR
I. INTRODUCTION
Speech enhancement is a topic of interest for last many years. In particularly, enhancement of speech
signals recorded using signal channel devices such as mobile phones is of prime interest in this paper. It is
because for these devices, it is not possible to record noise signals separately, and the surrounding background
noises are picked up by its microphone simultaneously with the speech signal. This may even completely fade-
in the speech signal, depending upon the signal-to-noise ratio (SNR). Therefore to address this problem, number
of algorithms and techniques has been developed. The key techniques include: suppression of noise using the
periodicity of speech or noise, enhancement based on perceptual criteria and subtractive-type algorithms such as
spectral subtraction and Wiener filter [1], [ 6]. Among these, the spectral subtraction algorithm is the oldest
speech enhancement algorithm. In this approach, the degraded speech signal is enhanced by subtracting an
estimate of the average noise spectrum from a noisy speech spectrum and the noise spectrum is estimated or
updated during the silence period of the speaker. However, the existing methods are not able to perform
homogenously across all noise types. It is because all of these speech enhancement systems are based on certain
assumptions and constraints that are typically dependent on the application and the environment. This paper is
focused on reducing the continuous background noises from the speech signals recorded using single channel
microphone based devices. For this purpose, auto-correlation function in time domain and frequency domain is
used. The magnitude of auto-correlation coefficients is usually large between 2ms and 12ms, as the human pitch
period is typically constrained between these values. However, the same is generally not true for noisy speech
signals. The auto-correlation function of a noisy speech signal is usually confined to lower time lag and is very
small or zero for higher time lag. The higher-lag auto-correlation coefficients are relatively robust to additive
noise distortion [5]. Therefore, this paper is focused on enhancing the noisy speech signal from single channel
devices by using only the higher-lag auto-correlation coefficients. The efficiency of the algorithm is evaluated in
terms of energy, zero crossings and intelligibility of speech signal. This is presented below.
II. SYSTEM MODEL
As discussed above, the major constraint of single channel methods in speech enhancement is that there
is no reference signal for the noise available. Therefore, the power spectral density of the noise needs to be
estimated based on the available noisy speech signal only and this makes it a challenging task. In all single
channel enhancement techniques, the noisy speech signal is given by
An Auto-Correlation Based Speech Enhancement Algorithm
24
𝑦 𝑛 = 𝑥 𝑛 + 𝑑(𝑛) (1)
Where, 𝑥 𝑛 represents the clean speech signal, 𝑑 𝑛 is the uncorrelated additive noise and 𝑦 𝑛
represents the degraded noisy speech signal. To enhance such noisy speech signals, an auto-correlation function
based speech enhancement is presented in this paper. Its principle is based on traditional Spectral Subtraction
method. In this Spectral Subtraction method, the degraded speech signals are enhanced by subtracting the
estimate of the average noise spectrum from a noisy speech spectrum [1]. The noise spectrum is estimated during
the periods when the signal is absent; which is usually very difficult to do in practice. In addition, it is also
assumed that speech and noise is additive and uncorrelated. An estimate of the clean signal 𝑥 𝑛 is recovered
from the noisy signal 𝑦(𝑛) by assuming that there is an estimate of the power spectrum of noise 𝐷𝑘
2
, which is
obtained by averaging over multiple frames of a known noise segment. An estimate of the short-time squared
magnitude spectrum of the clean signal using this method can be obtained as follows:
𝑋 𝑘
2
= 𝑌𝑘
2
− 𝐷𝑘
2
, if 𝑌𝑘
2
− 𝐷𝑘
2
≥ 0
0, otherwise
(2)
Where, |𝑌𝑘 | is the magnitude of the noisy signal spectrum, 𝑋 𝑘 is the estimate of the clean frequency
spectrum for a given frame, and |𝐷𝑘| is the magnitude of noise spectrum estimate during non-speech activity.
To recover the signal, the magnitude spectrum estimate is finally combined with the phase of the noisy signal,
using Inverse Discrete Fourier Transform (IDFT) as follows:
𝑥 𝑛 = 𝑋 𝑘 e𝑗 𝜑(𝑦,𝑘) (3)
Although the spectral subtraction algorithm can be easily implemented to effectively reduce noise
present in the corrupted signal; yet, it has several shortcomings. The major drawback of this method is the
resulting musical noise, due to rapid coming and going of speech signals over successive frames. This is why
this paper focuses on using auto-correlation function instead of power spectrum.
The autocorrelation function of a signal contains the same information about the signal as its power
spectrum [2]. However, the main difference between the power spectrum and auto-correlation domain is that in
the power spectrum domain, the information is presented as a function of frequency; while in the latter, it is
presented as a function of time. More specifically, the higher-lag auto-correlation coefficients of the speech signal
𝑥(𝑛) usually contain information about the signal’s power spectrum; whereas, the magnitude of higher-lag auto-
correlation coefficients of the noise signal 𝑑(𝑛) is relatively small for some noise types. Therefore, the lower-lag
auto-correlation coefficients can be discarded and only higher-lag auto-correlation coefficients of the noisy
speech signal can be used for spectral estimate. The spectral estimation is done using only the higher-lag auto-
correlation coefficients; and, the speech and noise signals can be separated without having to estimate the noise
signal directly. The estimated power spectrum can then be used to enhance the corrupted noisy speech signals. It
is to be noted that the auto-correlation function of a signal can be computed in time domain and frequency
domain. This paper has focused on computing auto-correlation function in both domains in order to investigate
their effect on enhancing speech signals.
III. IMPLEMENTATION PROCEDURE
The flowchart of the autocorrelation based speech enhancement algorithm is given in Fig. 1. As shown
in Fig. 1, the first step in spectral estimation is to divide the noisy speech signal into overlapping short time
frames. These short time frames are 64 ms (512 samples) long. A Hamming window with 50% overlapping is
then applied on these short time frames. This process is defined by Eqn 4, where 𝑥(𝑛) is discrete speech signal
and 𝑤(𝑛) is the window function shifted by 𝑚 samples.
𝑥 𝑤 𝑛 = 𝑥 𝑛 𝑤 𝑚 − 𝑛 (4)
By taking the Fourier Transform of 𝑥 𝑤 𝑛 , a discrete Short-Time Fourier Transform can be computed as shown
in Eqn 5.
𝑋 𝑤, 𝑘 =
1
𝑁
𝑥 𝑤 𝑛
𝑤
𝑛=𝑤−𝑁+1
𝑒
−𝑗2𝜋𝑘𝑛
𝑁 𝑘 = 0, … , 𝑁 − 1
0 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
(5)
The resulting discrete STFT can be expressed as
𝑋 𝑤, 𝑘 = |𝑋 𝑤, 𝑘 |𝑒 𝑗𝜑 (𝑤,𝑘) (6)
The |𝑋 𝑤, 𝑘 | component is the magnitude of the Short-Time Fourier Transform and the 𝜑 𝑤, 𝑘 component is
the phase. For speech recorded using single-channel based devices, the noisy signal 𝑦 𝑛 can be analytically
expressed in frequency domain as:
𝑌𝑘 𝑒 𝑗 𝜑(𝑦,𝑘)
= 𝑋 𝑘 𝑒 𝑗 𝜑(𝑥,𝑘)
+𝐷𝑘 𝑒 𝑗 𝜑(𝑑,𝑘) (7)
In the modification stage, the degraded signal in the frequency domain is modified in order to restore the
An Auto-Correlation Based Speech Enhancement Algorithm
25
original speech signal. For this purpose, a biased auto-correlation sequence of the windowed time frames is then
computed. As the power spectrum of an auto-correlation sequence has dynamic range twice as large as that of the
signal’s power spectrum, so if a Hamming window is applied to the positive side of the auto-correlation sequence,
the dynamic range would be effectively reduced by half. Therefore, to address this problem, a new window of
double dynamic range is used to process the one-sided auto-correlation sequence. For implementing this window,
the dynamic range of Hamming window is made double by computing its auto-correlation sequence and resulting
auto-correlation sequence. The resulting sequence is defined as the coefficients of the new high dynamic range
window. To increase the noise reduction effect, the new window is placed over the higher-lag region of the auto-
correlation sequence, which is taken to be greater than 2 ms (16 samples), as there is natural null in the average
short-time auto-correlation of speech signals at approximately 2 ms lag. An estimate of the original speech signals
power spectrum is then computed for each frame as the magnitude spectrum of the windowed higher-lag
autocorrelation coefficients. This is done by pairing the modified magnitude spectra 𝑋 𝑘 with each of the original
corresponding phase spectra 𝜑(𝑦, 𝑘). The degraded phase spectra are then used to create the enhanced spectra.
The enhanced speech frames are then synthesized using an inverse Fourier transform of the complex spectra as
mentioned in Eqn 8.
𝑥 𝑛 = 𝐼𝐷𝐹𝑇 𝑋 𝑘 𝑒 𝑗 𝜑(𝑦,𝑘) (8)
These frames are then overlapped and added to obtain the enhanced speech signal.
Fig. 1: Flowchart of power spectral estimate using higher-lag auto-correlation coefficients.
The block diagram of the speech enhancement using higher-lag auto-correlation coefficients is shown in Fig. 2.
Fig. 2: Block diagram of the auto-correlation based speech enhancement algorithm.
IV. RESULTS & DISCUSSION
To evaluate the performance of the auto-correlation based speech enhancement algorithm, the same is
implemented in MATLAB and extensive simulations have been performed. A small length of approximately
An Auto-Correlation Based Speech Enhancement Algorithm
26
10 seconds of speech of a female person, in the age group 20 to 25 years, is recorded in the presence of noise.
The noise is produced by two sources: running fan and car. The speech signals are recorded separately in both
noise conditions. To record signals at different SNR’s, the speed of fan is changed to change the level of noise.
The mobile phone and sound card used for recording signals has a single microphone, so the speech signal and
noises are both picked up by the microphone. These recorded signals are called noisy speech signals. These
noisy speech signals are then processed by an auto-correlation based speech enhancement algorithm and the
estimated cleaned signals are then stored and listened back to estimate their degree of similarity (intelligibility)
with their original version. Auto-correlation function is computed by two methods: the generalized cross-
correlation (GCC) method in frequency domain and using Matlab xcorr command in time domain. The results
obtained for these two methods are then compared. These results are shown in Fig. 4 to Fig. 15, and the key
conclusions drawn from the results are discussed below.
1. Fig. 4 & Fig. 5 shows the auto-correlation function and one-sided auto-correlation window respectively of
one frame (i.e. 512 samples). For enhancing the speech signal, the lower lags of 16 samples (2 ms) of the
one-sided auto-correlation function are discarded and only the higher-lag coefficients are retained. A
Hamming window with wide range (width 512 samples) is applied to the higher-lag auto-correlation
sequence. It is observed that this wide dynamic range Hamming window has peak in the middle frequencies
and decrease to zero at the edges; thereby, reducing the effects of the discontinuities as a result of finite
duration. It is also observed that this window captures all the harmonic peaks of the auto-correlation
sequence, and therefore, captures all the formant structures with no spectral loss. This indicates that this
window works well in the auto-correlation domain in enhancing the noisy speech signal.
2. Fig. 6 show the power spectrum estimate of a single frame (512 samples) of the speech signals; which is
used to obtain the enhanced version of the signal. The estimated power spectrum is computed on a frame to
frame basis, and need not necessarily be computed during the non-speech activity of the speaker, as
required in the spectral subtraction algorithm. This is the key advantages of this algorithm.
3. Fig. 7, Fig. 12 & Fig. 13 shows the recorded speech signal with different levels of noises i.e. different SNRs
and in the presence of various noise producing sources. The energies of the recorded and the recovered
speech signals are computed and is observed that a high level of energy (above threshold) of the speech
signal is approximately at the same time instance for both the recorded and the recovered signal. It is also
observed that the zero-crossing rate is high during the speech period of the signal Thus, the intelligibility of
the recovered signal is maintained.
4. In Fig. 7 to Fig. 15, the recorded speech signals are processed by the auto-correlation based speech
enhancement method. The auto-correlation function is calculated using the GCC method in frequency
domain and using xcorr Matlab command for time domain. It can be observed that the second method
(xcorr) gives better results in reducing the noise from the speech signals; however, at the expense of small
decrease in magnitude of the speech signal. The decrease in the magnitude of the recovered speech signal is
uniform; thus, the algorithm maintains the intelligibility of the recovered speech signal.
5. Fig. 11, Fig.14 & Fig. 15 shows the spectrograms of the recorded noisy signal and the recovered signal
obtained using the auto-correlation based algorithm. The spectrograms showed that the noise energy, at a
particular frequency and time, is reduced to a considerable extent in the recovered speech signal; thereby,
improving the quality of the recovered speech signals.
6. Fig. 12 & Fig. 14 shows the results of noisy speech signals corresponding to a female voice of
approximately 10 seconds, recorded in a running car by using a microphone and sound card, and mobile
phone. It is observed that the recovered signal using xcorr gives better results than the GCC method, as both
musical and residual noises are observed in the signals recovered using GCC method. The continuous
noises from the ambient environment are reduced to a significant extent in the recovered signal by the two
methods; thereby, proving the effectiveness of the algorithm for continuous surrounding noises.
7. Fig. 13 & Fig. 15 shows the results for speech signals recorded using a single microphone (mono) based
mobile phone, used by a person in a running car. The recorded signal is then processed by the auto-
correlation based algorithm and is observed that the moving wind noise and engine noise of car are reduced
in the recovered signal. It is also observed from the spectrogram that the energy of the noises lie in the
audio frequency range indicating that the speech signals gets easily distorted by the ambient noises. The
spectrogram of the recovered signal shows that the energy of these noises is reduced in the recovered signal.
Thus, the algorithm works well for enhancing speech signals recorded using single channel devices in a
noisy environment. This justifies that this algorithm is designed to minimize the noises produced by
constant sources such as vehicles and is well suited for environments such as persons using mobile phones
in cars, trains etc.
An Auto-Correlation Based Speech Enhancement Algorithm
27
Fig. 3: Recorded speech (female speaking /a/ sound)
signal of length 10 seconds in presence of noise
produced by fan moving at constant speed.
Fig. 4: An example of autocorrelation of a
single speech frame (512 samples) using xcorr
and generalized cross-correlation (GCC).
Fig. 5: An example of autocorrelation window
(single sided) of single frame (512 samples).
Fig. 6: An example of the power spectrum
estimate of a single frame (512 samples).
Fig. 7: The recorded speech (female speaking /a/, /b/
and /c/ sound) signal of length 10 seconds in
presence of noise produced by a fan moving at
constant speed, and the recovered signals using GCC
(frequency domain) and xcorr (time domain).
Fig. 8: The recorded speech signal with its
energy (logarithmic scale) and zero crossing
rate.
An Auto-Correlation Based Speech Enhancement Algorithm
28
Fig. 9: The recovered speech signal (using GCC)
with its energy (logarithmic scale) and zero
crossing rate.
Fig. 10: The recovered speech signal (using
xcorr) with its energy (logarithmic scale) and
zero crossing rate.
a) The spectrogram of recorded
noisy speech signal.
b) The spectrogram of recovered
speech signal using GCC.
c) The spectrogram of recovered
speech signal using xcorr.
Fig. 11: The spectrogram of a) recorded speech signal, b) recovered signal using GCC, and c) recovered signal
using xcorr.
Fig. 12: The recorded speech signal of a female
/a/, /b/ and /c/ sound of length 10 seconds in
running car and the recovered signals using GCC
(frequency domain) and xcorr (time domain).
Fig. 13: Mobile phone recorded speech signal
corresponding to alphabets /a/ and /b/, produced
by male speaker in running car and the recovered
signals using GCC (frequency domain) and xcorr
(time domain).
An Auto-Correlation Based Speech Enhancement Algorithm
29
a) The spectrogram of recorded
noisy speech signal (Corresponding
to Fig. 12).
b) The spectrogram of recovered
speech signal using GCC.
c) The spectrogram of recovered
speech signal using xcorr.
Fig. 14: The spectrogram of a) recorded speech signal (corresponding to Figure 12), b) recovered signal using
GCC, and c) recovered signal using xcorr.
a) The spectrogram of recorded
noisy speech signal (Corresponding
to Fig. 12).
b) The spectrogram of recovered
speech signal using GCC.
c) The spectrogram of recovered
speech signal using xcorr.
Fig. 15: The spectrogram of a) recorded speech signal using using mobile phone in Fig. 13, b) recovered signal
using GCC, and c) recovered signal using xcorr.
V. CONCLUSIONS
This paper presented a speech enhancement algorithm based on auto-correlation function. In this
approach, it is expected that by discarding the more noise affected lower lag auto-correlation coefficients, an
estimate of power spectrum of clean signal can be obtained from the magnitude spectrum of higher lag auto-
correlation coefficients and finally the enhanced speech signal can be obtained using the overlap-add method.
More specifically, it presented the results obtained on applying the auto-correlation based speech enhancement
algorithm on the recorded speech signals with continuous background noises. The speech enhancement is
performed by using auto-correlation in both time domain and frequency domain. From the results obtained, it is
found that the time domain auto-correlation gives better results in reducing the noise from the speech signal and
gives better quality recovered signals as compared to the frequency domain auto-correlation. With the auto-
correlation based algorithm, the stationary noises (e.g., the moving fan noise and car noise) are reduced in the
recovered speech signals; but, at the expense of small decrease in magnitude of the speech signal. However, the
decrease in the magnitude of the recovered speech signal is uniform; thus, the algorithm maintains the
intelligibility of the original speech signal. It is also observed from the results that the algorithm does not work
well for non-stationary background noises, as these noises are still audible in the recovered signal. Therefore, it
is concluded that this algorithm is capable of enhancing the speech signal of a single channel based devices and
perform well for continuous or stationary noises.
REFERENCES
[1]. Z. Chen, “Simulation of Spectral Subtraction Based Noise Reduction Method,” International Journal of
Advanced Computer Science and Applications, Vol. 2, No. 8, 2011.
An Auto-Correlation Based Speech Enhancement Algorithm
30
[2]. M. A. Hasan & T. Shimamura, “A Fundamental Frequency Extraction Method Based on Windowless
and Normalized Autocorrelation Functions,” Proc. 6th
WSEAS Int. Conf. Circuits, Systems, Signal and
Telecommunications, Cambridge, pp. 305-309, 2012.
[3]. M. Gabrea, “Two Microphones Speech Enhancement Systems based on Instrumental Variable
Algorithm for Speaker Identification,” Proc. Of the 24th
Canadian Conference on Electrical and
Computer Engineering, May 2011.
[4]. J. I. Hurtado & D. V. Anderson, “FFT-Based Block Processing in Speech Enhancement: Potential
Artifacts and Solutions,” IEEE Transactions on Audio, Speech, and Language Processing, Vol. 19, No.
8, November 2011.
[5]. B. J. Shannon, K. K. Paliwal, & C. Nadeu, “Speech enhancement based on spectral estimation from
higher-lag autocorrelation”, Proc. Interspeech, 2006.
[6]. L. D. Alsteris & K. K. Paliwal, “Importance Of Window Shape For Phase-Only Reconstruction Of
Speech,” International Conf. on Acoustics, Speech, and Signal Processing-ICASSP, Vol. 1, pp. I-573-
6, 2004.
[7]. P. Basu, P. J. Wolfe, D. Rudoy, T. F. Quatieri & Bob Dunn, “Adaptive Short-Time Analysis-Synthesis
for Speech Enhancement,” International Conf. on Acoustics, Speech, and Signal Processing-ICASSP,
pp. 4905-4908, 2008.
[8]. A. K. Swain & W. Abdulla, “Estimation of LPC Parameters of Speech Signals in Noisy Environment,”
TENCON, Vol. 1, pp. 139-142, 2004.
[9]. H. G. Kim, M. Schwab, N. Moreau & T. Sikora, “Speech Enhancement of Noisy Speech Using Log-
Spectral Amplitude Estimator and Harmonic Tunneling,” International Workshop on Acoustic Echo
and Noise Control, Japan, September 2003.
[10]. K. A. Sheela and K. S. Prasad, “Less Computational Complex Method for Estimation of Non-
Stationary Noise in Speech Using Low frequency Regions for Power Spectral Subtraction,” National
conference on Communications (NCC2007) IIT Kanpur, January 2007.
[11]. T. Fingscheidt, S. Suhadi, & S. Stan, “Environment-Optimized Speech Enhancement,” IEEE
Transactions on Audio, Speech & Language Processing, pp. 825-834, 2008.
[12]. R. Martin, “Noise power spectral density estimation based on optimal smoothing and minimum
statistics,” IEEE Transactions on Speech and Audio Processing, Vol. 9, No. 5, pp. 504-512, July 2001.
[13]. M. K. Hasan, S. Salahuddin & M. R Khan, “A modified a priori SNR for speech enhancement using
spectral subtraction rules,” IEEE Signal Processing Letters, Vol. 11, No. 4, pp. 450-453, April 2004.
[14]. S. Ogata, S. Ebataz & T. Shimamuray, “Improved Model of SPAC (Speech Processing System by Use
of Auto-Correlation Function) Utilizing Spectral Subtraction as Preprocessing,” Proc. of ICA 2004:
The 18th international congress on acoustics, Kyoto, Japan, pp. IV-3037 - IV-3040, April 2004.
[15]. Y. Hu, M. Bhatnagar & P. C. Loizou, “A Cross-Correlation Technique for Enhancing Speech
Corrupted with Correlated Noise,” Proc. of IEEE International Conference on Acoustic Speech Signal
Processing, Vol. 1, Salt Lake City, UT, pp. 673-676, 2001.
[16]. J. B. Boldt, D. Ellis, “A Simple Correlation-Based Model of Intelligibility for Nonlinear Speech
Enhancement and Separation,” Proc. of the 17th European Signal Processing Conference EURASIP,
2009.
[17]. M. A. Hasan & T. Shimamura, “An Efficient Pitch Estimation Method Using Windowless and
Normalized Autocorrelation Functions in Noisy Environments,” International Journal of Circuits,
Systems and Signal Processing, Vol. 6 No. 1, pp. 197-204, 2012.

Contenu connexe

Tendances

METHOD FOR REDUCING OF NOISE BY IMPROVING SIGNAL-TO-NOISE-RATIO IN WIRELESS LAN
METHOD FOR REDUCING OF NOISE BY IMPROVING SIGNAL-TO-NOISE-RATIO IN WIRELESS LANMETHOD FOR REDUCING OF NOISE BY IMPROVING SIGNAL-TO-NOISE-RATIO IN WIRELESS LAN
METHOD FOR REDUCING OF NOISE BY IMPROVING SIGNAL-TO-NOISE-RATIO IN WIRELESS LANIJNSA Journal
 
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...sipij
 
A novel speech enhancement technique
A novel speech enhancement techniqueA novel speech enhancement technique
A novel speech enhancement techniqueeSAT Publishing House
 
Comparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniquesComparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniqueseSAT Journals
 
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...ijceronline
 
Paper id 252014135
Paper id 252014135Paper id 252014135
Paper id 252014135IJRAT
 
M.sc. presentation t.bagheri fashkhami
M.sc. presentation t.bagheri fashkhamiM.sc. presentation t.bagheri fashkhami
M.sc. presentation t.bagheri fashkhamitaherbagherif
 
An Optimized Energy Detection Scheme For Spectrum Sensing In Cognitive Radio
An Optimized Energy Detection Scheme For Spectrum Sensing In Cognitive RadioAn Optimized Energy Detection Scheme For Spectrum Sensing In Cognitive Radio
An Optimized Energy Detection Scheme For Spectrum Sensing In Cognitive RadioIJERD Editor
 
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...sipij
 
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...CSCJournals
 
ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...
ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...
ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...cscpconf
 
Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...
Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...
Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...Venkata Sudhir Vedurla
 
Improvement of minimum tracking in Minimum Statistics noise estimation method
Improvement of minimum tracking in Minimum Statistics noise estimation methodImprovement of minimum tracking in Minimum Statistics noise estimation method
Improvement of minimum tracking in Minimum Statistics noise estimation methodCSCJournals
 
A Review on Speech Enhancement System using FIR Filter
A Review on Speech Enhancement System using FIR FilterA Review on Speech Enhancement System using FIR Filter
A Review on Speech Enhancement System using FIR FilterIRJET Journal
 
Digital modeling of speech signal
Digital modeling of speech signalDigital modeling of speech signal
Digital modeling of speech signalVinodhini
 
Echo Cancellation Algorithms using Adaptive Filters: A Comparative Study
Echo Cancellation Algorithms using Adaptive Filters: A Comparative StudyEcho Cancellation Algorithms using Adaptive Filters: A Comparative Study
Echo Cancellation Algorithms using Adaptive Filters: A Comparative Studyidescitation
 

Tendances (20)

METHOD FOR REDUCING OF NOISE BY IMPROVING SIGNAL-TO-NOISE-RATIO IN WIRELESS LAN
METHOD FOR REDUCING OF NOISE BY IMPROVING SIGNAL-TO-NOISE-RATIO IN WIRELESS LANMETHOD FOR REDUCING OF NOISE BY IMPROVING SIGNAL-TO-NOISE-RATIO IN WIRELESS LAN
METHOD FOR REDUCING OF NOISE BY IMPROVING SIGNAL-TO-NOISE-RATIO IN WIRELESS LAN
 
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...
 
A novel speech enhancement technique
A novel speech enhancement techniqueA novel speech enhancement technique
A novel speech enhancement technique
 
Comparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniquesComparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniques
 
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
 
Paper id 252014135
Paper id 252014135Paper id 252014135
Paper id 252014135
 
M.sc. presentation t.bagheri fashkhami
M.sc. presentation t.bagheri fashkhamiM.sc. presentation t.bagheri fashkhami
M.sc. presentation t.bagheri fashkhami
 
An Optimized Energy Detection Scheme For Spectrum Sensing In Cognitive Radio
An Optimized Energy Detection Scheme For Spectrum Sensing In Cognitive RadioAn Optimized Energy Detection Scheme For Spectrum Sensing In Cognitive Radio
An Optimized Energy Detection Scheme For Spectrum Sensing In Cognitive Radio
 
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
 
Lr3620092012
Lr3620092012Lr3620092012
Lr3620092012
 
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...
 
ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...
ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...
ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...
 
Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...
Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...
Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...
 
H010234144
H010234144H010234144
H010234144
 
Ijetcas14 555
Ijetcas14 555Ijetcas14 555
Ijetcas14 555
 
Animal Voice Morphing System
Animal Voice Morphing SystemAnimal Voice Morphing System
Animal Voice Morphing System
 
Improvement of minimum tracking in Minimum Statistics noise estimation method
Improvement of minimum tracking in Minimum Statistics noise estimation methodImprovement of minimum tracking in Minimum Statistics noise estimation method
Improvement of minimum tracking in Minimum Statistics noise estimation method
 
A Review on Speech Enhancement System using FIR Filter
A Review on Speech Enhancement System using FIR FilterA Review on Speech Enhancement System using FIR Filter
A Review on Speech Enhancement System using FIR Filter
 
Digital modeling of speech signal
Digital modeling of speech signalDigital modeling of speech signal
Digital modeling of speech signal
 
Echo Cancellation Algorithms using Adaptive Filters: A Comparative Study
Echo Cancellation Algorithms using Adaptive Filters: A Comparative StudyEcho Cancellation Algorithms using Adaptive Filters: A Comparative Study
Echo Cancellation Algorithms using Adaptive Filters: A Comparative Study
 

En vedette

AAFCS 2013 Entrepreneurship Workshop Bundled Handout
AAFCS 2013 Entrepreneurship Workshop Bundled HandoutAAFCS 2013 Entrepreneurship Workshop Bundled Handout
AAFCS 2013 Entrepreneurship Workshop Bundled HandoutBarbara O'Neill
 
Entorno de Word 2007
Entorno de Word 2007Entorno de Word 2007
Entorno de Word 2007marlenecaiza
 
SpinetiX GTM DiVA (From Print to Digital)
SpinetiX GTM DiVA (From Print to Digital)SpinetiX GTM DiVA (From Print to Digital)
SpinetiX GTM DiVA (From Print to Digital)Paul Van Dort
 
The State Of Digital Excellence In The Pharmaceutical Industry In 2015: Capab...
The State Of Digital Excellence In The Pharmaceutical Industry In 2015: Capab...The State Of Digital Excellence In The Pharmaceutical Industry In 2015: Capab...
The State Of Digital Excellence In The Pharmaceutical Industry In 2015: Capab...DT Associates
 
Communication Strategy for Bangladesh ICT Products In brazil And Kazakhstan
Communication Strategy for Bangladesh ICT Products In brazil And KazakhstanCommunication Strategy for Bangladesh ICT Products In brazil And Kazakhstan
Communication Strategy for Bangladesh ICT Products In brazil And KazakhstanMd. Sohan Haidear
 
나눔의 본질을 묻다 사랑의열매 정동아카데미 35회 20160216
나눔의 본질을 묻다 사랑의열매 정동아카데미 35회 20160216나눔의 본질을 묻다 사랑의열매 정동아카데미 35회 20160216
나눔의 본질을 묻다 사랑의열매 정동아카데미 35회 20160216세진 양
 
2º dinastia
2º dinastia2º dinastia
2º dinastiablog-eic
 
A Study on usability of mobile games using color theory
A Study on usability of mobile games using color theoryA Study on usability of mobile games using color theory
A Study on usability of mobile games using color theoryWooseok Jeon
 
iOS Auto Layout
iOS Auto LayoutiOS Auto Layout
iOS Auto LayoutEunjoo Im
 
Apresentação para décimo segundo ano de 2016 7, aula 25
Apresentação para décimo segundo ano de 2016 7, aula 25Apresentação para décimo segundo ano de 2016 7, aula 25
Apresentação para décimo segundo ano de 2016 7, aula 25luisprista
 
Apresentação para décimo segundo ano de 2016 7, aula 33-34
Apresentação para décimo segundo ano de 2016 7, aula 33-34Apresentação para décimo segundo ano de 2016 7, aula 33-34
Apresentação para décimo segundo ano de 2016 7, aula 33-34luisprista
 
Apresentação para décimo segundo ano de 2016 7, aula 11-12
Apresentação para décimo segundo ano de 2016 7, aula 11-12Apresentação para décimo segundo ano de 2016 7, aula 11-12
Apresentação para décimo segundo ano de 2016 7, aula 11-12luisprista
 
Apresentação para décimo segundo ano de 2016 7, aula 20
Apresentação para décimo segundo ano de 2016 7, aula 20Apresentação para décimo segundo ano de 2016 7, aula 20
Apresentação para décimo segundo ano de 2016 7, aula 20luisprista
 
Apresentação para décimo segundo ano de 2016 7, aula 28-29
Apresentação para décimo segundo ano de 2016 7, aula 28-29Apresentação para décimo segundo ano de 2016 7, aula 28-29
Apresentação para décimo segundo ano de 2016 7, aula 28-29luisprista
 

En vedette (18)

Reese Brinkley Resume
Reese Brinkley ResumeReese Brinkley Resume
Reese Brinkley Resume
 
AAFCS 2013 Entrepreneurship Workshop Bundled Handout
AAFCS 2013 Entrepreneurship Workshop Bundled HandoutAAFCS 2013 Entrepreneurship Workshop Bundled Handout
AAFCS 2013 Entrepreneurship Workshop Bundled Handout
 
Entorno de Word 2007
Entorno de Word 2007Entorno de Word 2007
Entorno de Word 2007
 
Danny vera
Danny veraDanny vera
Danny vera
 
SpinetiX GTM DiVA (From Print to Digital)
SpinetiX GTM DiVA (From Print to Digital)SpinetiX GTM DiVA (From Print to Digital)
SpinetiX GTM DiVA (From Print to Digital)
 
The State Of Digital Excellence In The Pharmaceutical Industry In 2015: Capab...
The State Of Digital Excellence In The Pharmaceutical Industry In 2015: Capab...The State Of Digital Excellence In The Pharmaceutical Industry In 2015: Capab...
The State Of Digital Excellence In The Pharmaceutical Industry In 2015: Capab...
 
Communication Strategy for Bangladesh ICT Products In brazil And Kazakhstan
Communication Strategy for Bangladesh ICT Products In brazil And KazakhstanCommunication Strategy for Bangladesh ICT Products In brazil And Kazakhstan
Communication Strategy for Bangladesh ICT Products In brazil And Kazakhstan
 
나눔의 본질을 묻다 사랑의열매 정동아카데미 35회 20160216
나눔의 본질을 묻다 사랑의열매 정동아카데미 35회 20160216나눔의 본질을 묻다 사랑의열매 정동아카데미 35회 20160216
나눔의 본질을 묻다 사랑의열매 정동아카데미 35회 20160216
 
Ravik-Profile
Ravik-ProfileRavik-Profile
Ravik-Profile
 
2º dinastia
2º dinastia2º dinastia
2º dinastia
 
A Study on usability of mobile games using color theory
A Study on usability of mobile games using color theoryA Study on usability of mobile games using color theory
A Study on usability of mobile games using color theory
 
iOS Auto Layout
iOS Auto LayoutiOS Auto Layout
iOS Auto Layout
 
Start up
Start up Start up
Start up
 
Apresentação para décimo segundo ano de 2016 7, aula 25
Apresentação para décimo segundo ano de 2016 7, aula 25Apresentação para décimo segundo ano de 2016 7, aula 25
Apresentação para décimo segundo ano de 2016 7, aula 25
 
Apresentação para décimo segundo ano de 2016 7, aula 33-34
Apresentação para décimo segundo ano de 2016 7, aula 33-34Apresentação para décimo segundo ano de 2016 7, aula 33-34
Apresentação para décimo segundo ano de 2016 7, aula 33-34
 
Apresentação para décimo segundo ano de 2016 7, aula 11-12
Apresentação para décimo segundo ano de 2016 7, aula 11-12Apresentação para décimo segundo ano de 2016 7, aula 11-12
Apresentação para décimo segundo ano de 2016 7, aula 11-12
 
Apresentação para décimo segundo ano de 2016 7, aula 20
Apresentação para décimo segundo ano de 2016 7, aula 20Apresentação para décimo segundo ano de 2016 7, aula 20
Apresentação para décimo segundo ano de 2016 7, aula 20
 
Apresentação para décimo segundo ano de 2016 7, aula 28-29
Apresentação para décimo segundo ano de 2016 7, aula 28-29Apresentação para décimo segundo ano de 2016 7, aula 28-29
Apresentação para décimo segundo ano de 2016 7, aula 28-29
 

Similaire à International Journal of Engineering Research and Development (IJERD)

A_Noise_Reduction_Method_Based_on_LMS_Adaptive_Fil.pdf
A_Noise_Reduction_Method_Based_on_LMS_Adaptive_Fil.pdfA_Noise_Reduction_Method_Based_on_LMS_Adaptive_Fil.pdf
A_Noise_Reduction_Method_Based_on_LMS_Adaptive_Fil.pdfBala Murugan
 
Speech Enhancement for Nonstationary Noise Environments
Speech Enhancement for Nonstationary Noise EnvironmentsSpeech Enhancement for Nonstationary Noise Environments
Speech Enhancement for Nonstationary Noise Environmentssipij
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD Editor
 
Design and Implementation of Polyphase based Subband Adaptive Structure for N...
Design and Implementation of Polyphase based Subband Adaptive Structure for N...Design and Implementation of Polyphase based Subband Adaptive Structure for N...
Design and Implementation of Polyphase based Subband Adaptive Structure for N...Pratik Ghotkar
 
Improving the Efficiency of Spectral Subtraction Method by Combining it with ...
Improving the Efficiency of Spectral Subtraction Method by Combining it with ...Improving the Efficiency of Spectral Subtraction Method by Combining it with ...
Improving the Efficiency of Spectral Subtraction Method by Combining it with ...IJORCS
 
Enhanced modulation spectral subtraction for IOVT speech recognition application
Enhanced modulation spectral subtraction for IOVT speech recognition applicationEnhanced modulation spectral subtraction for IOVT speech recognition application
Enhanced modulation spectral subtraction for IOVT speech recognition applicationIRJET Journal
 
IRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
IRJET- Survey on Efficient Signal Processing Techniques for Speech EnhancementIRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
IRJET- Survey on Efficient Signal Processing Techniques for Speech EnhancementIRJET Journal
 
Research on VoIP Acoustic Echo Cancelation Algorithm Based on Speex
Research on VoIP Acoustic Echo Cancelation Algorithm Based on SpeexResearch on VoIP Acoustic Echo Cancelation Algorithm Based on Speex
Research on VoIP Acoustic Echo Cancelation Algorithm Based on SpeexTELKOMNIKA JOURNAL
 
A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...
A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...
A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...IRJET Journal
 
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...IJMER
 
Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderIJTET Journal
 
A New Speech Enhancement Technique to Reduce Residual Noise Using Perceptual ...
A New Speech Enhancement Technique to Reduce Residual Noise Using Perceptual ...A New Speech Enhancement Technique to Reduce Residual Noise Using Perceptual ...
A New Speech Enhancement Technique to Reduce Residual Noise Using Perceptual ...IOSR Journals
 
General Kalman Filter & Speech Enhancement for Speaker Identification
General Kalman Filter & Speech Enhancement for Speaker IdentificationGeneral Kalman Filter & Speech Enhancement for Speaker Identification
General Kalman Filter & Speech Enhancement for Speaker Identificationijcisjournal
 
Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features  Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features ijsc
 
P ERFORMANCE A NALYSIS O F A DAPTIVE N OISE C ANCELLER E MPLOYING N LMS A LG...
P ERFORMANCE A NALYSIS  O F A DAPTIVE N OISE C ANCELLER E MPLOYING N LMS A LG...P ERFORMANCE A NALYSIS  O F A DAPTIVE N OISE C ANCELLER E MPLOYING N LMS A LG...
P ERFORMANCE A NALYSIS O F A DAPTIVE N OISE C ANCELLER E MPLOYING N LMS A LG...ijwmn
 
Broad phoneme classification using signal based features
Broad phoneme classification using signal based featuresBroad phoneme classification using signal based features
Broad phoneme classification using signal based featuresijsc
 
A Combined Voice Activity Detector Based On Singular Value Decomposition and ...
A Combined Voice Activity Detector Based On Singular Value Decomposition and ...A Combined Voice Activity Detector Based On Singular Value Decomposition and ...
A Combined Voice Activity Detector Based On Singular Value Decomposition and ...CSCJournals
 
An improved anc system with
An improved anc system withAn improved anc system with
An improved anc system withsipij
 

Similaire à International Journal of Engineering Research and Development (IJERD) (20)

A_Noise_Reduction_Method_Based_on_LMS_Adaptive_Fil.pdf
A_Noise_Reduction_Method_Based_on_LMS_Adaptive_Fil.pdfA_Noise_Reduction_Method_Based_on_LMS_Adaptive_Fil.pdf
A_Noise_Reduction_Method_Based_on_LMS_Adaptive_Fil.pdf
 
Speech Enhancement for Nonstationary Noise Environments
Speech Enhancement for Nonstationary Noise EnvironmentsSpeech Enhancement for Nonstationary Noise Environments
Speech Enhancement for Nonstationary Noise Environments
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
 
Design and Implementation of Polyphase based Subband Adaptive Structure for N...
Design and Implementation of Polyphase based Subband Adaptive Structure for N...Design and Implementation of Polyphase based Subband Adaptive Structure for N...
Design and Implementation of Polyphase based Subband Adaptive Structure for N...
 
Improving the Efficiency of Spectral Subtraction Method by Combining it with ...
Improving the Efficiency of Spectral Subtraction Method by Combining it with ...Improving the Efficiency of Spectral Subtraction Method by Combining it with ...
Improving the Efficiency of Spectral Subtraction Method by Combining it with ...
 
Enhanced modulation spectral subtraction for IOVT speech recognition application
Enhanced modulation spectral subtraction for IOVT speech recognition applicationEnhanced modulation spectral subtraction for IOVT speech recognition application
Enhanced modulation spectral subtraction for IOVT speech recognition application
 
IRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
IRJET- Survey on Efficient Signal Processing Techniques for Speech EnhancementIRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
IRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
 
Research on VoIP Acoustic Echo Cancelation Algorithm Based on Speex
Research on VoIP Acoustic Echo Cancelation Algorithm Based on SpeexResearch on VoIP Acoustic Echo Cancelation Algorithm Based on Speex
Research on VoIP Acoustic Echo Cancelation Algorithm Based on Speex
 
A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...
A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...
A Noise Reduction Method Based on Modified Least Mean Square Algorithm of Rea...
 
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
 
H0814247
H0814247H0814247
H0814247
 
Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using Vocoder
 
A New Speech Enhancement Technique to Reduce Residual Noise Using Perceptual ...
A New Speech Enhancement Technique to Reduce Residual Noise Using Perceptual ...A New Speech Enhancement Technique to Reduce Residual Noise Using Perceptual ...
A New Speech Enhancement Technique to Reduce Residual Noise Using Perceptual ...
 
General Kalman Filter & Speech Enhancement for Speaker Identification
General Kalman Filter & Speech Enhancement for Speaker IdentificationGeneral Kalman Filter & Speech Enhancement for Speaker Identification
General Kalman Filter & Speech Enhancement for Speaker Identification
 
Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features  Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features
 
P ERFORMANCE A NALYSIS O F A DAPTIVE N OISE C ANCELLER E MPLOYING N LMS A LG...
P ERFORMANCE A NALYSIS  O F A DAPTIVE N OISE C ANCELLER E MPLOYING N LMS A LG...P ERFORMANCE A NALYSIS  O F A DAPTIVE N OISE C ANCELLER E MPLOYING N LMS A LG...
P ERFORMANCE A NALYSIS O F A DAPTIVE N OISE C ANCELLER E MPLOYING N LMS A LG...
 
F010334548
F010334548F010334548
F010334548
 
Broad phoneme classification using signal based features
Broad phoneme classification using signal based featuresBroad phoneme classification using signal based features
Broad phoneme classification using signal based features
 
A Combined Voice Activity Detector Based On Singular Value Decomposition and ...
A Combined Voice Activity Detector Based On Singular Value Decomposition and ...A Combined Voice Activity Detector Based On Singular Value Decomposition and ...
A Combined Voice Activity Detector Based On Singular Value Decomposition and ...
 
An improved anc system with
An improved anc system withAn improved anc system with
An improved anc system with
 

Plus de IJERD Editor

A Novel Method for Prevention of Bandwidth Distributed Denial of Service Attacks
A Novel Method for Prevention of Bandwidth Distributed Denial of Service AttacksA Novel Method for Prevention of Bandwidth Distributed Denial of Service Attacks
A Novel Method for Prevention of Bandwidth Distributed Denial of Service AttacksIJERD Editor
 
MEMS MICROPHONE INTERFACE
MEMS MICROPHONE INTERFACEMEMS MICROPHONE INTERFACE
MEMS MICROPHONE INTERFACEIJERD Editor
 
Influence of tensile behaviour of slab on the structural Behaviour of shear c...
Influence of tensile behaviour of slab on the structural Behaviour of shear c...Influence of tensile behaviour of slab on the structural Behaviour of shear c...
Influence of tensile behaviour of slab on the structural Behaviour of shear c...IJERD Editor
 
Gold prospecting using Remote Sensing ‘A case study of Sudan’
Gold prospecting using Remote Sensing ‘A case study of Sudan’Gold prospecting using Remote Sensing ‘A case study of Sudan’
Gold prospecting using Remote Sensing ‘A case study of Sudan’IJERD Editor
 
Reducing Corrosion Rate by Welding Design
Reducing Corrosion Rate by Welding DesignReducing Corrosion Rate by Welding Design
Reducing Corrosion Rate by Welding DesignIJERD Editor
 
Router 1X3 – RTL Design and Verification
Router 1X3 – RTL Design and VerificationRouter 1X3 – RTL Design and Verification
Router 1X3 – RTL Design and VerificationIJERD Editor
 
Active Power Exchange in Distributed Power-Flow Controller (DPFC) At Third Ha...
Active Power Exchange in Distributed Power-Flow Controller (DPFC) At Third Ha...Active Power Exchange in Distributed Power-Flow Controller (DPFC) At Third Ha...
Active Power Exchange in Distributed Power-Flow Controller (DPFC) At Third Ha...IJERD Editor
 
Mitigation of Voltage Sag/Swell with Fuzzy Control Reduced Rating DVR
Mitigation of Voltage Sag/Swell with Fuzzy Control Reduced Rating DVRMitigation of Voltage Sag/Swell with Fuzzy Control Reduced Rating DVR
Mitigation of Voltage Sag/Swell with Fuzzy Control Reduced Rating DVRIJERD Editor
 
Study on the Fused Deposition Modelling In Additive Manufacturing
Study on the Fused Deposition Modelling In Additive ManufacturingStudy on the Fused Deposition Modelling In Additive Manufacturing
Study on the Fused Deposition Modelling In Additive ManufacturingIJERD Editor
 
Spyware triggering system by particular string value
Spyware triggering system by particular string valueSpyware triggering system by particular string value
Spyware triggering system by particular string valueIJERD Editor
 
A Blind Steganalysis on JPEG Gray Level Image Based on Statistical Features a...
A Blind Steganalysis on JPEG Gray Level Image Based on Statistical Features a...A Blind Steganalysis on JPEG Gray Level Image Based on Statistical Features a...
A Blind Steganalysis on JPEG Gray Level Image Based on Statistical Features a...IJERD Editor
 
Secure Image Transmission for Cloud Storage System Using Hybrid Scheme
Secure Image Transmission for Cloud Storage System Using Hybrid SchemeSecure Image Transmission for Cloud Storage System Using Hybrid Scheme
Secure Image Transmission for Cloud Storage System Using Hybrid SchemeIJERD Editor
 
Application of Buckley-Leverett Equation in Modeling the Radius of Invasion i...
Application of Buckley-Leverett Equation in Modeling the Radius of Invasion i...Application of Buckley-Leverett Equation in Modeling the Radius of Invasion i...
Application of Buckley-Leverett Equation in Modeling the Radius of Invasion i...IJERD Editor
 
Gesture Gaming on the World Wide Web Using an Ordinary Web Camera
Gesture Gaming on the World Wide Web Using an Ordinary Web CameraGesture Gaming on the World Wide Web Using an Ordinary Web Camera
Gesture Gaming on the World Wide Web Using an Ordinary Web CameraIJERD Editor
 
Hardware Analysis of Resonant Frequency Converter Using Isolated Circuits And...
Hardware Analysis of Resonant Frequency Converter Using Isolated Circuits And...Hardware Analysis of Resonant Frequency Converter Using Isolated Circuits And...
Hardware Analysis of Resonant Frequency Converter Using Isolated Circuits And...IJERD Editor
 
Simulated Analysis of Resonant Frequency Converter Using Different Tank Circu...
Simulated Analysis of Resonant Frequency Converter Using Different Tank Circu...Simulated Analysis of Resonant Frequency Converter Using Different Tank Circu...
Simulated Analysis of Resonant Frequency Converter Using Different Tank Circu...IJERD Editor
 
Moon-bounce: A Boon for VHF Dxing
Moon-bounce: A Boon for VHF DxingMoon-bounce: A Boon for VHF Dxing
Moon-bounce: A Boon for VHF DxingIJERD Editor
 
“MS-Extractor: An Innovative Approach to Extract Microsatellites on „Y‟ Chrom...
“MS-Extractor: An Innovative Approach to Extract Microsatellites on „Y‟ Chrom...“MS-Extractor: An Innovative Approach to Extract Microsatellites on „Y‟ Chrom...
“MS-Extractor: An Innovative Approach to Extract Microsatellites on „Y‟ Chrom...IJERD Editor
 
Importance of Measurements in Smart Grid
Importance of Measurements in Smart GridImportance of Measurements in Smart Grid
Importance of Measurements in Smart GridIJERD Editor
 
Study of Macro level Properties of SCC using GGBS and Lime stone powder
Study of Macro level Properties of SCC using GGBS and Lime stone powderStudy of Macro level Properties of SCC using GGBS and Lime stone powder
Study of Macro level Properties of SCC using GGBS and Lime stone powderIJERD Editor
 

Plus de IJERD Editor (20)

A Novel Method for Prevention of Bandwidth Distributed Denial of Service Attacks
A Novel Method for Prevention of Bandwidth Distributed Denial of Service AttacksA Novel Method for Prevention of Bandwidth Distributed Denial of Service Attacks
A Novel Method for Prevention of Bandwidth Distributed Denial of Service Attacks
 
MEMS MICROPHONE INTERFACE
MEMS MICROPHONE INTERFACEMEMS MICROPHONE INTERFACE
MEMS MICROPHONE INTERFACE
 
Influence of tensile behaviour of slab on the structural Behaviour of shear c...
Influence of tensile behaviour of slab on the structural Behaviour of shear c...Influence of tensile behaviour of slab on the structural Behaviour of shear c...
Influence of tensile behaviour of slab on the structural Behaviour of shear c...
 
Gold prospecting using Remote Sensing ‘A case study of Sudan’
Gold prospecting using Remote Sensing ‘A case study of Sudan’Gold prospecting using Remote Sensing ‘A case study of Sudan’
Gold prospecting using Remote Sensing ‘A case study of Sudan’
 
Reducing Corrosion Rate by Welding Design
Reducing Corrosion Rate by Welding DesignReducing Corrosion Rate by Welding Design
Reducing Corrosion Rate by Welding Design
 
Router 1X3 – RTL Design and Verification
Router 1X3 – RTL Design and VerificationRouter 1X3 – RTL Design and Verification
Router 1X3 – RTL Design and Verification
 
Active Power Exchange in Distributed Power-Flow Controller (DPFC) At Third Ha...
Active Power Exchange in Distributed Power-Flow Controller (DPFC) At Third Ha...Active Power Exchange in Distributed Power-Flow Controller (DPFC) At Third Ha...
Active Power Exchange in Distributed Power-Flow Controller (DPFC) At Third Ha...
 
Mitigation of Voltage Sag/Swell with Fuzzy Control Reduced Rating DVR
Mitigation of Voltage Sag/Swell with Fuzzy Control Reduced Rating DVRMitigation of Voltage Sag/Swell with Fuzzy Control Reduced Rating DVR
Mitigation of Voltage Sag/Swell with Fuzzy Control Reduced Rating DVR
 
Study on the Fused Deposition Modelling In Additive Manufacturing
Study on the Fused Deposition Modelling In Additive ManufacturingStudy on the Fused Deposition Modelling In Additive Manufacturing
Study on the Fused Deposition Modelling In Additive Manufacturing
 
Spyware triggering system by particular string value
Spyware triggering system by particular string valueSpyware triggering system by particular string value
Spyware triggering system by particular string value
 
A Blind Steganalysis on JPEG Gray Level Image Based on Statistical Features a...
A Blind Steganalysis on JPEG Gray Level Image Based on Statistical Features a...A Blind Steganalysis on JPEG Gray Level Image Based on Statistical Features a...
A Blind Steganalysis on JPEG Gray Level Image Based on Statistical Features a...
 
Secure Image Transmission for Cloud Storage System Using Hybrid Scheme
Secure Image Transmission for Cloud Storage System Using Hybrid SchemeSecure Image Transmission for Cloud Storage System Using Hybrid Scheme
Secure Image Transmission for Cloud Storage System Using Hybrid Scheme
 
Application of Buckley-Leverett Equation in Modeling the Radius of Invasion i...
Application of Buckley-Leverett Equation in Modeling the Radius of Invasion i...Application of Buckley-Leverett Equation in Modeling the Radius of Invasion i...
Application of Buckley-Leverett Equation in Modeling the Radius of Invasion i...
 
Gesture Gaming on the World Wide Web Using an Ordinary Web Camera
Gesture Gaming on the World Wide Web Using an Ordinary Web CameraGesture Gaming on the World Wide Web Using an Ordinary Web Camera
Gesture Gaming on the World Wide Web Using an Ordinary Web Camera
 
Hardware Analysis of Resonant Frequency Converter Using Isolated Circuits And...
Hardware Analysis of Resonant Frequency Converter Using Isolated Circuits And...Hardware Analysis of Resonant Frequency Converter Using Isolated Circuits And...
Hardware Analysis of Resonant Frequency Converter Using Isolated Circuits And...
 
Simulated Analysis of Resonant Frequency Converter Using Different Tank Circu...
Simulated Analysis of Resonant Frequency Converter Using Different Tank Circu...Simulated Analysis of Resonant Frequency Converter Using Different Tank Circu...
Simulated Analysis of Resonant Frequency Converter Using Different Tank Circu...
 
Moon-bounce: A Boon for VHF Dxing
Moon-bounce: A Boon for VHF DxingMoon-bounce: A Boon for VHF Dxing
Moon-bounce: A Boon for VHF Dxing
 
“MS-Extractor: An Innovative Approach to Extract Microsatellites on „Y‟ Chrom...
“MS-Extractor: An Innovative Approach to Extract Microsatellites on „Y‟ Chrom...“MS-Extractor: An Innovative Approach to Extract Microsatellites on „Y‟ Chrom...
“MS-Extractor: An Innovative Approach to Extract Microsatellites on „Y‟ Chrom...
 
Importance of Measurements in Smart Grid
Importance of Measurements in Smart GridImportance of Measurements in Smart Grid
Importance of Measurements in Smart Grid
 
Study of Macro level Properties of SCC using GGBS and Lime stone powder
Study of Macro level Properties of SCC using GGBS and Lime stone powderStudy of Macro level Properties of SCC using GGBS and Lime stone powder
Study of Macro level Properties of SCC using GGBS and Lime stone powder
 

Dernier

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKJago de Vreede
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 

Dernier (20)

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 

International Journal of Engineering Research and Development (IJERD)

  • 1. International Journal of Engineering Research and Development e-ISSN: 2278-067X, p-ISSN: 2278-800X, www.ijerd.com Volume 7, Issue 5 (June 2013), PP. 23-30 23 An Auto-Correlation Based Speech Enhancement Algorithm Lalchhandami1 , Maninder Pal 2 1,2 Department of Electronics & Communication Engineering Maharishi Markandeshwar University, Mullana (Ambala), INDIA Abstract:- Enhancement of speech signals recorded using signal channel devices such as mobile phones is of prime interest. It is because for these devices, it is not possible to record noise signals separately, and the surrounding background noises are picked up by their microphone simultaneously with the speech signal. This may even completely fade-in the speech signal, depending upon the signal-to-noise ratio (SNR). Therefore to address this problem, number of algorithms and techniques has been developed. However, the existing methods are not able to perform homogenously across all noise types. The auto-correlation function of a noisy speech signal is usually confined to lower time lag and is very small or zero for higher time lag. Therefore, the higher-lag auto-correlation coefficients are relatively robust to additive noise distortion. This paper is focused on enhancing the noisy speech signal from single channel devices by using only the higher-lag auto-correlation coefficients. The efficiency of the algorithm is evaluated in terms of energy, zero crossings and intelligibility of speech signal. Keywords:- Single channel speech enhancement; speech processing; spectral subtraction auto- correlation and Speech signals SNR I. INTRODUCTION Speech enhancement is a topic of interest for last many years. In particularly, enhancement of speech signals recorded using signal channel devices such as mobile phones is of prime interest in this paper. It is because for these devices, it is not possible to record noise signals separately, and the surrounding background noises are picked up by its microphone simultaneously with the speech signal. This may even completely fade- in the speech signal, depending upon the signal-to-noise ratio (SNR). Therefore to address this problem, number of algorithms and techniques has been developed. The key techniques include: suppression of noise using the periodicity of speech or noise, enhancement based on perceptual criteria and subtractive-type algorithms such as spectral subtraction and Wiener filter [1], [ 6]. Among these, the spectral subtraction algorithm is the oldest speech enhancement algorithm. In this approach, the degraded speech signal is enhanced by subtracting an estimate of the average noise spectrum from a noisy speech spectrum and the noise spectrum is estimated or updated during the silence period of the speaker. However, the existing methods are not able to perform homogenously across all noise types. It is because all of these speech enhancement systems are based on certain assumptions and constraints that are typically dependent on the application and the environment. This paper is focused on reducing the continuous background noises from the speech signals recorded using single channel microphone based devices. For this purpose, auto-correlation function in time domain and frequency domain is used. The magnitude of auto-correlation coefficients is usually large between 2ms and 12ms, as the human pitch period is typically constrained between these values. However, the same is generally not true for noisy speech signals. The auto-correlation function of a noisy speech signal is usually confined to lower time lag and is very small or zero for higher time lag. The higher-lag auto-correlation coefficients are relatively robust to additive noise distortion [5]. Therefore, this paper is focused on enhancing the noisy speech signal from single channel devices by using only the higher-lag auto-correlation coefficients. The efficiency of the algorithm is evaluated in terms of energy, zero crossings and intelligibility of speech signal. This is presented below. II. SYSTEM MODEL As discussed above, the major constraint of single channel methods in speech enhancement is that there is no reference signal for the noise available. Therefore, the power spectral density of the noise needs to be estimated based on the available noisy speech signal only and this makes it a challenging task. In all single channel enhancement techniques, the noisy speech signal is given by
  • 2. An Auto-Correlation Based Speech Enhancement Algorithm 24 𝑦 𝑛 = 𝑥 𝑛 + 𝑑(𝑛) (1) Where, 𝑥 𝑛 represents the clean speech signal, 𝑑 𝑛 is the uncorrelated additive noise and 𝑦 𝑛 represents the degraded noisy speech signal. To enhance such noisy speech signals, an auto-correlation function based speech enhancement is presented in this paper. Its principle is based on traditional Spectral Subtraction method. In this Spectral Subtraction method, the degraded speech signals are enhanced by subtracting the estimate of the average noise spectrum from a noisy speech spectrum [1]. The noise spectrum is estimated during the periods when the signal is absent; which is usually very difficult to do in practice. In addition, it is also assumed that speech and noise is additive and uncorrelated. An estimate of the clean signal 𝑥 𝑛 is recovered from the noisy signal 𝑦(𝑛) by assuming that there is an estimate of the power spectrum of noise 𝐷𝑘 2 , which is obtained by averaging over multiple frames of a known noise segment. An estimate of the short-time squared magnitude spectrum of the clean signal using this method can be obtained as follows: 𝑋 𝑘 2 = 𝑌𝑘 2 − 𝐷𝑘 2 , if 𝑌𝑘 2 − 𝐷𝑘 2 ≥ 0 0, otherwise (2) Where, |𝑌𝑘 | is the magnitude of the noisy signal spectrum, 𝑋 𝑘 is the estimate of the clean frequency spectrum for a given frame, and |𝐷𝑘| is the magnitude of noise spectrum estimate during non-speech activity. To recover the signal, the magnitude spectrum estimate is finally combined with the phase of the noisy signal, using Inverse Discrete Fourier Transform (IDFT) as follows: 𝑥 𝑛 = 𝑋 𝑘 e𝑗 𝜑(𝑦,𝑘) (3) Although the spectral subtraction algorithm can be easily implemented to effectively reduce noise present in the corrupted signal; yet, it has several shortcomings. The major drawback of this method is the resulting musical noise, due to rapid coming and going of speech signals over successive frames. This is why this paper focuses on using auto-correlation function instead of power spectrum. The autocorrelation function of a signal contains the same information about the signal as its power spectrum [2]. However, the main difference between the power spectrum and auto-correlation domain is that in the power spectrum domain, the information is presented as a function of frequency; while in the latter, it is presented as a function of time. More specifically, the higher-lag auto-correlation coefficients of the speech signal 𝑥(𝑛) usually contain information about the signal’s power spectrum; whereas, the magnitude of higher-lag auto- correlation coefficients of the noise signal 𝑑(𝑛) is relatively small for some noise types. Therefore, the lower-lag auto-correlation coefficients can be discarded and only higher-lag auto-correlation coefficients of the noisy speech signal can be used for spectral estimate. The spectral estimation is done using only the higher-lag auto- correlation coefficients; and, the speech and noise signals can be separated without having to estimate the noise signal directly. The estimated power spectrum can then be used to enhance the corrupted noisy speech signals. It is to be noted that the auto-correlation function of a signal can be computed in time domain and frequency domain. This paper has focused on computing auto-correlation function in both domains in order to investigate their effect on enhancing speech signals. III. IMPLEMENTATION PROCEDURE The flowchart of the autocorrelation based speech enhancement algorithm is given in Fig. 1. As shown in Fig. 1, the first step in spectral estimation is to divide the noisy speech signal into overlapping short time frames. These short time frames are 64 ms (512 samples) long. A Hamming window with 50% overlapping is then applied on these short time frames. This process is defined by Eqn 4, where 𝑥(𝑛) is discrete speech signal and 𝑤(𝑛) is the window function shifted by 𝑚 samples. 𝑥 𝑤 𝑛 = 𝑥 𝑛 𝑤 𝑚 − 𝑛 (4) By taking the Fourier Transform of 𝑥 𝑤 𝑛 , a discrete Short-Time Fourier Transform can be computed as shown in Eqn 5. 𝑋 𝑤, 𝑘 = 1 𝑁 𝑥 𝑤 𝑛 𝑤 𝑛=𝑤−𝑁+1 𝑒 −𝑗2𝜋𝑘𝑛 𝑁 𝑘 = 0, … , 𝑁 − 1 0 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒 (5) The resulting discrete STFT can be expressed as 𝑋 𝑤, 𝑘 = |𝑋 𝑤, 𝑘 |𝑒 𝑗𝜑 (𝑤,𝑘) (6) The |𝑋 𝑤, 𝑘 | component is the magnitude of the Short-Time Fourier Transform and the 𝜑 𝑤, 𝑘 component is the phase. For speech recorded using single-channel based devices, the noisy signal 𝑦 𝑛 can be analytically expressed in frequency domain as: 𝑌𝑘 𝑒 𝑗 𝜑(𝑦,𝑘) = 𝑋 𝑘 𝑒 𝑗 𝜑(𝑥,𝑘) +𝐷𝑘 𝑒 𝑗 𝜑(𝑑,𝑘) (7) In the modification stage, the degraded signal in the frequency domain is modified in order to restore the
  • 3. An Auto-Correlation Based Speech Enhancement Algorithm 25 original speech signal. For this purpose, a biased auto-correlation sequence of the windowed time frames is then computed. As the power spectrum of an auto-correlation sequence has dynamic range twice as large as that of the signal’s power spectrum, so if a Hamming window is applied to the positive side of the auto-correlation sequence, the dynamic range would be effectively reduced by half. Therefore, to address this problem, a new window of double dynamic range is used to process the one-sided auto-correlation sequence. For implementing this window, the dynamic range of Hamming window is made double by computing its auto-correlation sequence and resulting auto-correlation sequence. The resulting sequence is defined as the coefficients of the new high dynamic range window. To increase the noise reduction effect, the new window is placed over the higher-lag region of the auto- correlation sequence, which is taken to be greater than 2 ms (16 samples), as there is natural null in the average short-time auto-correlation of speech signals at approximately 2 ms lag. An estimate of the original speech signals power spectrum is then computed for each frame as the magnitude spectrum of the windowed higher-lag autocorrelation coefficients. This is done by pairing the modified magnitude spectra 𝑋 𝑘 with each of the original corresponding phase spectra 𝜑(𝑦, 𝑘). The degraded phase spectra are then used to create the enhanced spectra. The enhanced speech frames are then synthesized using an inverse Fourier transform of the complex spectra as mentioned in Eqn 8. 𝑥 𝑛 = 𝐼𝐷𝐹𝑇 𝑋 𝑘 𝑒 𝑗 𝜑(𝑦,𝑘) (8) These frames are then overlapped and added to obtain the enhanced speech signal. Fig. 1: Flowchart of power spectral estimate using higher-lag auto-correlation coefficients. The block diagram of the speech enhancement using higher-lag auto-correlation coefficients is shown in Fig. 2. Fig. 2: Block diagram of the auto-correlation based speech enhancement algorithm. IV. RESULTS & DISCUSSION To evaluate the performance of the auto-correlation based speech enhancement algorithm, the same is implemented in MATLAB and extensive simulations have been performed. A small length of approximately
  • 4. An Auto-Correlation Based Speech Enhancement Algorithm 26 10 seconds of speech of a female person, in the age group 20 to 25 years, is recorded in the presence of noise. The noise is produced by two sources: running fan and car. The speech signals are recorded separately in both noise conditions. To record signals at different SNR’s, the speed of fan is changed to change the level of noise. The mobile phone and sound card used for recording signals has a single microphone, so the speech signal and noises are both picked up by the microphone. These recorded signals are called noisy speech signals. These noisy speech signals are then processed by an auto-correlation based speech enhancement algorithm and the estimated cleaned signals are then stored and listened back to estimate their degree of similarity (intelligibility) with their original version. Auto-correlation function is computed by two methods: the generalized cross- correlation (GCC) method in frequency domain and using Matlab xcorr command in time domain. The results obtained for these two methods are then compared. These results are shown in Fig. 4 to Fig. 15, and the key conclusions drawn from the results are discussed below. 1. Fig. 4 & Fig. 5 shows the auto-correlation function and one-sided auto-correlation window respectively of one frame (i.e. 512 samples). For enhancing the speech signal, the lower lags of 16 samples (2 ms) of the one-sided auto-correlation function are discarded and only the higher-lag coefficients are retained. A Hamming window with wide range (width 512 samples) is applied to the higher-lag auto-correlation sequence. It is observed that this wide dynamic range Hamming window has peak in the middle frequencies and decrease to zero at the edges; thereby, reducing the effects of the discontinuities as a result of finite duration. It is also observed that this window captures all the harmonic peaks of the auto-correlation sequence, and therefore, captures all the formant structures with no spectral loss. This indicates that this window works well in the auto-correlation domain in enhancing the noisy speech signal. 2. Fig. 6 show the power spectrum estimate of a single frame (512 samples) of the speech signals; which is used to obtain the enhanced version of the signal. The estimated power spectrum is computed on a frame to frame basis, and need not necessarily be computed during the non-speech activity of the speaker, as required in the spectral subtraction algorithm. This is the key advantages of this algorithm. 3. Fig. 7, Fig. 12 & Fig. 13 shows the recorded speech signal with different levels of noises i.e. different SNRs and in the presence of various noise producing sources. The energies of the recorded and the recovered speech signals are computed and is observed that a high level of energy (above threshold) of the speech signal is approximately at the same time instance for both the recorded and the recovered signal. It is also observed that the zero-crossing rate is high during the speech period of the signal Thus, the intelligibility of the recovered signal is maintained. 4. In Fig. 7 to Fig. 15, the recorded speech signals are processed by the auto-correlation based speech enhancement method. The auto-correlation function is calculated using the GCC method in frequency domain and using xcorr Matlab command for time domain. It can be observed that the second method (xcorr) gives better results in reducing the noise from the speech signals; however, at the expense of small decrease in magnitude of the speech signal. The decrease in the magnitude of the recovered speech signal is uniform; thus, the algorithm maintains the intelligibility of the recovered speech signal. 5. Fig. 11, Fig.14 & Fig. 15 shows the spectrograms of the recorded noisy signal and the recovered signal obtained using the auto-correlation based algorithm. The spectrograms showed that the noise energy, at a particular frequency and time, is reduced to a considerable extent in the recovered speech signal; thereby, improving the quality of the recovered speech signals. 6. Fig. 12 & Fig. 14 shows the results of noisy speech signals corresponding to a female voice of approximately 10 seconds, recorded in a running car by using a microphone and sound card, and mobile phone. It is observed that the recovered signal using xcorr gives better results than the GCC method, as both musical and residual noises are observed in the signals recovered using GCC method. The continuous noises from the ambient environment are reduced to a significant extent in the recovered signal by the two methods; thereby, proving the effectiveness of the algorithm for continuous surrounding noises. 7. Fig. 13 & Fig. 15 shows the results for speech signals recorded using a single microphone (mono) based mobile phone, used by a person in a running car. The recorded signal is then processed by the auto- correlation based algorithm and is observed that the moving wind noise and engine noise of car are reduced in the recovered signal. It is also observed from the spectrogram that the energy of the noises lie in the audio frequency range indicating that the speech signals gets easily distorted by the ambient noises. The spectrogram of the recovered signal shows that the energy of these noises is reduced in the recovered signal. Thus, the algorithm works well for enhancing speech signals recorded using single channel devices in a noisy environment. This justifies that this algorithm is designed to minimize the noises produced by constant sources such as vehicles and is well suited for environments such as persons using mobile phones in cars, trains etc.
  • 5. An Auto-Correlation Based Speech Enhancement Algorithm 27 Fig. 3: Recorded speech (female speaking /a/ sound) signal of length 10 seconds in presence of noise produced by fan moving at constant speed. Fig. 4: An example of autocorrelation of a single speech frame (512 samples) using xcorr and generalized cross-correlation (GCC). Fig. 5: An example of autocorrelation window (single sided) of single frame (512 samples). Fig. 6: An example of the power spectrum estimate of a single frame (512 samples). Fig. 7: The recorded speech (female speaking /a/, /b/ and /c/ sound) signal of length 10 seconds in presence of noise produced by a fan moving at constant speed, and the recovered signals using GCC (frequency domain) and xcorr (time domain). Fig. 8: The recorded speech signal with its energy (logarithmic scale) and zero crossing rate.
  • 6. An Auto-Correlation Based Speech Enhancement Algorithm 28 Fig. 9: The recovered speech signal (using GCC) with its energy (logarithmic scale) and zero crossing rate. Fig. 10: The recovered speech signal (using xcorr) with its energy (logarithmic scale) and zero crossing rate. a) The spectrogram of recorded noisy speech signal. b) The spectrogram of recovered speech signal using GCC. c) The spectrogram of recovered speech signal using xcorr. Fig. 11: The spectrogram of a) recorded speech signal, b) recovered signal using GCC, and c) recovered signal using xcorr. Fig. 12: The recorded speech signal of a female /a/, /b/ and /c/ sound of length 10 seconds in running car and the recovered signals using GCC (frequency domain) and xcorr (time domain). Fig. 13: Mobile phone recorded speech signal corresponding to alphabets /a/ and /b/, produced by male speaker in running car and the recovered signals using GCC (frequency domain) and xcorr (time domain).
  • 7. An Auto-Correlation Based Speech Enhancement Algorithm 29 a) The spectrogram of recorded noisy speech signal (Corresponding to Fig. 12). b) The spectrogram of recovered speech signal using GCC. c) The spectrogram of recovered speech signal using xcorr. Fig. 14: The spectrogram of a) recorded speech signal (corresponding to Figure 12), b) recovered signal using GCC, and c) recovered signal using xcorr. a) The spectrogram of recorded noisy speech signal (Corresponding to Fig. 12). b) The spectrogram of recovered speech signal using GCC. c) The spectrogram of recovered speech signal using xcorr. Fig. 15: The spectrogram of a) recorded speech signal using using mobile phone in Fig. 13, b) recovered signal using GCC, and c) recovered signal using xcorr. V. CONCLUSIONS This paper presented a speech enhancement algorithm based on auto-correlation function. In this approach, it is expected that by discarding the more noise affected lower lag auto-correlation coefficients, an estimate of power spectrum of clean signal can be obtained from the magnitude spectrum of higher lag auto- correlation coefficients and finally the enhanced speech signal can be obtained using the overlap-add method. More specifically, it presented the results obtained on applying the auto-correlation based speech enhancement algorithm on the recorded speech signals with continuous background noises. The speech enhancement is performed by using auto-correlation in both time domain and frequency domain. From the results obtained, it is found that the time domain auto-correlation gives better results in reducing the noise from the speech signal and gives better quality recovered signals as compared to the frequency domain auto-correlation. With the auto- correlation based algorithm, the stationary noises (e.g., the moving fan noise and car noise) are reduced in the recovered speech signals; but, at the expense of small decrease in magnitude of the speech signal. However, the decrease in the magnitude of the recovered speech signal is uniform; thus, the algorithm maintains the intelligibility of the original speech signal. It is also observed from the results that the algorithm does not work well for non-stationary background noises, as these noises are still audible in the recovered signal. Therefore, it is concluded that this algorithm is capable of enhancing the speech signal of a single channel based devices and perform well for continuous or stationary noises. REFERENCES [1]. Z. Chen, “Simulation of Spectral Subtraction Based Noise Reduction Method,” International Journal of Advanced Computer Science and Applications, Vol. 2, No. 8, 2011.
  • 8. An Auto-Correlation Based Speech Enhancement Algorithm 30 [2]. M. A. Hasan & T. Shimamura, “A Fundamental Frequency Extraction Method Based on Windowless and Normalized Autocorrelation Functions,” Proc. 6th WSEAS Int. Conf. Circuits, Systems, Signal and Telecommunications, Cambridge, pp. 305-309, 2012. [3]. M. Gabrea, “Two Microphones Speech Enhancement Systems based on Instrumental Variable Algorithm for Speaker Identification,” Proc. Of the 24th Canadian Conference on Electrical and Computer Engineering, May 2011. [4]. J. I. Hurtado & D. V. Anderson, “FFT-Based Block Processing in Speech Enhancement: Potential Artifacts and Solutions,” IEEE Transactions on Audio, Speech, and Language Processing, Vol. 19, No. 8, November 2011. [5]. B. J. Shannon, K. K. Paliwal, & C. Nadeu, “Speech enhancement based on spectral estimation from higher-lag autocorrelation”, Proc. Interspeech, 2006. [6]. L. D. Alsteris & K. K. Paliwal, “Importance Of Window Shape For Phase-Only Reconstruction Of Speech,” International Conf. on Acoustics, Speech, and Signal Processing-ICASSP, Vol. 1, pp. I-573- 6, 2004. [7]. P. Basu, P. J. Wolfe, D. Rudoy, T. F. Quatieri & Bob Dunn, “Adaptive Short-Time Analysis-Synthesis for Speech Enhancement,” International Conf. on Acoustics, Speech, and Signal Processing-ICASSP, pp. 4905-4908, 2008. [8]. A. K. Swain & W. Abdulla, “Estimation of LPC Parameters of Speech Signals in Noisy Environment,” TENCON, Vol. 1, pp. 139-142, 2004. [9]. H. G. Kim, M. Schwab, N. Moreau & T. Sikora, “Speech Enhancement of Noisy Speech Using Log- Spectral Amplitude Estimator and Harmonic Tunneling,” International Workshop on Acoustic Echo and Noise Control, Japan, September 2003. [10]. K. A. Sheela and K. S. Prasad, “Less Computational Complex Method for Estimation of Non- Stationary Noise in Speech Using Low frequency Regions for Power Spectral Subtraction,” National conference on Communications (NCC2007) IIT Kanpur, January 2007. [11]. T. Fingscheidt, S. Suhadi, & S. Stan, “Environment-Optimized Speech Enhancement,” IEEE Transactions on Audio, Speech & Language Processing, pp. 825-834, 2008. [12]. R. Martin, “Noise power spectral density estimation based on optimal smoothing and minimum statistics,” IEEE Transactions on Speech and Audio Processing, Vol. 9, No. 5, pp. 504-512, July 2001. [13]. M. K. Hasan, S. Salahuddin & M. R Khan, “A modified a priori SNR for speech enhancement using spectral subtraction rules,” IEEE Signal Processing Letters, Vol. 11, No. 4, pp. 450-453, April 2004. [14]. S. Ogata, S. Ebataz & T. Shimamuray, “Improved Model of SPAC (Speech Processing System by Use of Auto-Correlation Function) Utilizing Spectral Subtraction as Preprocessing,” Proc. of ICA 2004: The 18th international congress on acoustics, Kyoto, Japan, pp. IV-3037 - IV-3040, April 2004. [15]. Y. Hu, M. Bhatnagar & P. C. Loizou, “A Cross-Correlation Technique for Enhancing Speech Corrupted with Correlated Noise,” Proc. of IEEE International Conference on Acoustic Speech Signal Processing, Vol. 1, Salt Lake City, UT, pp. 673-676, 2001. [16]. J. B. Boldt, D. Ellis, “A Simple Correlation-Based Model of Intelligibility for Nonlinear Speech Enhancement and Separation,” Proc. of the 17th European Signal Processing Conference EURASIP, 2009. [17]. M. A. Hasan & T. Shimamura, “An Efficient Pitch Estimation Method Using Windowless and Normalized Autocorrelation Functions in Noisy Environments,” International Journal of Circuits, Systems and Signal Processing, Vol. 6 No. 1, pp. 197-204, 2012.