Improved Speech Enhancement Algorithm Based on Bark Bands Noise-Estimation for Non-Stationary Environment

Abstract:

The conventional spectral subtraction algorithm cannot effectively suppress noise in highly non-stationary environments, and residual musical noise is often audible in the enhanced speech. To improve speech enhancement performance, a novel denoising algorithm is proposed that combines speech endpoint detection using spectral variance with dynamic spectral subtraction in Bark bands. Following human auditory characteristics, the Bark-band spectra of the noisy speech signal are calculated first, and the noise power spectrum of each Bark band is then tracked and estimated by an improved minima controlled recursive averaging method. This noise estimate is updated frame by frame and is more accurate in non-stationary environments. Experimental results show that the proposed method suppresses noise more effectively than conventional spectral subtraction and that residual musical noise is almost eliminated.
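The pipeline described in the abstract (Bark-band grouping, per-band noise tracking, subtraction) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify their improved estimator, so the code below uses the basic minima controlled recursive averaging recursion as a stand-in, and it omits the endpoint-detection stage. The Bark mapping uses the Zwicker-Terhardt approximation. The names `hz_to_bark`, `bark_band_powers`, `BandNoiseTracker`, and `subtract`, as well as all parameter values (`alpha_s`, `alpha_d`, `alpha_p`, `delta`, `window`, `over`, `floor`), are assumptions chosen for illustration.

```python
import math


def hz_to_bark(f_hz):
    """Zwicker-Terhardt approximation of the Bark scale."""
    return 13.0 * math.atan(0.00076 * f_hz) + 3.5 * math.atan((f_hz / 7500.0) ** 2)


def bark_band_powers(power_spectrum, sample_rate):
    """Sum one-sided FFT bin powers (DC..Nyquist, at least 2 bins) into Bark bands."""
    n_bins = len(power_spectrum)
    nyquist = sample_rate / 2.0
    bands = [0.0] * (int(hz_to_bark(nyquist)) + 1)
    for k, p in enumerate(power_spectrum):
        f = k * nyquist / (n_bins - 1)      # centre frequency of bin k
        bands[int(hz_to_bark(f))] += p      # integer part of Bark value = band index
    return bands


class BandNoiseTracker:
    """Basic MCRA-style noise tracker, one state per Bark band (illustrative)."""

    def __init__(self, n_bands, alpha_s=0.8, alpha_d=0.95, alpha_p=0.2,
                 delta=5.0, window=50):
        self.alpha_s, self.alpha_d, self.alpha_p = alpha_s, alpha_d, alpha_p
        self.delta, self.window = delta, window
        self.p = [0.0] * n_bands            # smoothed speech-presence probability
        self.s = self.s_min = self.s_tmp = self.noise = None
        self.frame = 0

    def update(self, band_power):
        if self.noise is None:              # first frame initialises all state
            self.s = list(band_power)
            self.s_min = list(band_power)
            self.s_tmp = list(band_power)
            self.noise = list(band_power)
            return list(self.noise)
        self.frame += 1
        for b, y in enumerate(band_power):
            # Recursively smoothed band power.
            s = self.alpha_s * self.s[b] + (1.0 - self.alpha_s) * y
            # Track the local minimum, restarting the search every `window` frames.
            if self.frame % self.window == 0:
                self.s_min[b] = min(self.s_tmp[b], s)
                self.s_tmp[b] = s
            else:
                self.s_min[b] = min(self.s_min[b], s)
                self.s_tmp[b] = min(self.s_tmp[b], s)
            # Speech is assumed present when power is far above the local minimum.
            indicator = 1.0 if s > self.delta * self.s_min[b] else 0.0
            self.p[b] = self.alpha_p * self.p[b] + (1.0 - self.alpha_p) * indicator
            # The more likely speech is, the slower the noise estimate adapts.
            a = self.alpha_d + (1.0 - self.alpha_d) * self.p[b]
            self.noise[b] = a * self.noise[b] + (1.0 - a) * y
            self.s[b] = s
        return list(self.noise)


def subtract(band_power, noise, over=1.0, floor=0.01):
    """Power spectral subtraction per band with a spectral floor."""
    return [max(y - over * n, floor * y) for y, n in zip(band_power, noise)]
```

In use, each frame's one-sided power spectrum would be passed through `bark_band_powers`, the result fed to `BandNoiseTracker.update`, and the returned noise estimate handed to `subtract`. The spectral floor keeps a small fraction of the noisy power in each band, which is one common way to reduce musical noise; whether the paper uses the same device is not stated in the abstract.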

Pages:

1398-1401

Online since:

August 2013

Copyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved
