Speech Stream Detection for Noisy Environments Based on Empirical Mode Decomposition

Article Preview

Abstract:

A new approach for speech stream detection based on empirical mode decomposition (EMD) under a noisy environment is proposed. Accurate speech stream detection proves to significantly improve speech recognition performance under noise. The proposed algorithm relies on the Teager energy and spectral entropy characteristics of the signal to determine whether an input frame is speech or non-speech. Firstly, the noise signals can be decomposed into different numbers of sub-signals called intrinsic mode functions (IMFs) with the EMD. Then, spectral entropy is used to extract the desired feature for noisy IMF components and Teager energy is used to non-noisy IMF components. Finally, in order to show the effectiveness of the proposed method, we present examples showing that the new measure is more effective than traditional measures. The experiments show that the proposed algorithm can suppress different noise types with different SNR.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

2239-2242

Citation:

Online since:

September 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] G. Tanyer, H. Ozer, IEEE Trans. Speech Audio Processing, Vol. 8 (2000), p.478.

Google Scholar

[2] L. R. Rabiner,M. R. Sambur, Bell Syst. Tech. J., vol. 54(1975), p.297.

Google Scholar

[3] Zhang Yong, Chen Bin. Journal of University of Electronic Science and Technology of China, Vol. 36(2007), p.8.

Google Scholar

[4] J. F. Kaiser. In Proc. IEEE ICASSP-90, (1990), p.381.

Google Scholar

[5] N. E. Huang, Z. Shen and S. R. Long, Proceedings of the Royal Society London A, Vol. 454 (1998), p.903.

Google Scholar

[6] I. Djurovic and L.J. Stankovic, IEEE Transaction on Signal Processing, Vol. 49 (2001), p.2985.

Google Scholar

[7] Wang Rangding, Chai Peiqi. Information and Control (In Chinese), Vol. 33 (2004), p.77.

Google Scholar

[8] De-Xiang Zhang, Xiao-Pei Wu, and Zhao Lv. Journal of Electronic Science and Technology, Vol. 8 (2010), p.183.

Google Scholar