A New Method for Noisy Speech Classification Based on Gaussian Mixture Models

Article Preview

Abstract:

Speech can be broadly categorized into voiceless, voiced, and mute signal, in which voiced speech can be further classified into vowel and voiced consonant. With the ever increasing demand of the speech synthesis applications, it is urgent to develop an effective classification method to differentiate vowel and voiced consonant signal since they are two distinct components that affect the naturalness of the synthetic speech signal. State-of-the-arts algorithms for speech signal classification are effective in classifying voiceless, voiced and mute speech signal, however, not effective in further classifying the voiced signal. In view of the issue, a new algorithm for speech classification based on Gaussian Mixture Model (GMM) is proposed, which can directly classify a speech into voiceless, voiced consonant, vowel and mute signal. Specifically, a new speech feature is proposed, and the GMM is also modified for speech classification. Simulation results demonstrate that the proposed algorithm is effective even under the noisy environments.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 532-533)

Pages:

1253-1257

Citation:

Online since:

June 2012

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2012 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] B Atal, L Rabiner. A pattern recognition approach to voiced-unvoiced-silence classification with applications to speech recognition [ J ]. IEEE Transactions on Acoustics, Speech and Signal Processing, 1976, 24 ( 3 ) : 201 -212.

DOI: 10.1109/tassp.1976.1162800

Google Scholar

[2] L Rabiner, M Sam bur. Application of an LPC distance measure to the voiced-unvoiced-silence detection problem [ J ]. IEEE Transactions on Acoustics, Speech and Signal Processing, 1977, 25 (4) : 338 – 343.

DOI: 10.1109/tassp.1977.1162964

Google Scholar

[3] B Cox, L M Timothy. Nonparametric rank-order statistics applied to robust voiced-unvoiced-silence classifica2tion [ J ]. IEEE Transactions on Acoustics, Speech and Signal Processing, 1980, 28 (5) : 550 - 561.

DOI: 10.1109/tassp.1980.1163444

Google Scholar

[4] Y Q i, B R Hunt. Voiced-unvoiced-silence classification of speech using hybrid features and a network classifier[ J ]. IEEE Transactions on Speech and Audio Processing.

DOI: 10.1109/89.222883

Google Scholar

[5] R Ahn, W H Holmes. Voiced / unvoiced / silence classification of speech using 2-Stage neural networks with delayed decision input[A ]. B Boashash, et a l. Proc ISSPA96 [ C ]. B risbane, Australia: Queensland University of Technology, 1996. 389 - 390.

Google Scholar

[6] QI Feng-yan and BAO Chang-chun, A Method for Voiced/Unvoiced/Silence Classification of Speech with Noise Using SVM, [ J ]Chinese Journal of Electronics, 2006, 4(4), 605-611.

Google Scholar

[7] Douglas Reynolds Richard C Rose . Robust Text-Independent Speaker Identification Using Gaussian Mixture Speaker Models[J]. IEEE Transactions on Speech and Audio Processing January 1995, 3( 1) : 72-83.

DOI: 10.1109/89.365379

Google Scholar