A Recognition Judgment Method of Isolated-Word Speech-Recognition

Article Preview

Abstract:

Isolated-word speech-recognition system adopted the shortest distance of Dynamic Time Warping (DTW) to make recognition judgment, which has the disadvantage of high False Accept Rate (FAR), poor anti-noise and robustness. This paper proposes a new method based on DTW distance Threshold Estimation for recognition judgment. This method processes the maximum distance between template speech and training input speech multiplying adjusting coefficient, then plus noise DTW distance, which regard the final result as distance Threshold Estimation. At the time of doing speech recognition, if the distance between testing speech and template speech exceeds the Threshold Estimation, then the system will not recognize this speech. The experiment shows that this method can greatly improve the anti-noise and robustness performance of the Isolated-word speech-recognition system and solve the problem of high FAR.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

2337-2340

Citation:

Online since:

March 2014

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] Zhiqiang Wang, The research of key issues for isolated-word speech-recognition system, Beijing University of Posts and Telecommunications, MA, (2006).

Google Scholar

[2] Xiaodong Shi, The design of isolated-word speech-recognition system, Zhejiang University , MA, (2006).

Google Scholar

[3] Yunhong Li, Ziling Li, The improved DTW voice recognition algorithm (Published Journal style), Information Technology Applications in Industry, pp.2328-2331, (2013).

Google Scholar

[4] Shichun Zhou, The research of algorithm for auditory characteristics and noise estimation in speech enhancement, East China University of Science and Technology, MA, (2013).

Google Scholar

[5] Junqin Wu , Junjun Yu, An improved arithmetic of MFCC in speech recognition system(Published Proceedings style), IEEE , International Conference on Electronics, Communications and Control 2011, pp.719-722.

DOI: 10.1109/icecc.2011.6066676

Google Scholar

[6] I. Cohen, Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging (Published Journal style), IEEE Transactions on Speech and Audio Processing, vol. 11, no. 5, pp.466-475, (2003).

DOI: 10.1109/tsa.2003.811544

Google Scholar

[7] Xueji Jin, The research and implementation of algorithm for speech enhancement, Zhejiang University , MA, (2005).

Google Scholar

[8] T. Painter, A. Spanias, Perceptual coding of digital audio (Published Journal style), Proceedings of the IEEE, vol. 88, no. 4, pp.451-503, (2000).

DOI: 10.1109/5.842996

Google Scholar