Research and Realization on the Voice Command Recognition System for Robot Control Based on ARM9

Abstract:

In this paper, based on a study of two speech recognition algorithms, two speech recognition system designs are presented that implement an isolated-word voice control system for a mobile robot on an ARM9 processor. The speech recognition process comprises speech signal preprocessing, feature extraction, pattern matching, and post-processing. Mel-frequency cepstral coefficients (MFCC) and linear prediction cepstral coefficients (LPCC) are the two most common feature parameters. Analysis and comparison of the two show that MFCC is more robust to noise than LPCC, so MFCC is selected as the feature parameter. Dynamic time warping (DTW) and the hidden Markov model (HMM) are both commonly used recognition algorithms. Because DTW and HMM have different characteristics, a separate program was designed for each in the mobile robot control system. The recognition performance and speed of the two speech recognition systems were analyzed and compared.
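
The preview does not give implementation details, so the following is only a minimal sketch of how DTW template matching for isolated commands generally works, not the authors' program. The function names, the command labels, and the one-dimensional feature values are hypothetical; a real system would compare sequences of MFCC vectors, one vector per frame.

```python
import math


def dtw_distance(a, b):
    """Classic DTW distance between two sequences of equal-dimension feature vectors."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal accumulated distance aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])  # Euclidean frame distance
            cost[i][j] = d + min(
                cost[i - 1][j],      # a advances, b repeats its frame
                cost[i][j - 1],      # b advances, a repeats its frame
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


def recognize(features, templates):
    """Return the label of the stored template closest to the utterance."""
    return min(templates, key=lambda name: dtw_distance(features, templates[name]))


# Hypothetical templates: one reference feature sequence per command.
templates = {
    "forward": [(0.0,), (1.0,), (2.0,)],
    "stop": [(5.0,), (5.0,), (5.0,)],
}
# An utterance spoken more slowly than the "forward" template.
utterance = [(0.0,), (0.0,), (1.0,), (2.0,)]
best = recognize(utterance, templates)
```

Because DTW lets one frame align with several frames of the other sequence, the slower utterance still matches its template despite the length difference. This is what makes the method usable for isolated commands without a trained statistical model such as an HMM.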

Info:

Pages:

1422-1426

Online since:

December 2010

Copyright:

© 2011 Trans Tech Publications Ltd. All Rights Reserved
