A Chinese Small Vocabulary Offline Speech Recognition System Based on Pocketsphinx in Android Platform

Article Preview

Abstract:

This paper describes a Chinese small-vocabulary offline speech recognition system based on PocketSphinx which acoustic models are regenerated by improving the existing models of Sphinx and language model is generated by LMTool online tool. And then build an offline speech recognition system which could run on the Android smartphone in Android development environment in Linux system. The experiment results show that the system used for recognizing the voice commands for cell phone has good recognition performance.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

267-273

Citation:

Online since:

August 2014

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] Huggins-Daines, D., Kumar, M., Chan, A., Black, A.W., PocketSphinx: A Free, Real-Time Continuous Speech Recognition System for Hand-Held Devices, Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 proceedings. 2006 IEEE International Conference on, vol. 1, no., pp. I, 14-19 May (2006).

DOI: 10.1109/icassp.2006.1659988

Google Scholar

[2] Information on http: /developer. android. com/sdk/installing. html.

Google Scholar

[3] A P Harvey., R J McCrindle, K., Lundqvist and P Parslow., Automatic speech recognition for assistive technology devices, 8th Intl Conf. Disability, Virtual Reality & Associated Technologies, 31 Aug. - 2 Sept. (2010).

Google Scholar

[4] J.M. Noyes., C.R. Frankish., Speech recognition technology for individuals with disabilities, ISAAC. vol. 8, December (1992).

Google Scholar

[5] Wang Y, Zhang X. Realization of Mandarin continuous digits speech recognition system using Sphinx, Computer Communication Control and Automation (3CA), 2010 International Symposium on. IEEE, 2010, 1: 378-380.

DOI: 10.1109/3ca.2010.5533801

Google Scholar

[6] Waleed Fakhr., Ahmed AbdelSalam., NadderHamdy., Enhancement of mismatched conditions in speaker recognition for multimedia applications , IEEE International Conference on Acoustics, Speech, and Signal Processing, May (2004).

DOI: 10.1109/icassp.2004.1326001

Google Scholar

[7] Barrena S., Klotz L., Landes V., Designing android applications with both online and offline voice control of household devices, Bioengineering Conference (NEBEC), 2012 38th Annual Northeast. IEEE, 2012: 319-320.

DOI: 10.1109/nebc.2012.6207093

Google Scholar