Research on Speech Recognition System with Speaker Identification Based on the Cloud Server

Article Preview

Abstract:

The speech recognition system is not real-time, a speak identification method based on the cloud server is proposed to solve this problem. Firstly, the MFCC frequency cepstrum coefficient and the first order differential coefficient are extracted from the speech feature vector sequence to form 32 dimensional. And then the 32 dimensional speech feature vector is sent to the cloud server, the training speaker model and identification are done in the cloud server. Finally, the identification result is sent to the client.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

219-222

Citation:

Online since:

August 2014

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] Jain A K, Hong L, Kulkarni Y A. Multimod-al Biometric Sys-tem using Fingerprints, Face and Speech. 2nd Int' l Conferenceon Audio-and Video-based Biometric Person Authentication, Washington D.C., p.182~187, March 22-24, (1999).

Google Scholar

[2] CAO Jie, YU Li-chen. Improved speaker clustering initialization and GMM multi-speaker recognition[J]. Application Research of Computers. 2012, Vol29, No2, pp: 590-593.

Google Scholar

[3] GARAU G,DIELMANN A,BOURLARD H.

Google Scholar

[4] Audio-visual synchroni- sation for speaker diarisaation[C]/ Proc of International Conference on Speech and Language Processing. Makuhari,Chiba: [s. n. ]. 2010: 2654-2657.

Google Scholar

[5] BURGES C L C. A tutorial on support vector machines for pattern recognition [J]. Data Min-ing and Knowledge Discovery, 1998, 2(2): 121 -167.

Google Scholar

[6] LIUM H, XIEY L, YAO ZQ, etal. A new hybrid GMM /SVM for speaker verification [C]/ The 18th International Conference on Pattern Recognition. Hong Kong: IEEE Press, 2006, 4: 314-317.

DOI: 10.1109/icpr.2006.118

Google Scholar

[7] CuiXuan. Research of Speaker Recognition Based on combining speaker characteristic [J]. Chengdu Xinhua University. 2008: 19, 39-42.

Google Scholar

[8] Yan Gao, Lanwen Jin, Cong He, Guibin Zhou. Handwriting Character Recognition as a Service: A New Handwriting Recognition System Based on Cloud Computing. 2011. 9P885-889. In proceeding of: Document Analysis and Recognition (ICDAR).

DOI: 10.1109/icdar.2011.181

Google Scholar

[9] Siew Chan Woo, Chee Peng Lim, Osman. R, Development of a Speaker Recognition System Using Wavelets and Artificial Neural network Intelligent Multimedia, Video and Speech Processing May (2001), Page 413-416.

DOI: 10.1109/isimp.2001.925421

Google Scholar

[10] H. Torres, H. Rufiner, Automatic. Speaker Identifacation by Means of Mel Cepstrum, Wavelets and Wavelet Packets,Processing of the 22th Annual EMBS International Conference, Chicago, July (2000), Page 978-981.

DOI: 10.1109/iembs.2000.897886

Google Scholar

[11] Dean J. Ghemawat S. Distributed Programming With MapReduce [M]. /. Oram A, WilsonG, eds. Beautiful Code, Sebastopol: O' Reilly Media Inc. 2007. 371-384.

Google Scholar