Research on Speech Recognition System with Speaker Identification Based on the Cloud Server

Xue Yan Liu; Bao Ling Yuan

doi:10.4028/www.scientific.net/AMR.1022.219

Paper Titles

The Overview on Sticking Breakout Behavior for Thin Slab Continuous Casting
p.201

Research on Information Process with a Computational Approach to Some Odd-Graceful Trees
p.207

Application of Computer and Information Technology in the Dormitory Management
p.211

Application of Information Technology in Interpretation with Note Methods for Exploration-Internalization
p.215

Research on Speech Recognition System with Speaker Identification Based on the Cloud Server
p.219

Applied Technology in a Developed Simulation Model of Pedestrian Crowd Dynamics during Emergency Evacuation
p.223

Applied AHP Technology and Information Technology in Evaluation for Teaching Quality of University
p.229

Data Processing for Correlation Analysis between Extracurricular Sports Activities and Mental Health in Rural Middle School Students
p.233

Research on Computer Technology with Informatization in Higher Education
p.237

HomeAdvanced Materials ResearchAdvanced Materials Research Vol. 1022Research on Speech Recognition System with Speaker...

Research on Speech Recognition System with Speaker Identification Based on the Cloud Server

Abstract:

The speech recognition system is not real-time, a speak identification method based on the cloud server is proposed to solve this problem. Firstly, the MFCC frequency cepstrum coefficient and the first order differential coefficient are extracted from the speech feature vector sequence to form 32 dimensional. And then the 32 dimensional speech feature vector is sent to the cloud server, the training speaker model and identification are done in the cloud server. Finally, the identification result is sent to the client.

You might also be interested in these eBooks

View Preview

Info:

Periodical:

Advanced Materials Research (Volume 1022)

Pages:

219-222

DOI:

https://doi.org/10.4028/www.scientific.net/AMR.1022.219

Citation:

Cite this paper

Online since:

August 2014

Authors:

Xue Yan Liu*, Bao Ling Yuan

Keywords:

Cloud Server, MFCC, Speaker Identification, SVM

Export:

RIS, BibTeX

Price:

Permissions CCC:

Request Permissions

Permissions PLS:

Request Permissions

Сopyright:

Citation:

* - Corresponding Author

References

[1] Jain A K, Hong L, Kulkarni Y A. Multimod-al Biometric Sys-tem using Fingerprints, Face and Speech. 2nd Int' l Conferenceon Audio-and Video-based Biometric Person Authentication, Washington D.C., p.182～187, March 22-24, (1999).

Google Scholar

[2] CAO Jie, YU Li-chen. Improved speaker clustering initialization and GMM multi-speaker recognition[J]. Application Research of Computers. 2012, Vol29, No2, pp: 590-593.

Google Scholar

[3] GARAU G，DIELMANN A，BOURLARD H.

Google Scholar

[4] Audio-visual synchroni- sation for speaker diarisaation[C]/ Proc of International Conference on Speech and Language Processing. Makuhari，Chiba: [s. n. ]. 2010: 2654-2657.

Google Scholar

[5] BURGES C L C. A tutorial on support vector machines for pattern recognition [J]. Data Min-ing and Knowledge Discovery, 1998, 2(2): 121 -167.

Google Scholar

[6] LIUM H, XIEY L, YAO ZQ, etal. A new hybrid GMM /SVM for speaker verification [C]/ The 18th International Conference on Pattern Recognition. Hong Kong: IEEE Press, 2006, 4: 314-317.

DOI: 10.1109/icpr.2006.118

Google Scholar

[7] CuiXuan. Research of Speaker Recognition Based on combining speaker characteristic [J]. Chengdu Xinhua University. 2008: 19, 39-42.

Google Scholar

[8] Yan Gao, Lanwen Jin, Cong He, Guibin Zhou. Handwriting Character Recognition as a Service: A New Handwriting Recognition System Based on Cloud Computing. 2011. 9P885-889. In proceeding of: Document Analysis and Recognition (ICDAR).

DOI: 10.1109/icdar.2011.181

Google Scholar

[9] Siew Chan Woo, Chee Peng Lim, Osman. R, Development of a Speaker Recognition System Using Wavelets and Artificial Neural network Intelligent Multimedia, Video and Speech Processing May (2001), Page 413-416.

DOI: 10.1109/isimp.2001.925421

Google Scholar

[10] H. Torres, H. Rufiner, Automatic. Speaker Identifacation by Means of Mel Cepstrum, Wavelets and Wavelet Packets，Processing of the 22th Annual EMBS International Conference, Chicago, July (2000), Page 978-981.

DOI: 10.1109/iembs.2000.897886

Google Scholar

[11] Dean J. Ghemawat S. Distributed Programming With MapReduce [M]. /. Oram A, WilsonG, eds. Beautiful Code, Sebastopol: O' Reilly Media Inc. 2007. 371-384.

Google Scholar