The Research of Speech Emotion Recognition Based on Gaussian Mixture Model

Article Preview

Abstract:

A new recognition method based on Gaussian mixture model for speech emotion recognition is proposed in this paper. To improve the effectiveness of feature extraction and accuracy of emotion recognition, extraction of Mel frequency cepstrum coefficient combined with Gaussian mixture model is used to recognize speech emotion. According to feature parameters extraction method by analyzing the principle of vocalization theory, emotion models based on Gaussian mixture model are generated and the similarity of their templates is obtained. A series of experiments is performed with recorded speech based on Gaussian mixture model and indicates the system gains high performance and better robustness.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

1126-1129

Citation:

Online since:

October 2014

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Lin Zhong , Yanxiong Li , Xin Wei, Guanglong Li , Zili Wang , Yewen Jiang. System design for monitoring infant speech emotion, 2013 10th IEEE International Conference on Control and Automation, (2013).

DOI: 10.1109/icca.2013.6564891

Google Scholar

[2] Yuanlu Kuang, Lijuan Li. Speech emotion recognition of decision fusion based on DS evidence theory, Software Engineering and Service Science (ICSESS), (2013).

DOI: 10.1109/icsess.2013.6615425

Google Scholar

[3] Wenming Zheng , Minghai Xin , Xiaolan Wang, Bei Wang. A Novel Speech Emotion Recognition Method via Incomplete Sparse Least Square Regression , Signal Processing Letters, (2014).

DOI: 10.1109/lsp.2014.2308954

Google Scholar

[4] Qingli Zhang , Ning An , Kunxia Wang , Fuji Ren , Lian Li. Speech emotion recognition using combination of features, 2013 Fourth International Conference on Intelligent Control and Information Processing, (2013).

DOI: 10.1109/icicip.2013.6568131

Google Scholar

[5] Krishna Kishore, K.V. ; Krishna Satish, P. Emotion recognition in speech using MFCC and wavelet features, 2013 IEEE 3rd International Advance Computing Conference , (2013).

DOI: 10.1109/iadcc.2013.6514336

Google Scholar

[6] Sungrack Yun , Yoo, C.D. Loss-Scaled Large-Margin Gaussian Mixture Models for Speech Emotion Classification. Audio, Speech, and Language Processing, (2012).

DOI: 10.1109/tasl.2011.2162405

Google Scholar

[7] Jianbo Jiang ; Zhiyong Wu ; Mingxing Xu ; Jia Jia ; Lianhong Cai. Comparing feature dimension reduction algorithms for GMM-SVM based speech emotion recognition, Signal and Information Processing Association Annual Summit and Conference, (2013).

DOI: 10.1109/apsipa.2013.6694336

Google Scholar

[8] Bhaykar, Manav ; Yadav, Jainath ; Rao, K. Sreenivasa . Speaker dependent, speaker independent and cross language emotion recognition from speech using GMM and HMM, 2013 National Conference on Communications (NCC), (2013).

DOI: 10.1109/ncc.2013.6487998

Google Scholar

[9] Tsang-Long Pao, Chun-Hsiang Wang , Yu-Ji Li. A Study on the Search of the Most Discriminative Speech Features in the Speaker Dependent Speech Emotion Recognition, 2012 Fifth International Symposium on  Parallel Architectures, Algorithms and Programming, (2012).

DOI: 10.1109/paap.2012.31

Google Scholar

[10] Wanli zhang, Guoxin Li. The Research of Feature Extraction Based on MFCC for Speaker Recognition . 3rd International Conference on Computer Science and Network Technology, October (2013).

DOI: 10.1109/iccsnt.2013.6967289

Google Scholar