Emotion Recognition from Speech Signals Using Elicited Data and Fuzzy KDA

Article Preview

Abstract:

In this paper we introduced the application of Fuzzy KDA in speech emotion recognition using elicited data. The emotional data induced in a psychology experiment. The acted data is not suitable for developing real world applications and by using more naturalistic data we may build more reliable system. The emotional feature set is then constructed for modeling and recognition. A total of 372 low level acoustic features are used and kernel discriminant analysis is used for emotion recognition. The experimental results show a promising recognition rate.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

1385-1388

Citation:

Online since:

August 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] R. Cowie, E. Douglas-Cowie, N. Tsapatsoulis and G. Votsis: Emotion Recognition in Human-Computer Interaction. IEEE Signal Processing Magazine, Jan. 2001, 18(1), pp.32-80.

DOI: 10.1109/79.911197

Google Scholar

[2] T. Yamada,H. Hashimoto, and N. Tosa: Pattern Recognition of Emotion with Neural Network. Proceeding of the 1995 IEEE IECON 21st International Conference on Industrial Electronics, Control, and Instrumentation, 1995, Vol. 1, PP: 183-187.

DOI: 10.1109/iecon.1995.483355

Google Scholar

[3] H. Sato,Y. Mitsukura,M. Fukumi and N. Akamatsu: Emotional Speech Classification with Prosodic Parameters by using Neural Networks. Seventh Australian and NewZealand Intelligent Information Systems Conference, 2001, pp: 395-398.

DOI: 10.1109/anziis.2001.974111

Google Scholar

[4] J. Nicholson, K. Takahashi and R. Nakatsu: Emotion Recognition in Speech Using Neural Networks. ICONIP 99, Vol. 2, 16-20.

DOI: 10.1109/iconip.1999.845644

Google Scholar

[5] Y. Jin, Y. Zhao, C.W. Huang and L. Zhao: The design and establishment of a Chinese whispered speech emotion database. Technical Acoustics, 2010, Vol. 29, No. 1, pp.63-68.

Google Scholar

[6] C. Pereira: Some Acoustic characteristics of Emotion, Proc. Int. Conf. On Spoken Language Processing, 1998, Sydney. pp.927-930.

Google Scholar

[7] C. Clavel, I. Vasilescu, L. Devillers, G. Richard and T. Ehrette: Fear-type emotion recognition for future audio-based surveillance systems, Speech Communication, 2008, vol. 50, pp.487-503.

DOI: 10.1016/j.specom.2008.03.012

Google Scholar

[8] D.A. Reynolds and R.C. Rose: Robust Text-independent Speaker Identification Using Gaussian Mixture Speaker Models. IEEE Trans. Speech Audio Processing, Jan. 1995, vol. 3, pp.72-83.

DOI: 10.1109/89.365379

Google Scholar

[9] R. Cowie and E. Douglas: Automatic statistical analysis of the signal and prosodic signs of emotion in speech, in Proceedings of the 14th International Congress of Phonetic Sciences, San Francisco, August 1999 , vol 3, pp.2327-2330.

DOI: 10.21437/icslp.1996-467

Google Scholar

[10] Mika and Sebastian: Fisher discriminant analysis with kernels. Proceedings of the 1999 IEEE Signal Processing Society Workshop, (1999).

Google Scholar