The Extraction of Differential MFCC Based on EMD

Article Preview

Abstract:

Feature extraction is the key to the object recognition. How to obtain effective, reliable characteristic parameters from the limited measured data is a question of great importance in feature extraction. This paper presents a method based on Empirical Mode Decomposition (EMD) for the extraction of Mel Frequency Cepstrum Coefficients (MFCCs) and its first order difference from original speech signals that contain four kinds of emotions such as anger, happiness, surprise and natural for emotion recognition. And the experiments compare the recognition rate of MFCC, differential MFCC (Both of them are extracted based on EMD) or their combination through using Support Vector Machine (SVM) to recognize speakers' emotional speech identity. It proves that the combination of MFCC and its first order difference has a highest recognition rate.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

1167-1170

Citation:

Online since:

March 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] M.A. Hossan, S. Memon and M.A. Gregory, in: A Novel Approach for MFCC Feature Extraction, Progress in 4th International Conference on Signal Processing and Communication Systems (2010).

DOI: 10.1109/icspcs.2010.5709752

Google Scholar

[2] L. Zhao: Speech Signal Processing (in Chinese), China Machine Press, Beijing (2003).

Google Scholar

[3] Y. Shao, B. Liu, and Z. Li: Speaker Recognition System Based on MFCC and Weighted Vector Quantization, Computer Engineering and Applications (in Chinese) (2002), pp.127-128.

Google Scholar

[4] Binbin Tu and Fengqin YU: Speech Emotion Recognition Based on Improved MFCC with EMD. Computer Engineering and Applications (in Chinese) (2012), pp.119-122.

Google Scholar

[5] Z. Bian, X. Zhang, et al, in: Pattern Recognition (in Chinese). Tsinghua University Press, Beijing (2000).

Google Scholar

[6] Junqin Wu and Junjun Yu, in: An Improved Arithmetic of MFCC in Speech Recognition System. Progress in International Conference on Electronics, Communications and Control (2011).

DOI: 10.1109/icecc.2011.6066676

Google Scholar

[7] Gan Lu, Long Zhou and Xinge You, in: An improved EMD analysis method for GPR images, Process in International Conference on Computational and Information Sciences (2011).

DOI: 10.1109/iccis.2011.77

Google Scholar