Enhancements of SVM Speaker Recognition by Dynamic Time Wrapping

Article Preview

Abstract:

This paper proposes an approach using dynamic time wrapping (DTW) to improve the classification performance of the SVM separation hyperplane. The presented method by incorporating the distance information derived from DTW template matching calculations into SVM separation hyperplane training will be able to effectively control the balance between mar-gin maximization and the amount of misclassifications, and therefore the recognition accuracy of the SVM classifier on speaker recognition will further be increased. Experimental results demonstrated the effectiveness and efficiency of the developed approach.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

891-894

Citation:

Online since:

May 2015

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2015 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] S. K. Gaikwad, B. W. Gawali, P. Yannawar: A review on speech recognition technique, Int. J. Comput. Appl. 10 (2010) 16-24.

Google Scholar

[2] I. J. Ding, C. T. Yen: Enhancing GMM speaker identification by incorporating SVM speaker verification for intelligent web-based speech applications, Multimed. Tools Appl. (2013).

DOI: 10.1007/s11042-013-1587-5

Google Scholar

[3] C. J. C. Burges: A tutorial on support vector machines for pattern recognition, Data Min. Knowl. Disc. 2 (1998) 121-167.

Google Scholar

[4] J. X. Dong, A. Krzyzak, C. Y. Suen: Fast SVM training algorithm with decomposition on very large datasets, IEEE Trans. Pattern Anal. Mach. Intell. 27 (2005) 603-618.

DOI: 10.1109/tpami.2005.77

Google Scholar

[5] S. Z. Boujelbene, D. B. A. Mezghani, N. Ellouze: Improving SVM by modifying kernel functions for speaker identification task, International Journal of Digital Content Technology and its Applications 4 (2010) 100-105.

DOI: 10.4156/jdcta.vol4.issue6.12

Google Scholar

[6] H. Sakoe, S. Chiba: Dynamic programming algorithm optimization for spoken word recognition, IEEE Trans. Acoust. Speech Signal. Process. 26 (1978) 43-49.

DOI: 10.1109/tassp.1978.1163055

Google Scholar