Speech Feature Parameter Extraction and Recognition Based on Interpolation

Article Preview

Abstract:

The research of the existing speech recognition is based on speech feature parameter, acco-rding to the shortage of poor anti noise and larger storage capacity, etc. So, curve interpolation has been introduced into speech feature parameter extraction to enhance that. Refer to the speech spectrum dynamic changes and the short-time energy smooth stationary characteristics of speech signal, this paper puts forward and designs an arithmetic of speech feature parameter extraction based on interpolation, constructs the feature parameter extraction and personal identification scheme based on speech, and also designs critical modules algorithm. The detail process of feature parameter extraction: firstly, it creates two-dimensional coordinate for each frame data. Then, according to two-dimensional coordinate, it performs Lagrange cubic interpolation for segmentation the data in a signal frame. Get the interpolation coefficient, average the interpolation coefficient for a signal frame, here the average value is seen as the feature parameter for each frame. Lastly, the each frame’s feature parameter is connected in series to form feature parameter of the speech segment. The arithmetic has been simulated an experiment, in order to confirm the applicability and feasibility. The results illustrates the method has preferable anti noise performance, especially expression and storage for overall speech segment feature parameter show more obvious advantages.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

2118-2123

Citation:

Online since:

August 2014

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] Yuan Yujin, Zhao Peihua, Zhou Qun. Research of Speaker Recongnition Based on Combination of LPCC and MFCC. 2010 IEEE International Conference on Intelligent Computing and Intelligent Systems. xiamen china. IEEE Computer Society : 765-767.

DOI: 10.1109/icicisys.2010.5658337

Google Scholar

[2] Zhang Cheng, Researches and Implementation on Speaker Recognition Algorithms and systems[D], Changsha, National University of Defense Technology, (2005).

Google Scholar

[3] ZHANG Zhen, WANG Hua-qing, Improve algorithm of Mel-Frequence Cepstral Coefficients in characteristics extraction based on voice signal, Cmputer Engineering and Applications, 2008, vol. 44 No. 22: 54-55.

Google Scholar

[4] Wang Biao, Speech recognition system based on LPCC parameter, Electronic Desi gn Engineering, Apr. 2012 vol. 20 No. 7: 18-20.

Google Scholar

[5] Shao Yang, Liu Bingzhe, Li Zongge, A Speaker Recognition System Using MFCC Features and Weig- hted Vector Quantization, Cmputer Engineering and Applications, Dec. 2002 Vol. 28 No. 5: 127-128.

Google Scholar

[6] JIANG Xing-hua, Li Ying, Audio Data Retrieval Method Based on LPC-MFCC, Computer Engineering, June 2009, vol. 35 No. 11: 246-247, 253.

Google Scholar

[7] HERMANSKY H. Perceptual linear predictive (PLP) analysis for speech[J]. Journal of Acoust Soc Am, 1990, 87 ( 4 ) : 1738 -1752.

DOI: 10.1121/1.399423

Google Scholar

[8] Meinard M8üller, Member, Sebastian Ewert. Towards Timbre-Invariant audio Featrue for Harmony-Based Music. 2010 IEEE Transactions on Audio, SPEECH and Language Processing March Vol. 18, No. 3: 649-662.

DOI: 10.1109/tasl.2010.2041394

Google Scholar

[9] Nakamasa Inoue, Tatsuhiko Saito, Koichi Shinoda and Sadaoki Furui. High-Level Feature Extraction Using SIFT GMMs and Audio Models. 2010 International Conference on Pattern Recongnition. IEEE computer society, 2010: 3220-3223.

DOI: 10.1109/icpr.2010.787

Google Scholar

[10] Charles Parker. An Empirical Study of Feature Extraction Methods for Audio Classification. 2010 20th International Conference on Pattern Recognition (ICPR). Istanbul. IEEE computer society, 2010: 4593-4596.

DOI: 10.1109/icpr.2010.1111

Google Scholar

[11] Zhen Bin, Wu Xihong, Liu Zhimin, Chi Huisheng, On the Importance of Components of each Cepstral Components in Speech and Speaker Recognition, Pekinensis Universitatis(Acta Scientiarum Naturalium) , 2001vol. 37 No. 3: 371-378.

Google Scholar

[12] DONG Zhi-Feng, WANG Zeng-Fu, The Speaker Recognition System Based on The Dynamic MFCC, Pattern Recognition and Artificial Intelligence, Dec. Oct 2005 Vol. 18 No. 5: 596-601.

Google Scholar