Speech Feature Parameter Extraction and Recognition Based on Interpolation

Ying Jie Meng; Wen Jun Liu; Rui Zhi Zhang; Hua Song Du

doi:10.4028/www.scientific.net/AMM.602-605.2118

Paper Titles

Research on Review Spam Detection Based on Sentiment Analysis in Electronic Commerce
p.2101

A Dimension Reduction Method and its Application in GIS Partial Discharge Pattern Recognition Research
p.2105

Study on Single-Phase Earth Fault Location Method in Mine Non-Effectively Grounded Network Based on WAMS
p.2110

Design of Automatic Output of Carbon Electrode Processing System Testing Tag Based on WinCC
p.2114

Speech Feature Parameter Extraction and Recognition Based on Interpolation
p.2118

Monitor System and Safety Status Analysis of Large Power Equipment Transportation
p.2124

Research of the Intelligent Medical Infusion Monitoring System
p.2130

Research and Implementation of a Wireless Network Security Inspection Platform for a Power Supply Enterprise
p.2134

Feedback Clustering Algorithm for Detecting Approximately Duplicate Records
p.2138

HomeApplied Mechanics and MaterialsApplied Mechanics and Materials Vols. 602-605Speech Feature Parameter Extraction and...

Speech Feature Parameter Extraction and Recognition Based on Interpolation

Abstract:

The research of the existing speech recognition is based on speech feature parameter, acco-rding to the shortage of poor anti noise and larger storage capacity, etc. So, curve interpolation has been introduced into speech feature parameter extraction to enhance that. Refer to the speech spectrum dynamic changes and the short-time energy smooth stationary characteristics of speech signal, this paper puts forward and designs an arithmetic of speech feature parameter extraction based on interpolation, constructs the feature parameter extraction and personal identification scheme based on speech, and also designs critical modules algorithm. The detail process of feature parameter extraction: firstly, it creates two-dimensional coordinate for each frame data. Then, according to two-dimensional coordinate, it performs Lagrange cubic interpolation for segmentation the data in a signal frame. Get the interpolation coefficient, average the interpolation coefficient for a signal frame, here the average value is seen as the feature parameter for each frame. Lastly, the each frame’s feature parameter is connected in series to form feature parameter of the speech segment. The arithmetic has been simulated an experiment, in order to confirm the applicability and feasibility. The results illustrates the method has preferable anti noise performance, especially expression and storage for overall speech segment feature parameter show more obvious advantages.

You might also be interested in these eBooks

View Preview

Info:

Periodical:

Applied Mechanics and Materials (Volumes 602-605)

Pages:

2118-2123

DOI:

https://doi.org/10.4028/www.scientific.net/AMM.602-605.2118

Citation:

Cite this paper

Online since:

August 2014

Authors:

Ying Jie Meng*, Wen Jun Liu, Rui Zhi Zhang, Hua Song Du

Keywords:

Extraction Method, Feature Parameter, Speech Recognition

Export:

RIS, BibTeX

Price:

Permissions CCC:

Request Permissions

Permissions PLS:

Request Permissions

Сopyright:

Citation:

* - Corresponding Author

References

[1] Yuan Yujin, Zhao Peihua, Zhou Qun. Research of Speaker Recongnition Based on Combination of LPCC and MFCC. 2010 IEEE International Conference on Intelligent Computing and Intelligent Systems. xiamen china. IEEE Computer Society : 765-767.

DOI: 10.1109/icicisys.2010.5658337

Google Scholar

[2] Zhang Cheng, Researches and Implementation on Speaker Recognition Algorithms and systems[D], Changsha, National University of Defense Technology, (2005).

Google Scholar

[3] ZHANG Zhen, WANG Hua-qing, Improve algorithm of Mel-Frequence Cepstral Coefficients in characteristics extraction based on voice signal, Cmputer Engineering and Applications, 2008, vol. 44 No. 22: 54-55.

Google Scholar

[4] Wang Biao, Speech recognition system based on LPCC parameter, Electronic Desi gn Engineering, Apr. 2012 vol. 20 No. 7: 18-20.

Google Scholar

[5] Shao Yang, Liu Bingzhe, Li Zongge, A Speaker Recognition System Using MFCC Features and Weig- hted Vector Quantization, Cmputer Engineering and Applications, Dec. 2002 Vol. 28 No. 5: 127-128.

Google Scholar

[6] JIANG Xing-hua, Li Ying, Audio Data Retrieval Method Based on LPC-MFCC, Computer Engineering, June 2009, vol. 35 No. 11: 246-247, 253.

Google Scholar

[7] HERMANSKY H. Perceptual linear predictive (PLP) analysis for speech[J]. Journal of Acoust Soc Am, 1990, 87 ( 4 ) : 1738 -1752.

DOI: 10.1121/1.399423

Google Scholar

[8] Meinard M8üller, Member, Sebastian Ewert. Towards Timbre-Invariant audio Featrue for Harmony-Based Music. 2010 IEEE Transactions on Audio, SPEECH and Language Processing March Vol. 18, No. 3: 649-662.

DOI: 10.1109/tasl.2010.2041394

Google Scholar

[9] Nakamasa Inoue, Tatsuhiko Saito, Koichi Shinoda and Sadaoki Furui. High-Level Feature Extraction Using SIFT GMMs and Audio Models. 2010 International Conference on Pattern Recongnition. IEEE computer society, 2010: 3220-3223.

DOI: 10.1109/icpr.2010.787

Google Scholar

[10] Charles Parker. An Empirical Study of Feature Extraction Methods for Audio Classification. 2010 20th International Conference on Pattern Recognition (ICPR). Istanbul. IEEE computer society, 2010: 4593-4596.

DOI: 10.1109/icpr.2010.1111

Google Scholar

[11] Zhen Bin, Wu Xihong, Liu Zhimin, Chi Huisheng, On the Importance of Components of each Cepstral Components in Speech and Speaker Recognition, Pekinensis Universitatis(Acta Scientiarum Naturalium) , 2001vol. 37 No. 3: 371-378.

Google Scholar

[12] DONG Zhi-Feng, WANG Zeng-Fu, The Speaker Recognition System Based on The Dynamic MFCC, Pattern Recognition and Artificial Intelligence, Dec. Oct 2005 Vol. 18 No. 5: 596-601.

Google Scholar