Stressed Speech Recognition Method Based on Difference Subspace Combined with Dynamic Time Warping

Cheng Guo Lv; Ru Bo Zhang; Pei Hua Li

doi:10.4028/www.scientific.net/AMM.241-244.1640

Paper Titles

Research on Parameters Optimization Algorithm in Support Vector Machine Based on Immune Memory Clone Strategy
p.1618

Study on Path Based Hierarchy Aggregation in DS-TE Environment
p.1622

A Multi-Classified Method of Support Vector Machine (SVM) Based on Entropy
p.1629

Research on Interactive Visualization Clustering Method Based on the Radar Chart
p.1633

Stressed Speech Recognition Method Based on Difference Subspace Combined with Dynamic Time Warping
p.1640

A Novel Technology in Surveillance Video for Detecting and Forecasting a Robbing Incident
p.1647

Illumination Invariant Face Recognition Using Nonlocal Total Variation in Logarithmic Domain
p.1652

Research of Intelligent Search Engine Based on Multi-Ontology
p.1659

Study on Techniques of Hand Gesture Recognition
p.1664

HomeApplied Mechanics and MaterialsApplied Mechanics and Materials Vols. 241-244Stressed Speech Recognition Method Based on...

Stressed Speech Recognition Method Based on Difference Subspace Combined with Dynamic Time Warping

Abstract:

Speech under G-force which produced when speaker was under different acceleration of gravity was analyzed and researched, considered as principal part and stressed part to research. An isolated word recognition approach was proposed which combined difference subspace means with dynamic time warping technique. The method recognized speech under G-force by constructing a difference subspace to remove the stressed part. Dynamic time warping technique was adopted to make all feature vectors of one word in the training set have equal length, and a corresponding decision criterion was suggested. For a small vocabulary including 15 words, the method obtained the average recognition rate of 98.3%, which almost equal to the rate in normal environment. The method not only worked well in normal conditions but also had good performance for speech under G-force.

You might also be interested in these eBooks

Industrial Instrumentation and Control Systems

View Preview

Info:

Periodical:

Applied Mechanics and Materials (Volumes 241-244)

Pages:

1640-1646

DOI:

https://doi.org/10.4028/www.scientific.net/AMM.241-244.1640

Citation:

Cite this paper

Online since:

December 2012

Authors:

Cheng Guo Lv, Ru Bo Zhang, Pei Hua Li

Keywords:

Difference Subspace, Dynamic Time Warping, Speech Recognition, Speech under G-Force

Export:

RIS, BibTeX

Price:

Permissions CCC:

Request Permissions

Permissions PLS:

Request Permissions

Сopyright:

Citation:

References

[1] Y. Chen. IEEE Trans. On acoustics, Speech and Signal Processing. Vol. 36 (1988), p.433.

Google Scholar

[2] J.H.L. Hansen and S. Bou-Ghazale. EUROSPEECH'97. Vol. 4 (1997), p.1743.

Google Scholar

[3] H.J.M. Steeneken and J.H.L. Hansen. ICASSP'99. Vol. 4 (1999), p. (2079).

Google Scholar

[4] S. Bou-Ghazale and J.H.L. Hansen. IEEE Trans. On Speech and Audio Processing. Vol. 8(2000), p.429.

Google Scholar

[5] G. Zhou, J.H.L. Hansen and J.F. Kaiser. IEEE Trans. On Speech and Audio Processing. Vol. 9(2001), p.201.

Google Scholar

[6] D. Womack and J.H.L. Hansen. IEEE Trans. On Speech and Audio Processing. Vol. 7(1999), p.668.

Google Scholar

[7] Ziyun YANG, Jiqing HAN and Jinpei XU. CHINESE JOURNAL OF ACOUSTICS. Vol. 15(1996), p.123.

Google Scholar

[8] Jingdong CHEN, Lei YAO and Taiyi HUANG. ACTA ACOUSTIC(In Chinese). Vol. 23(1998), p.537.

Google Scholar

[9] Bin TIAN and Kechu YI. ACTA ACOUSTIC(In Chinese). Vol. 28(2003), p.28.

Google Scholar

[10] Jialu ZHANG and Shijin QI. ACTA ACOUSTIC(In Chinese). Vol. 9(1984), p.258.

Google Scholar

[11] Jialu ZHANG. ACTA ACOUSTIC(In Chinese). Vol. 14(1989), p.401.

Google Scholar

[12] Jialu ZHANG. ACTA ACOUSTIC(In Chinese). Vol. 18(1993), p.263.

Google Scholar

[13] Yonglin MA, Jiqing HAN and Lei ZHANG. ACTA ACOUSTIC(In Chinese). Vol. 27(2002), p.518.

Google Scholar

[14] Yuwei Wang, Lei Zhang and Jiqing Han. SIGNAL PROCESSING(In Chinese). Vol. 18(2002), p.484.

Google Scholar

[15] M.B. Gülmezoglu and D. Vakif. IEEE Transactions on Speech and Audio Processing. Vol. 7(1999), p.620.

Google Scholar

[16] M.B. Gülmezoglu and D. Vakif. IEEE Transactions on Speech and Audio Processing. Vol. 9(2001), p.655.

Google Scholar