Features Extraction for Lhasa Tibetan Speech Recognition

Guan Yu Li; Hong Zhi Yu; Yong Hong Li; Ning Ma

doi:10.4028/www.scientific.net/AMM.571-572.205

Paper Titles

A Hybrid IGA-SA Algorithm for Optimization Problems in Fault Diagnosis
p.187

Efficient Particle Swarm Optimization Algorithm Based on Affinity Propagation
p.191

Estimation of Initial Field in the Bohai Sea with the Adjoint Method: A Comparative Study on Optimization Algorithms
p.196

Fault Diagnosis of Transformer Based on RBF Neural Network
p.201

Features Extraction for Lhasa Tibetan Speech Recognition
p.205

FECG Extraction Algorithm Based on BSS Using Temporal Structure and DWT
p.209

Fuzzy Clustering Segmentation Algorithm Research for Biomedical Image Based on Artificial Life
p.213

Hourly Solar Radiation Forecast Based on k-NN Nonparametric Regression Model
p.217

Influence Diffusion Model Based on Semantic Orientation and its Application in Opinion Leader Identification
p.223

HomeApplied Mechanics and MaterialsApplied Mechanics and Materials Vols. 571-572Features Extraction for Lhasa Tibetan Speech...

Features Extraction for Lhasa Tibetan Speech Recognition

Abstract:

Speech feature extraction is discussed. Mel frequency cepstral coefficients (MFCC) and perceptual linear prediction coefficient (PLP) method is analyzed. These two types of features are extracted in Lhasa large vocabulary continuous speech recognition system. Then the recognition results are compared.

You might also be interested in these eBooks

View Preview

Info:

Periodical:

Applied Mechanics and Materials (Volumes 571-572)

Pages:

205-208

DOI:

https://doi.org/10.4028/www.scientific.net/AMM.571-572.205

Citation:

Cite this paper

Online since:

June 2014

Authors:

Guan Yu Li*, Hong Zhi Yu, Yong Hong Li, Ning Ma

Keywords:

ASR, Feature Extraction, MFCC, PLP

Export:

RIS, BibTeX

Price:

Permissions CCC:

Request Permissions

Permissions PLS:

Request Permissions

Сopyright:

Citation:

* - Corresponding Author

References

[1] Gongque Jiangcuo, Theory of Tibetan, Tibetan Studies . 1997 III.

Google Scholar

[2] Kelsang Jumian, Practical Tibetan grammar course. Sichuan minorities press. November 2004 edition.

Google Scholar

[3] Han Qinghua, Yu Hongzhi. Anduo Tibetan speaker-independent isolated words speech recognition based on HMMs. Software Guide . 2010 07.

Google Scholar

[4] Pei Chun Bao. The Tibetan language speech recognition technology based on the standard Lhasa, Master Thesis in Tibet University , (2009).

Google Scholar

[5] The HTK Book(for HTK Version 3. 4). Cambridge University Engineering Department. (2009).

Google Scholar

[6] Website: http: /htk. eng. cam. ac. uk.

Google Scholar

[7] L awrence Rabiner, Biing-Hwang Juang. Fundamentals of Speech Recognition, Tsinghua University Press Copy.

Google Scholar

[8] Nichong Jia, Liu Wen Ju, Xu Bo. Chinese large vocabulary continuous speech recognition system progress. Chinese Information. Volume 23 No. 1 January (2009).

Google Scholar

[9] Li Yonghong, Kong Jiangping, Yu Hongzhi. Automatically convert Tibetan language audio and its implementation. Tsinghua University (Natural Science) . 2008 Volume 48 of the S1.

Google Scholar

[10] Zheng Fang, Wen Hu Wu, Fang Ditang. Recognition Keyword Research of Continuous stream voice. Fourth National Conference on Human Machine Speech Communication Proceedings, (1996).

Google Scholar

[11] Gao Sheng, XU Bo, HUANG Taiyi. Chinese triphone model Based on Acoustics Decision Tree. Vol 25 No. 6 November (2000).

Google Scholar

[12] Julian James Odell. The Use of Context in Large Vocabulary Speech Recognition. University of Cambridge. March (1995).

Google Scholar