Speech Recognition Algorithm Based on Nonlinear Partition and GFCC Features

Xiang Tao Meng; Shi Yin

doi:10.4028/www.scientific.net/AMM.556-562.3069

Abstract:

To speed up speech recognition and make it more robust, this paper proposes a recognition algorithm based on segment-level GFCC (gammatone frequency cepstral coefficient) features. In both the training and testing stages, segment-level GFCC features, which are more robust to noise, replace the widely used MFCC (Mel-frequency cepstral coefficient) features. Experimental results show that training time and test time both decrease while recognition accuracy improves.
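
The abstract does not say how the nonlinear partition is computed or how frame-level GFCC vectors are pooled into segment-level features. As a rough illustration only, the sketch below assumes one common scheme: frames are grouped so that each segment covers an equal share of the accumulated frame-to-frame feature change, and each segment is represented by the mean of its frames. The function name `segment_level_features`, the Euclidean change measure, and the mean pooling are all assumptions made for this example, not the authors' method, and the input is taken to be an already computed (T, D) feature matrix.

```python
import numpy as np


def segment_level_features(frames, num_segments):
    """Pool a (T, D) frame-level feature matrix into (num_segments, D).

    Hypothetical sketch: frames are partitioned by accumulated
    frame-to-frame change (an assumed form of nonlinear partition),
    then each segment is averaged.
    """
    frames = np.asarray(frames, dtype=float)
    if frames.ndim != 2 or frames.shape[0] == 0:
        raise ValueError("frames must be a non-empty (T, D) array")
    if num_segments < 1:
        raise ValueError("num_segments must be at least 1")

    # Euclidean distance between consecutive frames; frame 0 contributes 0.
    change = np.linalg.norm(np.diff(frames, axis=0), axis=1)
    cumulative = np.concatenate(([0.0], np.cumsum(change)))
    total = cumulative[-1]

    if total > 0:
        # Position of each frame along the accumulated-change axis, in [0, 1].
        position = cumulative / total
    else:
        # Constant input: fall back to a uniform partition in time.
        position = np.arange(len(frames)) / len(frames)

    # Map each frame to a segment index in 0 .. num_segments - 1.
    labels = np.minimum((position * num_segments).astype(int), num_segments - 1)

    segments = np.zeros((num_segments, frames.shape[1]))
    for k in range(num_segments):
        members = frames[labels == k]
        if len(members) > 0:
            segments[k] = members.mean(axis=0)
        else:
            # A large jump can leave a segment empty; reuse the previous one.
            # Segment 0 always contains frame 0, so k - 1 is valid here.
            segments[k] = segments[k - 1]
    return segments
```

Under these assumptions every utterance maps to a fixed-size matrix regardless of its duration, which is one plausible reason a segment-level representation could shorten training and testing compared with processing every frame.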

Info:

Periodical:

Applied Mechanics and Materials (Volumes 556-562)

Pages:

3069-3073

DOI:

https://doi.org/10.4028/www.scientific.net/AMM.556-562.3069

Online since:

May 2014

Authors:

Xiang Tao Meng*, Shi Yin

Keywords:

GFCC, Nonlinear Partition, Speech Recognition

* - Corresponding Author
