Study of English Pronunciation Quality Evaluation System with Tone and Emotion Analysis Capabilities

Xin Guang Li; Su Mei Li; Li Rui Jiang; Sheng Bin Zhang

doi:10.4028/www.scientific.net/AMM.475-476.318

Paper Titles

A New 3D Coordinates Acquisition Method on Base of Single Camera
p.296

Method for Testability Demonstration Test Based on Cost Optimization
p.301

Cloud Computing Model for Big Geological Data Processing
p.306

Membership Calculation Based on Dimension Hierarchical Division
p.312

Study of English Pronunciation Quality Evaluation System with Tone and Emotion Analysis Capabilities
p.318

A Study for Removal of Pipe Vibration on Swirlmeter
p.324

About Timekeeping Activities at NTSC
p.330

Image Stereo Correspondence Method for Stereo Vision
p.337

A Color Image Enhancement Based on Improved Genetic Algorithm
p.342

HomeApplied Mechanics and MaterialsApplied Mechanics and Materials Vols. 475-476Study of English Pronunciation Quality Evaluation...

Study of English Pronunciation Quality Evaluation System with Tone and Emotion Analysis Capabilities

Abstract:

During the study of English sentence pronunciation evaluation system, we found that sentence pronunciation emotion and intonation evaluation are very important. Probabilistic neural network has been used to study English sentence pronunciation emotion, and DTW (Dynamic Time Warping) algorithm has been used in the intonation analysis. The probability neural network basic principle is introduced in this paper. An emotion recognition algorithm based on MFCC（Mel　Frequency　Cepstrum　Coefficient）is present. The keynote and energy of the sentences are used to analyse the accuracy of the tones.　The experimental results of the proposed method effectiveness are given.

You might also be interested in these eBooks

Sensors, Measurement and Intelligent Materials II

View Preview

Info:

Periodical:

Applied Mechanics and Materials (Volumes 475-476)

Pages:

318-323

DOI:

https://doi.org/10.4028/www.scientific.net/AMM.475-476.318

Citation:

Cite this paper

Online since:

December 2013

Authors:

Xin Guang Li*, Su Mei Li, Li Rui Jiang, Sheng Bin Zhang

Keywords:

Emotion Recognition, Probabilistic Neural Network, Pronunciation Evaluation, Tone

Export:

RIS, BibTeX

Price:

Permissions CCC:

Request Permissions

Permissions PLS:

Request Permissions

Сopyright:

Citation:

* - Corresponding Author

References

[1] Yashpalsing Chavhan, M. L. Dhore, Pallavi Yesaware, Speech Emotion Recognition Using Support Vector Machine, International Journal of Computer Applications, vol. 1, pp.6-9, February (2010).

DOI: 10.5120/431-636

Google Scholar

[2] He L., Lech M., Maddage N., Memon S., and Allen N., Emotion Recognition in Spontaneous Speech within Work and Family Environments, iCBBE (2009).

DOI: 10.1109/icbbe.2009.5162772

Google Scholar

[2] Franco H, Neumeyer L, Ramos M, et al. Automatic detection of phone-level mispronunciation for language learning[C]/Proc. Eurospeech. 1999, 2: 851-854.

DOI: 10.21437/eurospeech.1999-207

Google Scholar

[3] Franco H, Neumeyer L, Kim Y, et al. Automatic pronunciation scoring for language instruction[C]/Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on. IEEE, 1997, 2: 1471-1474.

DOI: 10.1109/icassp.1997.596227

Google Scholar

[4] Kim Y, Franco H, Neumeyer L. Automatic pronunciation scoring of specific phone segments for language instruction[C]/Proc. Eurospeech. 1997, 97: 645-649.

DOI: 10.21437/eurospeech.1997-230

Google Scholar

[5] Neumeyer L, Franco H, Weintraub M, et al. Automatic text-independent pronunciation scoring of foreign language student speech[C]/Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on. IEEE, 1996, 3: 1457-1460.

DOI: 10.21437/icslp.1996-372

Google Scholar

[6] D. F. Specht, Probabilistic neural networks, Neural Networks, vol. 3, pp.109-118, (1990).

Google Scholar

[7] Ezzat T., Tomaso Poggio T. Discriminative Word-Spotting Using Ordered Spectro-Temporal Patch Features, to appear, SAPA workshop, Interspeech, Brisbane, Australia, (2008).

Google Scholar

[8] Frank,A. &Asuncion,A. (2010). UCI Machine Learning Repository [http: /archive. ics. uci. ed u/ml] . Irvine, CA: University of California, School of Information and Computer Science.

Google Scholar

[9] Burkhardt F, Paeschke A, Rolfes M, et al. A database of German emotional speech[C]/Proc. Interspeech, (2005).

DOI: 10.21437/interspeech.2005-446

Google Scholar

[10] Bouvrie J., Ezzat T., Poggio T., Localized Spectro-Temporal Cepstral Analysis of Speech, ICASSP, Las Vegas, Nevada, (2008).

DOI: 10.1109/icassp.2008.4518714

Google Scholar

[11] Keogh E. Exact indexing of dynamic time warping[C]/Proceedings of the 28th international conference on Very Large Data Bases. VLDB Endowment, 2002: 406-417.

DOI: 10.1016/b978-155860869-6/50043-3

Google Scholar

[12] Jyh-Shing Roger Jang, Speech and Audio Processing (SAP) Toolbox", available at "http: /mirlab. org/jang/matlab/toolbox/sap.

Google Scholar