A Low-Complexity 3.6kbps Speech Coding Algorithm Based on Mixed Excitation

Abstract:

This paper presents a low-complexity 3.6 kb/s speech coding algorithm based on mixed excitation. The algorithm combines parametric coding with mixed excitation to preserve speech quality. Scalar quantization of the Line Spectrum Frequencies (LSFs) reduces both storage and computational complexity. In addition, an improved frame-type classification with dynamic unvoiced/voiced (U/V) thresholds reduces the U/V decision errors of the traditional approach and suppresses abrupt U/V transitions between frames. A modified bit allocation table is also introduced. PESQ-MOS tests show that the synthesized speech quality improves and reaches communication quality, particularly for high-pitched female speakers when the new frame type is used.
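To make the storage argument concrete, the following is a minimal sketch of per-coefficient scalar quantization of LSFs, together with the bit budget implied by the 3.6 kb/s rate. It is not the paper's quantizer: the uniform step size, the LSF range of 0 to π, the example bit allocation, and the 22.5 ms frame length (borrowed from standard MELP) are all assumptions made for illustration, and the function names are hypothetical.

```python
import math


def bits_per_frame(bit_rate_bps, frame_ms):
    """Bit budget available for one frame at a given bit rate."""
    return bit_rate_bps * frame_ms / 1000.0


def quantize_lsf(lsf, bits, lo=0.0, hi=math.pi):
    """Uniform scalar quantization: each LSF is coded independently.

    Assumption: a uniform quantizer over [lo, hi). The paper's actual
    quantization tables are not given in the abstract.
    """
    indices = []
    for value, b in zip(lsf, bits):
        levels = 1 << b                      # 2**b reconstruction levels
        step = (hi - lo) / levels
        idx = int((value - lo) / step)
        idx = min(max(idx, 0), levels - 1)   # clamp to the valid index range
        indices.append(idx)
    return indices


def dequantize_lsf(indices, bits, lo=0.0, hi=math.pi):
    """Reconstruct each LSF at the midpoint of its quantization cell."""
    out = []
    for idx, b in zip(indices, bits):
        step = (hi - lo) / (1 << b)
        out.append(lo + (idx + 0.5) * step)
    return out


# Hypothetical LSF vector (radians) and bit allocation, for illustration only.
lsf = [0.3, 0.6, 1.0, 1.4, 1.9, 2.4]
bits = [4, 4, 3, 3, 3, 3]
indices = quantize_lsf(lsf, bits)
reconstructed = dequantize_lsf(indices, bits)
frame_budget = bits_per_frame(3600, 22.5)    # assumed 22.5 ms frame
```

A uniform scalar quantizer of this kind needs no stored codebook and no codebook search: encoding is one division per coefficient. A vector quantizer, by contrast, has to store and search a table of code vectors, which is the storage and complexity cost the abstract refers to.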

Pages: 1282-1286

Online since: September 2013

Copyright: © 2013 Trans Tech Publications Ltd. All Rights Reserved
