Speech Coding Based on Compressed Sensing and Sparse Representation

Shang Jing Li; Qi Zhu

doi:10.4028/www.scientific.net/AMM.667.242

Paper Titles

Problems of KDD Cup 99 Dataset Existed and Data Preprocessing
p.218

Remote Sensing Image Segmentation Based on Improved Statistical Region Merging
p.226

Research on China Imaging Altimeter Planning Problem for Area Target Observation
p.230

Simulation Research for Petal Color
p.237

Speech Coding Based on Compressed Sensing and Sparse Representation
p.242

Static Gesture Recognition Based on RGB-D Depth Information
p.248

A Calculating Method for Adaptive Human Shadows Moving Target Detection Algorithm
p.255

Research on Feature Extraction and Optimization of Cashmere and Wool Fiber Based on Digital Image
p.260

Terminal Aimpoint Selection Algorithm Based on Convex Hull Technique
p.264

HomeApplied Mechanics and MaterialsApplied Mechanics and Materials Vol. 667Speech Coding Based on Compressed Sensing and...

Speech Coding Based on Compressed Sensing and Sparse Representation

Abstract:

In this paper, we propose a novel speech coding scheme based on compressed sensing and sparse representation. Compressed sensing (CS) attracts great interest for its ability to utilize a few measurements to recover original signals. Measurements preserve part of speech features while projected by row echelon matrix. A dictionary is learned in order to contain redundant information about speech measurements. The synthesized speech is recovered from a sparse approximation of the corresponding measurement. A rear low-pass filter is adopted to improve the subject quality of synthesized speech. Results show that the proposed coding scheme has achieved average Mean Opinion Score (MOS) of the synthesized speech 3.083 in an appropriate bit rate (4.2 Kbps), which outperforms the quality of Code excited linear prediction (CELP).

You might also be interested in these eBooks

Advances in Computers, Electronics and Mechatronics

View Preview

Info:

Periodical:

Applied Mechanics and Materials (Volume 667)

Pages:

242-247

DOI:

https://doi.org/10.4028/www.scientific.net/AMM.667.242

Citation:

Cite this paper

Online since:

October 2014

Authors:

Shang Jing Li*, Qi Zhu

Keywords:

Compressed Sensing, Measurement Modeling, Sparse Representation, Speech Coding

Export:

RIS, BibTeX

Price:

Permissions CCC:

Request Permissions

Permissions PLS:

Request Permissions

Сopyright:

Citation:

* - Corresponding Author

References

[1] Donoho D L. Compressed sensing. IEEE Transaction on Information Theory, 2006, vol. 52, no. (4) , pp.1289-1306.

Google Scholar

[2] Ye L, Yang Z, Wang T J, et al. Compressed Sensing of Speech Signal Based on Row Echelon Measurement Matrix and Dual Affine Scaling Interior Point Reconstruction Method. Acta Electronica Sinca, 2012, vol. 40, no. 3, pp.429-434.

Google Scholar

[3] Elad M. Sparse and Redundant Representations: From Theory to Applications in Signal and Image Processing (Springer, USA 2010).

DOI: 10.1007/978-1-4419-7011-4

Google Scholar

[4] Tropp J A, Gilbert A C. Signal Recovery Form Random Measurements Via Orthogonal Matching Pursuit. IEEE Transactions on Information Theory, 2007, vol. 53, pp.4655-4666.

DOI: 10.1109/tit.2007.909108

Google Scholar

[5] Engan K, Aase S, Hakon-Husoy J. Method of optimal directions for frame design. IEEE International Conference on Acoustics, Speech, and Signal Processing, 1999. Proceedings. 1999, vol. 5, pp.2443-2446.

DOI: 10.1109/icassp.1999.760624

Google Scholar

[6] Aharon M, Elad M, Bruckstein A. K-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Re presentation. IEEE Transactions on Signal Processing, 2006, vol. 54, no. 11, pp.4311-4322.

DOI: 10.1109/tsp.2006.881199

Google Scholar

[7] Skretting K, Engan K. Recursive Least Squares Dictionary Learning Algorithm. IEEE Transactions on Signal Processing, 2010, vol. 58, no. 4, pp.2121-2130.

DOI: 10.1109/tsp.2010.2040671

Google Scholar

[8] Ye L, Yang Z, Sun L H. New low bit rate speech coding scheme based on compressed sensing. Chinese Journal of Scientific Instrument, 2011, vol. 32, no. 12, pp.2688-2692.

Google Scholar

[9] National Communication System, Office of Technology and Standards. Federal Standard 1016, Analog to digital conversion of radio voice by 4800 bit/second code excited linear predictive coding, (1991).

Google Scholar