Key Technologies of Speech Services in the Internet Era

Abstract:

As mobile devices have become widespread, speech-based communication has grown markedly. Speech interaction imposes a low cognitive load and does not demand the user's constant attention; it also requires little physical space while carrying a large amount of information with high interaction efficiency. The main challenge facing Internet speech information services is speech interaction. This paper analyzes four key technologies: user behavior modeling for speech services in the Internet era, multi-channel perception in speech interaction, speech information service platform architecture, and visual analysis of speech information. It also discusses the development and application prospects of speech interaction.

Pages: 2201-2205

Online since: September 2013

Copyright: © 2013 Trans Tech Publications Ltd. All Rights Reserved
