Research on the Chinese Word Segmentation System Based on Incremental Learning

Article Preview

Abstract:

In order to cope with the increasing size of the training corpus and adapt to the requirements of incremental learning, this paper introduces a feature selection algorithm of maximum entropy model into the research of Chinese word segmentation technology, designs and implements a Chinese word segmentation system based on incremental learning. The experimental results show that the system gradually improves the segmentation accuracy in the incremental learning process which without wasting time to restudy.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

3469-3473

Citation:

Online since:

August 2014

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] Ying Qin, Xiaojie Wang, Suxiang Zhang, in: The research of combinatorial ambiguity in Chinese word segmentation. Journal of Chinese Information Processing, China(2007).

Google Scholar

[2] Chengqing Zong, in: Statistical natural language processing. Tsinghua University press, Beijing, China(2008).

Google Scholar

[3] Fanjin Mai, Ting Wang, in: Word segmentation disambiguation model based on the bidirectional maximum matching and HMM. Modern Library and Information Technology, China(2008).

Google Scholar

[4] Rong Zeng, Jiancheng Liu, Xinhua Jiang, in: An incremental learning algorithm based on support vector machine (SVM). Journal of Railway Science and Engineering, China(2005).

Google Scholar

[5] Wenhua Zeng, Jian Ma, in: A new support vector machine (SVM) incremental learning algorithm. Natural Science , Journal of Xiamen University, China (2002).

Google Scholar