The Research on Chinese Automatic Segmentation

Article Preview

Abstract:

Abstract. Word Segmentation is a fundamental problem of the Chinese natural language progressing. Based on the analysis of the state on research and key issues of it .The paper introduces a tree structure statistical method of word frequency, which enables key words to match one another highly-efficiently. by which we can rapidly express texts as the set of high-frequency words,so the classification of texts is conveniently reached.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 798-799)

Pages:

818-821

Citation:

Online since:

September 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Huihsin Tseng, Pichuan Chang et al. A conditional random field word segmenter for SIGHAN Bakeoff 2005. In: Proceedings of the Fourth SIGHAN Work-shop on Chinese Language Processing. J eju Island, Korea, 2005: 168~171.

DOI: 10.3115/v1/w14-6815

Google Scholar

[2] Fei Hongxiao and Kang Songlin, Chinese Word Segmentation Research Based on Statistic the Frequency of the Word, Computer Engineering and Applications, Vol. 7, No. 1, 2005, pp: 67-68.

Google Scholar

[3] Mia K . Stern, Joseph E. Beverly Park Woof. Native Bayes Classifiers for User Modeling.

Google Scholar

[4] Sara Baase and Allen Van Gelder, Computer algorithms: introduction to design and analysis, Addison-Wesley, (2000).

Google Scholar

[5] Hai Zhao, Chang-Ning Huang, Mu Li. An im-proved Chinese word segmentation system with conditional random field. In: Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, Sydney: 2006, 7: 108~117.

Google Scholar

[6] Pedro Domingos, Michael Pazzzani. On the Optimality of the Simple Bayesian Classifier under zero-one Loss. Machine Learning. 29 , 103-130, (1997).

Google Scholar

[7] . Elkan. Boosting and Naïve Bayesian Learning. In Technical Report CS97, Dept. of Computer Science and Engineering, Univ. Calif at SanDiego, Sept. (1997).

Google Scholar