A Fast Algorithm for Chinese Text Categorization Based on Key Tree

Abstract:

Article Preview

To solving Chinese text categorization, a fast algorithm is proposed. The basic idea of the algorithm is: first constructs a weighted value of keywords dictionary which is constructed in key tree, then using the Hash function and the principle of giving priority for long term matching to mapping the strings in documentations to the dictionary. After that, calculate the sum of weights of the keywords which has been matched successfully. Finally take the maximum for the result of the classification. The algorithm can avoid the difficulty of Chinese word segmentation and its influence on accuracy of result. Theoretical analysis and experimental results indicate that the accuracy and the time efficiency of the algorithm is higher, whose comprehensive performance reaches to the level of current major technology.

Info:

Periodical:

Edited by:

Qi Luo

Pages:

1106-1112

DOI:

10.4028/www.scientific.net/AMM.58-60.1106

Citation:

X. Liu et al., "A Fast Algorithm for Chinese Text Categorization Based on Key Tree", Applied Mechanics and Materials, Vols. 58-60, pp. 1106-1112, 2011

Online since:

June 2011

Export:

Price:

$35.00

In order to see related information, you need to Login.

In order to see related information, you need to Login.