Text Clustering Algorithm of Co-Occurrence Word Based on Association-Rule Mining

Article Preview

Abstract:

According to the analysis of text feature, the document with co-occurrence words expresses very stronger and more accurately topic information. So this paper puts forward a text clustering algorithm of word co-occurrence based on association-rule mining. The method uses the association-rule mining to extract those word co-occurrences of expressing the topic information in the document. According to the co-occurrence words to build the modeling and co-occurrence word similarity measure, then this paper uses the hierarchical clustering algorithm based on word co-occurrence to realize text clustering. Experimental results show the method proposed in this paper improves the efficiency and accuracy of text clustering compared with other algorithms.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

1749-1752

Citation:

Online since:

August 2014

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] Li Heng, Zhu Jing-bo and Yao Tian-shun. SVM based chinese text chunking. Journal of Chinese information processing, Vol. 18(2004), pp.1-7.

Google Scholar

[2] Zhao Shi-qi, Liu Ting and Li Sheng. A topical document clustering method. Journal of Chinese information processing, Vol. 21(2007), pp.58-62.

Google Scholar

[3] Zhang Cheng-zhi, Zhang Qing-guo. Topic navigation generation using topic extraction and clustering. KAM, Vol. 88(2008), pp.333-339.

Google Scholar

[4] Zhang Jian-pei, Yang Yun and Yang jing. Algorithm for initialization of K-means clustering centre based on optimized-division. Journal of system simulation, Vol. 21(2009), pp.2586-2590.

Google Scholar

[5] Cao Tian, Zhou Li and Zhang Guo-xuan. Text similarity computing based on word co-occurrence. Computer engineering & science, Vol. 29(2007), pp.52-54.

Google Scholar