Chinese Text Classification with a KNN Classifier Using an Adjusted Feature Weighting Method

Article Preview

Abstract:

KNN algorithm is used for Chinese text classification in this paper. First, TF-IDF is chosen as the feature weighting method. To the characteristics of corpus used in this paper, TF-IDF is adjusted to a new method. At last, experimental result shows the accuracy of KNN text classifier can be improved with the adjusted feature weighting method.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

700-703

Citation:

Online since:

February 2011

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2011 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Songbo Tan: An effective refinement strategy for KNN text classifier. Expert Systems with Applications Vol. 30-2(2006), pp.290-298.

DOI: 10.1016/j.eswa.2005.07.019

Google Scholar

[2] Yu-Long Qiao , Jeng-Shyang Pan , Sheng-He Sun : Improved K nearest neighbor classification algorithm. Circuits and Systems Vol. 2(2005), pp.1101-1104.

DOI: 10.1109/apccas.2004.1413076

Google Scholar

[3] Huang Wei, Liu Yi , Gao Bing , Yang Ke-wei. Study on Method of Word Segmentation in Feature Selection in Chinese Text Categorization, Proceedings of Third International Conference on Knowledge Discovery and Data Mining, (2010).

DOI: 10.1109/wkdd.2010.61

Google Scholar

[4] Jung-Yi Jiang , Shie-Jue Lee . A Weight-based Feature Extraction Approach for Text Classification, Proceedings of Second International Conference on Innovative Computing, Information and Control, (2007).

DOI: 10.1109/icicic.2007.109

Google Scholar

[5] Jain, A., Zongker, D.: Feature selection: evaluation, application, and small sample performance. Pattern Analysis and Machine Intelligence Vol. 19-2(1997), pp.153-158.

DOI: 10.1109/34.574797

Google Scholar

[6] Yanling Li , Jing Yuan , Xia Ye : Method for feature word weight calculating, Proceedings of IEEE International Conference on Intelligent Computing and Intelligent Systems, (2009).

DOI: 10.1109/icicisys.2009.5357840

Google Scholar

[7] Information on http: /www. searchforum. org. cn/tansongbo/corpus-senti. htm.

Google Scholar