Improved Algorithm Based on TFIDF in Text Classification

Abstract:

The traditional feature weighting algorithm TFIDF ignores several other factors that affect feature weight. This paper discusses those factors in detail and proposes a new feature weighting algorithm, NTFIDF, which combines them with TFIDF. Experiments with a KNN classifier show that NTFIDF performs better than TFIDF in text classification.
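
For orientation, the baseline that the abstract refers to can be sketched as follows. This is a minimal illustration of the common textbook TF-IDF weight, tf(t, d) * log(N / df(t)), and not the paper's NTFIDF: the preview does not state which additional factors NTFIDF uses, so none are reproduced here. The function name `tfidf` and the toy documents are assumptions made for the example.

```python
import math
from collections import Counter

def tfidf(docs):
    """Return one {term: weight} dict per tokenized document.

    Uses the common textbook form tf * log(N / df); other TF-IDF
    variants (smoothing, normalization) exist and are not shown.
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            # Relative term frequency times inverse document frequency.
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights

# Hypothetical toy corpus, already tokenized.
docs = [
    ["text", "classification", "knn"],
    ["text", "feature", "weight"],
    ["knn", "classifier", "text"],
]
weights = tfidf(docs)
```

In this toy corpus the term "text" occurs in every document, so its inverse document frequency is log(3/3) = 0 and it receives zero weight, while a term confined to a single document receives the largest weight.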

Info:

Periodical:

Advanced Materials Research (Volumes 403-408)

Pages:

1791-1794

Online since:

November 2011

Copyright:

© 2012 Trans Tech Publications Ltd. All Rights Reserved
