p.1807
p.1812
p.1816
p.1820
p.1824
p.1829
p.1833
p.1838
p.1842
Comparison and Improvements of Feature Extraction Methods for Text Categorization
Abstract:
Feature extraction is a key point of text categorization[1]. The accuracy of extraction will directly affect the accuracy of text classification. This paper introduces and compares 4 commonly used methods of text feature extraction: IG (Information gain), MI (Mutual information), CHI (statistics), DF (Document frequency), and proposes an improved method based on the method of CHI. Experiment result shows that the proposed method can improve the accuracy of text categorization.
Info:
Periodical:
Pages:
1824-1828
Citation:
Online since:
August 2014
Authors:
Keywords:
Price:
Сopyright:
© 2014 Trans Tech Publications Ltd. All Rights Reserved
Share:
Citation: