Study on the Method of Feature Selection Based on Hybrid Model for Text Classification

Article Preview

Abstract:

In this paper, we study on the problem of how to combine feature selection models in text classification ,and present a method through build the hybrid model for feature selection ,this hybrid model combined with advantage of four feature selection models (DF,MI, IG, CHI), then we use the Naive Bayes model as classifier to verify the effect of the hybrid feature selelction model ,and experiments shows that the hybrid model is correct and effective and get good performance in text classification.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 433-440)

Pages:

2881-2886

Citation:

Online since:

January 2012

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2012 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Chih H. B., Kulathuramaiyer N., An Empirical Study of Feature Selection for Text Categorization based on Term Weightage. IEEE/WIC/ACM International Conference on Web Intelligence, 2004. pg: 59—602.

DOI: 10.1109/wi.2004.10060

Google Scholar

[2] Mladenie D., Brank J, Grobelnik M., Milic-Frayling N., Feature Selection using Linear Classifier Weights: Interaction with Classification Models. 27th ACM SIGIR Conference on Research and Development in Information Retrieval, 2004. pg 234-241.

DOI: 10.1145/1008992.1009034

Google Scholar

[3] Yan J., Liu N., Zhang B., Yan S., Chen Z., Cheng Q., Fan Q., Ma W. OCFS: optimal orthogonal centroid feature selection for text categorization. Proceedings of the 28th annual international ACM SIGIR conference on Research and development in Information Retrieval, 2005. pg 122-129.

DOI: 10.1145/1076034.1076058

Google Scholar

[4] Zheng Z., Srihari R., Optimally Combining Positive and Negative Features for Text Categorization. In Proceedings of the ICML, Workshop on Learning from Imbalanced Datasets II, Washington DC, (2003).

Google Scholar

[5] Dumais ST, Platt J, Heckerman D, Sahami M (1998) Inductive learning algorithms and.

Google Scholar

[1] representations for text categorization. In: Proceedings of the Seventh International Conference on Information and Knowledge Management (CIKM-98), pp.148-155. Bethesda, MD, USA.

Google Scholar