Research of Web-Page Multi-Class Classification System Based on Support Vector Machine

Article Preview

Abstract:

A Chinese web-page classification algorithm based on SVM including the important aspects of text preprocessing, feature selection and multiple-Classification algorithm. In this paper, based on the analyses of features of Web documents, this paper does research the approach of classification in Support Vector Machine (SVM) and select of Kernel function. Furthermore, a web-page classification model and algorithm that is based on Binary Tree SVM is presented. The experiments show that it not only reduces the size of train set, but also has very high training efficiency. Its precision and recall are better.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

2312-2316

Citation:

Online since:

January 2015

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2015 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] Zhiquan Qi, Yingjie Tian, Yong Shi. Laplacian twin support vector machine for semi-supervised classification [J]. Neural Networks. (2012).

DOI: 10.1016/j.neunet.2012.07.011

Google Scholar

[2] David D Lewis. Feature Selection and Feature Extraction for Text Categorization A. In Proceedings of Speech and Natural Language Workshop C. Defense Advanced Research Projects Agency, Morgan Kaufmann, 1992, 212-217.

DOI: 10.3115/1075527.1075574

Google Scholar

[3] Lin Mu. Performance Comparison Based on Support Vector Machine Algorithm and Other Algorithms in Text Categorization, Journal of Inner Mongolia University (Natural Science Edition), vo1. 42, pp.703-707, (2011).

Google Scholar

[4] Cristianini N, Shawe-Taylor J. An introduction to support vector machines and other kernel-based learning methods [M]. Cambridge: Cambridge University Press, (2000).

DOI: 10.1017/cbo9780511801389

Google Scholar

[5] Vapnik V N. Statistical learning theory [M]. New York: Wiley, (1998).

Google Scholar

[6] Zheng Chun-hong, Jiao Li-cheng Fuzzy pre-extracting method for support vector machine[C]/Proceedings of the first International Conference on Machine Learning and Cybernetics, Beijing, 4-5 November (2002).

DOI: 10.1109/icmlc.2002.1175393

Google Scholar