A Text Categorization Method Based on SVM and Improved K-Means

Rong Ze Xia; Yan Jia; Hu Li

doi:10.4028/www.scientific.net/AMM.427-429.2449

Applied Mechanics and Materials Vols. 427-429, p. 2449

Abstract:

Traditional supervised classification methods such as the support vector machine (SVM) can achieve high performance in text categorization. However, the training samples must first be labeled by hand, which is a time-consuming task. Unsupervised methods such as k-means can also be used for text categorization, but traditional k-means is easily affected by a few isolated observations. In this paper, we propose a new text categorization method. First, we improve the traditional k-means clustering algorithm. The improved k-means is used to cluster the vectors in our vector space model. We then use the SVM to categorize the vectors that have been preprocessed by the improved k-means. Experiments show that our algorithm outperforms the traditional SVM text categorization method.
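
The sensitivity of traditional k-means to isolated observations follows from its centroid update, which is the plain mean of a cluster's members. The sketch below illustrates this on toy two-dimensional vectors. The abstract does not say how the k-means algorithm is improved, so `trimmed_centroid` is only a hypothetical remedy chosen for illustration, not the method of the paper; the function names, the `factor` parameter, and the data are all assumptions.

```python
import math


def centroid(points):
    """Mean of equal-length vectors: the standard k-means centroid update."""
    n = len(points)
    return [sum(p[i] for p in points) / n for i in range(len(points[0]))]


def trimmed_centroid(points, factor=2.0):
    """Hypothetical remedy (NOT the paper's method): drop points lying farther
    than `factor` times the median distance from the plain centroid, then
    recompute the mean over the remaining points."""
    c = centroid(points)
    dists = [math.dist(p, c) for p in points]
    median = sorted(dists)[len(dists) // 2]
    kept = [p for p, d in zip(points, dists) if d <= factor * median]
    return centroid(kept)


# Toy data: four tightly grouped vectors plus one isolated observation.
cluster = [[1.0, 1.0], [1.2, 0.9], [0.9, 1.1], [1.1, 1.0]]
outlier = [9.0, 9.0]

plain = centroid(cluster)                       # centroid without the outlier
skewed = centroid(cluster + [outlier])          # pulled toward the outlier
robust = trimmed_centroid(cluster + [outlier])  # outlier filtered out first

print("plain: ", plain)
print("skewed:", skewed)
print("robust:", robust)
```

On this toy data a single isolated vector moves the plain centroid well away from the group, while the trimmed variant recovers the centroid of the four grouped vectors.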