An Effective K-Means Clustering Based SVM Algorithm

Article Preview

Abstract:

Support Vector Machine (SVM) is one of the most popular and effective data mining algorithms which can be used to resolve classification or regression problems, and has attracted much attention these years. SVM could find the optimal separating hyperplane between classes, which afford outstanding generalization ability with it. Usually all the labeled records are used as training set. However, the optimal separating hyperplane only depends on a few crucial samples (Support Vectors, SVs), we neednt train SVM model on the whole training set. In this paper a novel SVM model based on K-means clustering is presented, in which only a small subset of the original training set is selected to constitute the final training set, and the SVM classifier is built through training on these selected samples. This greatly decrease the scale of the training set, and effectively saves the training and predicting cost of SVM, meanwhile guarantees its generalization performance.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

1344-1348

Citation:

Online since:

July 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] V. Vapnik, Statistical Learning Theory, Wiley, New York, (1998).

Google Scholar

[2] Dervis Karaboga, Celal Ozturk, A novel clustering approach: Aritificial Bee Colony (ABC) algorithm, Applied Soft Computing, vol. 11, no. 1, pp.652-657, (2011).

DOI: 10.1016/j.asoc.2009.12.025

Google Scholar

[3] Yan Yue, A Multi-Classified Method of Support Vector Machine (SVM) Based on Entropy, Applied Mechanics and Materials, vol. 241-244, pp.1629-1632, (2012).

DOI: 10.4028/www.scientific.net/amm.241-244.1629

Google Scholar

[4] Qiang Wu, SVM Soft Margin Classifiers: Linear Programming versus Quadratic Programming, Neural Computation, vol. 17, no. 5, pp.1160-1187, (2005).

DOI: 10.1162/0899766053491896

Google Scholar

[5] Colin Campbell, Kernel methods: a survey of current techniques, Neurocomputing, vol. 48, no. 1-4, pp.63-84, (2002).

Google Scholar

[6] Lin Yujun, Luo Ting Yao Sheng, Mo Kaikai, Xu Tingting, An improved clustering method based on k-means, Fuzzy Systems and Knowledge Discovery(FSKD), pp.734-737, (2012).

DOI: 10.1109/fskd.2012.6234296

Google Scholar

[7] Olivier Chapelle, Jason Weston, Cluster Kernels for Semi-Supervised Learning, Advances in Neural Information Processing Systems 15, pp.585-592, (2003).

Google Scholar