Using Stratified Sample and Grid Search to Improve Disease Prediction Accuracy of SVM

Article Preview

Abstract:

SVM (Support Vector Machine) is a powerful data mining algorithm, and is mainly used to finish classification or regression tasks. In this literature, SVM is used to conduct disease prediction. We focus on integrating with stratified sample and grid search technology to improve the classification accuracy of SVM, thus, we propose an improved algorithm named SGSVM: Stratified sample and Grid search based SVM. To testify the performance of SGSVM, heart-disease data from UCI are used in our experiment, and the results show SGSVM has obvious improvement in classification accuracy, and this is very valuable especially in disease prediction.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

644-647

Citation:

Online since:

February 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] L. H. Witten, E. Frank, M. A. Hall: Data Mining: practical machine learning tools and techniques. -3rd ed. (Elsevier, Burlington 2011).

DOI: 10.1016/b978-0-12-374856-0.00015-8

Google Scholar

[2] X. D. Wu, V. Kumar: The Top Ten Algorithms in Data Mining (CRC Press, New York 2009).

Google Scholar

[3] V. Vapnik. The Nature of Statistical Learning Theory. (Springer, Verlag 1995).

Google Scholar

[4] V. vapnik., Statistical Learning Theory. (Wiley, New York 1998).

Google Scholar

[5] H.L. Chen, B. Yang, G. Wang: J Med Syst. 36: 2505-2519 (2012).

Google Scholar

[6] A.K. Hens, M.K. Tiwari: Expert Systems with Applications, 39(2012) 6774-6781. Fig. 2 Fig. 3 Table 2. Ten times classification accuracy results corresponding to stratified sample and random sample based on grid search technology.

Google Scholar