A Kind of Classification Algorithms of Data Mining and Quantitative Analysis

Article Preview

Abstract:

C4.5, Bayesian network and Sequential Minimal Optimization (SMO) are three typical classification algorithms in data mining. Using cross-validation method with 10 folds get analysis and calculation results of the experiments for the three classification algorithms in the same training set and test set. The main metrics include accuracy, precision, speed, robustness, scalability and comprehensibility, we use margin curve show these. It provides a theoretical and experimental basis for users to select a proper classification algorithm with different training sets in quality and amount.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 655-657)

Pages:

963-968

Citation:

Online since:

January 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Fayyad, U M, Piatetsky Shapiro G, Smyth P. From Data Mining to Knowledge Discovery: an Overview. Advances in Knowledge Discovery and Data Mining, 1996: 1~34

DOI: 10.1007/s10618-006-0058-2

Google Scholar

[2] Shimin Lin, Fengzhan Tian:Construction and applications in data mining of bayesian networks. Journal of Tsinghua University(Science and Technology), 2001, 41(1):49~52. In Chinese

Google Scholar

[3] Jiawei Han, Micheline Kamber. Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, 2006: 285~370

Google Scholar

[4] O. Chapelle and V. Vapnik. Model selection for support vector machines, to appear in Advances in Neural Information Processing Systems, 2000, 12

Google Scholar

[5] Jianmin Li, Bo Zhang, Fuzong Lin:An Improvement Algorithm to Sequential Minimal Optimization. Journal of Software, 2003, 14(5):917~924. In Chinese

Google Scholar

[6] Keerthi S, Gilbert E. Convergence of a generalized SMO algorithm for SVM classifier design. Machine Learning, 2002, 46(1/3):351~360

Google Scholar

[7] Mark A. Hall, Geoffrey Holmes: Benchmarking Attribute Selection Techniques for Discrete Class Data Mining, IEEE Transactions on Knowledge and data engineering, 2003, 15(3)

DOI: 10.1109/tkde.2003.1245283

Google Scholar

[8] T. Sousa, A. Silva, A. Neves. A Particle Swarm Data Miner. Fernando Moura Pires, Salvador Abreu, 2003:43~53

Google Scholar