Ensemble Data Classification based on Diversity of Classifiers Optimized by Genetic Algorithm



In this research we propose an ensemble classification technique based on building classifiers with a variety of techniques, such as decision trees, support vector machines, and neural networks, then selecting an appropriate subset of those classifiers with a genetic algorithm and combining them by majority vote in order to increase classification accuracy. In classification accuracy tests on the Australian Credit, German Credit, and Bankruptcy datasets, we found that the ensemble classification models selected by the genetic algorithm yield the highest performance and that our algorithms are effective in building ensembles.
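The selection step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the bit-mask encoding, the fitness function (accuracy of the majority vote on held-out labels), the GA settings, and all names (`majority_vote`, `fitness`, `ga_select`, the toy `preds` and `y_val`) are assumptions made for illustration. Each base classifier is represented only by its list of predicted labels, so the sketch does not depend on how the decision trees, support vector machines, or neural networks were trained.

```python
import random
from collections import Counter


def majority_vote(member_preds):
    """Combine per-classifier label lists into one voted label per sample."""
    combined = []
    for votes in zip(*member_preds):
        # Ties go to the label that appears first among the votes.
        combined.append(Counter(votes).most_common(1)[0][0])
    return combined


def fitness(mask, all_preds, y_true):
    """Accuracy of the majority vote over the classifiers selected by mask."""
    chosen = [p for bit, p in zip(mask, all_preds) if bit]
    if not chosen:
        return 0.0  # an empty ensemble cannot classify anything
    voted = majority_vote(chosen)
    return sum(v == t for v, t in zip(voted, y_true)) / len(y_true)


def ga_select(all_preds, y_true, pop_size=20, generations=30, p_mut=0.1, seed=0):
    """Search for a 0/1 mask over classifiers (assumed GA settings)."""
    rng = random.Random(seed)
    n = len(all_preds)

    def score(mask):
        return fitness(mask, all_preds, y_true)

    pop = [[rng.randint(0, 1) for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=score, reverse=True)
        next_pop = pop[:2]  # elitism: carry the two best masks over unchanged
        while len(next_pop) < pop_size:
            a, b = rng.sample(pop[: pop_size // 2], 2)  # parents from top half
            cut = rng.randint(1, n - 1)                 # one-point crossover
            child = a[:cut] + b[cut:]
            # Flip each bit with probability p_mut.
            child = [bit ^ 1 if rng.random() < p_mut else bit for bit in child]
            next_pop.append(child)
        pop = next_pop
    return max(pop, key=score)


# Toy validation labels and predictions from four hypothetical base classifiers.
y_val = [0, 1, 1, 0, 1, 0, 1, 1]
preds = [
    [0, 1, 1, 0, 1, 0, 1, 0],  # e.g. a decision tree
    [0, 1, 0, 0, 1, 0, 1, 1],  # e.g. a support vector machine
    [1, 1, 1, 0, 1, 0, 0, 1],  # e.g. a neural network
    [1, 0, 0, 1, 0, 1, 0, 0],  # a deliberately weak classifier
]
best = ga_select(preds, y_val)
print(best, fitness(best, preds, y_val))
```

In practice the mask would be scored on a validation split separate from the data used to report final accuracy; otherwise the selection step is tuned to the very samples it is evaluated on.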



Advanced Materials Research (Volumes 433-440)

Edited by:

Cai Suo Zhang




D. Thammasiri and P. Meesad, "Ensemble Data Classification based on Diversity of Classifiers Optimized by Genetic Algorithm", Advanced Materials Research, Vols. 433-440, pp. 6572-6578, 2012

Online since:

January 2012



