Gene Expression Programming Based Classification for Automated Birth Defects Detection

Article Preview

Abstract:

With the rapid development of digital medicine, improving the diagnostic accuracy for birth defects (BD) by using data mining techniques has been paid more attentions by researchers. In this paper, an automated classification technique based on Gene Expression Programming (GEP) to detect the defect infants, named Birth Defects Detection based on Gene Expression Programming (BDD-GEP) is proposed. The main contributions of this paper include: (1) proposing two contrast inequalities (CIs) for birth defects detection: the defection contrasts to normal and the normal contrasts to defection, (2) designing a new fitness function to mine the normal and defect CIs by GEP, (3) presenting a method to select useful CIs for classification, (4) implementing the BDD-GEP algorithm through combining the proposed CIs with k-Nearest Neighbor algorithm. In order to evaluate the proposed classification method, 11,897 infant samples from national center for birth defects monitoring of China were used, and the method was compared with several existing classification methods. The experimental results show that the overall detection accuracy of BDD-GEP was as high as 87.8%. Specifically, the F-measure of the detect samples was about 70.2%, and the F-measure of the normal samples was about 92.3%.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

508-514

Citation:

Online since:

September 2012

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2012 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Z. Li, A. Ren, L. Zhang, Z. Guo, Z. Li, A population-based case-control study of risk factors for neural tube defects in four high-prevalence areas of Shanxi province China, Paediatr Perinat Epidemiol , Vol. 20, 1, (2006), pp.43-53.

DOI: 10.1111/j.1365-3016.2006.00694.x

Google Scholar

[2] Ministry of Health of the People's Republic of China, Annual state health statistic data (1999), http: /www. moh. gov. cn.

Google Scholar

[3] CDC, 2008a, Birth Defects Research. Available from: http: /www. cdc. gov/ncbddd/bd/research. htm, (Accessed 05 August 2008).

Google Scholar

[4] C. Ferreira, Gene Expression Programming: A New Adaptive Algorithm for Solving Problems, Complex Systems. Vol. 13, 2 (2001), pp.87-129.

Google Scholar

[5] L. Duan, J. Zuo, T. Zhang, J. Peng, J. Gong, Mining Contrast Inequalities in Numeric Dataset, (Proc. of WAIM 2010), LNCS. Vol. 6184, pp.194-205.

DOI: 10.1007/978-3-642-14246-8_21

Google Scholar

[6] C. Zhou, P.C. Nelson, W. Xiao, and T.M. Tirpak, Discovery of Classification Rules by Using Gene Expression Programming, Proc. of Artificial Intelligence, Las Vegas(2002), pp.1355-1361.

Google Scholar

[7] V.K. Karakasis, A. Stafylopatis, Efficient Evolution of Accurate Classification Rules Using a Combinnation of Gene Expression Programming and Clonal Selection, (Proc. of Evolutionary Computation 2008), 12 (6), pp.662-678.

DOI: 10.1109/tevc.2008.920673

Google Scholar

[8] Z. Jie, T. Changjie, L. Chuan, C. Anlong, and Y. Chang'an, Time Series Prediction based on Gene Expression Programming, (Pro. of WAIM 2004), pp.55-64.

Google Scholar

[9] J. Skaruz, F. Seredynski, Detecting web application attacks with use of Gene Expression Programming, (Proc. of Evolutionary Computation 2009), p.2029-(2035).

DOI: 10.1109/cec.2009.4983190

Google Scholar

[10] Hemmi I, Bayesian estimation of the incidence rate in birth defects monitoring, Cong. Anom (1988), 28 (2), pp.103-109.

DOI: 10.1111/j.1741-4520.1988.tb00583.x

Google Scholar

[11] X. Li, J. Wang, Y. Liao, B. Meng, X. Zhang, A geological analysis for the environmental cause of human birth defects bansed on GIS, Toxicological and Environmental Chemistry (2006), 88 (3), pp.551-559.

DOI: 10.1080/00222930600803282

Google Scholar

[12] H. Bai, Y. Ge, J. Wang and Y. Liao, Using rough set theory to identify villages affected by birth defects: the example of Heshun, Shanxi, China, International Journal of Geographical Information Science (2010), 24 (4), pp.559-576.

DOI: 10.1080/13658810902960079

Google Scholar

[13] Y. Liao, J. Wang, Y. Guo, X. Zheng, Risk assessment of human neural tube defects using a Bayesian belif network, Springer, Published online, 12 February, (2009).

Google Scholar

[14] G. Dong and J. Li. Efficient mining of emerging patterns: Discovering trends and differences. In KDD, pp.43-52, (1999).

Google Scholar

[15] D. Aha, D. Kibler, Instance-based learing algorithms, Machine Learning (1991), 6, pp.37-66.

Google Scholar