Detecting Epistasis by LASSO-Penalized-Model Search Algorithm in Human Genome-Wide Association Studies

Article Preview

Abstract:

Extensive studies have shown that many complex diseases are influenced by interaction of certain genes, while due to the limitations and drawbacks of adopting logistic regression (LR) to detect epistasis in human Genome-Wide Association Studies (GWAS), we propose a new method named LASSO-penalized-model search algorithm (LPMA) by restricting it to a tuning constant and combining it with a penalization of the L1-norm of the complexity parameter, and it is implemented utilizing the idea of multi-step strategy. LASSO penalized regression particularly shows advantageous properties when the number of factors far exceeds the number of samples. We compare the performance of LPMA with its competitors. Through simulated data experiments, LPMA performs better regarding to the identification of epistasis and prediction accuracy.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 989-994)

Pages:

2426-2430

Citation:

Online since:

July 2014

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] Ozaki, K., et al.: Functional snps in the lymphotoxin-alpha gene that are associated with susceptibility to myocardial infarction. Nat. Genet. 32, 650–654. (2002).

DOI: 10.1038/ng1047

Google Scholar

[2] Wu J, Devlin B, Ringquist S, et al. Screen and clean: a tool for identifying interactions in genome-wide association studies. Genetic epidemiology, 34(3): 275-285. (2010).

DOI: 10.1002/gepi.20459

Google Scholar

[3] Cordell HJ. Epistasis: what it means, what it doesn't mean, and statistical methods to detect it in humans. Hum Mol Genet 11: 2463–2468. (2002).

DOI: 10.1093/hmg/11.20.2463

Google Scholar

[4] Shi W, Wahba G, Wright S, et al. LASSO Pattern search algorithm with applications to ophthalmology and genomic data. Statistics and Its Interface, 1: 137–153, (2008).

DOI: 10.4310/sii.2008.v1.n1.a12

Google Scholar

[5] Wright, F.A. et al. Simulating association studies: a data-based resampling method for candidate regions or whole genome scans. Bioinformatics, 23, 2581–2588. (2007).

DOI: 10.1093/bioinformatics/btm386

Google Scholar

[6] Wang Y, Liu X, Robbins K, Rekaya R. Antepiseeker: detecting epistatic interactions for case-control studies using a two-stage ant colony optimization algorithm. BMC research notes 3(1): 117. (2010).

DOI: 10.1186/1756-0500-3-117

Google Scholar

[7] Yung L S, Yang C, Wan X, et al. GBOOST: a GPU-based tool for detecting gene–gene interactions in genome–wide case control studies. Bioinformatics, 27(9): 1309-1310. (2011).

DOI: 10.1093/bioinformatics/btr114

Google Scholar

[8] Wang, Z. et al.: CEO: a Cloud Epistasis Computing model in GWAS. In Proceedings of IEEE International Conference on Bioinformatics and Biomedicine, IEEE Computer Society, p.85–90. (2010).

DOI: 10.1109/bibm.2010.5706542

Google Scholar