MRCluster: Mining Constant Row Bicluster in Gene Expression Data

Abstract:

Biclustering is an important technique for gene expression data analysis. A bicluster is a set of genes that are coherently expressed across a set of biological conditions. Various biclustering algorithms have been proposed to find biclusters of different types, but most of them are not efficient. We propose a novel algorithm, MRCluster, to mine constant row biclusters from real-valued datasets. MRCluster uses the Apriori property and several novel pruning techniques to mine biclusters efficiently. We compare our algorithm with a recent approach, RAP, and experimental results show that MRCluster is much more efficient than RAP in mining biclusters with constant rows from real-valued gene expression data.
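To make the notion of a constant row bicluster concrete, the following is a minimal sketch and not the MRCluster algorithm itself. It assumes a bicluster counts as "constant row" when each selected gene's expression values stay within a fixed absolute tolerance across the selected conditions. The function name, the tolerance parameter, and the toy matrix are all hypothetical, and the abstract does not state which coherence measure the paper actually uses. Under this assumed criterion, removing a condition from a valid bicluster can never make it invalid, which is the kind of anti-monotone (Apriori-style) behaviour that makes pruning possible.

```python
from typing import Sequence


def is_constant_row_bicluster(
    matrix: Sequence[Sequence[float]],
    genes: Sequence[int],
    conditions: Sequence[int],
    tolerance: float = 0.1,  # hypothetical threshold, not taken from the paper
) -> bool:
    """Return True if every selected gene (row) is nearly constant
    across the selected conditions (columns)."""
    for g in genes:
        values = [matrix[g][c] for c in conditions]
        # Assumed criterion: the row's value range must not exceed the tolerance.
        if max(values) - min(values) > tolerance:
            return False
    return True


# Toy expression matrix: rows are genes, columns are conditions.
expression = [
    [1.00, 1.05, 3.00],
    [2.00, 2.02, 2.01],
    [0.50, 4.00, 0.55],
]

# Genes 0 and 1 are nearly constant over conditions 0 and 1.
small = is_constant_row_bicluster(expression, [0, 1], [0, 1])

# Adding condition 2 breaks gene 0, so the larger candidate fails.
large = is_constant_row_bicluster(expression, [0, 1], [0, 1, 2])
```

Because a failing candidate stays invalid when further conditions are added, a level-wise search under this assumption can discard it and every superset of its condition set without testing them.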

Pages: 628-633

Online since: October 2011

Copyright: © 2012 Trans Tech Publications Ltd. All Rights Reserved
