Microarray Expression Analysis Using Seed-Based Clustering Method


Article Preview

Clustering methods have been often used to find biologically relevant groups of genes or conditions based on their expression levels. Since many functionally related genes tend to be coexpressed, by identifying groups of genes with similar expression profiles, the functionalities of unknown genes can be inferred from those of known genes in the same group. In this paper we address a novel clustering approach, called seed-based clustering, where seed genes are first systematically chosen by computational analysis of their expression profiles, and then the clusters are generated by using the seed genes as initial values for k-means clustering. The seed-based clustering method has strong mathematical foundations and requires only a few matrix computations for seed extraction. As a result, it provides stability of clustering results by eliminating randomness in the selection of initial values for cluster generation. Our empirical results reported here indicate that the entire clustering process can be systematically pursued using seedbased clustering, and that its performance is favorable compared to current approaches.



Key Engineering Materials (Volumes 277-279)

Edited by:

Kwang Hwa Chung, Yong Hyeon Shin, Sue-Nie Park, Hyun Sook Cho, Soon-Ae Yoo, Byung Joo Min, Hyo-Suk Lim and Kyung Hwa Yoo




M. Y. Shin and S. H. Park, "Microarray Expression Analysis Using Seed-Based Clustering Method", Key Engineering Materials, Vols. 277-279, pp. 343-348, 2005

Online since:

January 2005




[1] M.B. Eisen, P.T. Spellman, P.O. Brown and D. Botstein, Cluster analysis and display of genome-wide expression patterns, Proc. Natl. Acad. Sci., 95: 14863-14868, (1998).

DOI: https://doi.org/10.1073/pnas.95.25.14863

[2] S. Tavazoie, J.D. Hughes, M.J. Campbell, R.J. Cho and G. M. Church, Systematic determination of genetic network architecture, Nature Genetics, 22: 281-285, (1999).

[3] P. Tamayo, D. Slonim, J. Mesirov, Q. Zhu, S. Kitareewan, E. Dmitrovsky, E.S. Lander and T. R. Golub, Interpreting patterns of gene expression with self-organizing maps: Methods and application to hematopoietic differentiation, Proc. Natl. Acad. Sci., 96: 2907-2912, (1999).

DOI: https://doi.org/10.1073/pnas.96.6.2907

[4] K.Y. Yeung and W.L. Ruzzo, Principle component analysis for clustering gene expression data, Bioinformatics, 17(9): 763-774, (2001).

DOI: https://doi.org/10.1093/bioinformatics/17.9.763

[5] K.Y. Yeung, D.R. Haynor and W. L. Ruzzo, Validating clustering for gene expression data, Bioinformatics, 17(4): 309-318, (2001).

DOI: https://doi.org/10.1093/bioinformatics/17.4.309

[6] J. Quackenbush, Computational analysis of microarray data, Nature Reviews Genetics, 2: 418422, June (2001).

[7] R.J. Cho, M.J. Campbell, E.A. Winzeler, L. Steinmetz, A. Conway, L. Wodicka, T.G. Wolfsberg, A.E. Gabrielian, D. Landsman, D.J. Lockhart, and R.W. Davis, A genome-wide transcriptional analysis of the mitotic cell cycle, Molecular Cell, 2: 65-73, (1998).

DOI: https://doi.org/10.1016/s1097-2765(00)80114-8

[8] http: /staff. washington. edu/kayee/cluster.