Inferring Functional Annotation for Human Genes from Gene-PDB Structure Mapping

Article Preview

Abstract:

Protein 3D structure is one of the main factors in reecting gene functions. The availability of protein structure data in Protein Data Bank (PDB) allows us to conduct gene function analysis based on protein structure data. However, the molecules in PDB, whose structures having been determined, are always not corresponding to a unique gene. That is to say, the mapping from a gene to the PDB is not one-to-one. This feature complicates the situation and increases the difculty of gene function analysis. In this paper, we attempt to tackle this problem and also study the problem of predicting gene function from protein structures based on the gene-PDB mapping. We rst obtain the gene-PDB mapping, which is used to represent a gene by the structure set of all its corresponding PDB molecules. We then dene a new gene-gene similarity measurement based on the structure similarity between PDB molecules, and we further show that this new measurement matches with the gene functional similarity. This means that the measurement we dened here can be used effectively for gene function prediction.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

391-396

Citation:

Online since:

August 2012

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2012 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] SGD project. Saccharomyces Genome Database, ftp: /ftp. yeastgenome. org/yeast/sequence similarity/yeast vs yeast.

Google Scholar

[2] Marcotte, E., Pellegrini, M., Thompson, M., Yeates, T. and Eisenberg, D. A combined algorithm for genome-wide prediction of protein function, Nature, 402 83-86, (1999).

DOI: 10.1038/47048

Google Scholar

[3] Pellgrini, M., Marcotte E.M., Thompson, M.J., Eisenberg,D. and Yeates, T. Assigning protein functions by comparative genome analysis: protein phylogenetic profiles Proc. Nat. Acad. Sci., 96 (8) 4285-4288, (1999).

DOI: 10.1073/pnas.96.8.4285

Google Scholar

[4] Rost B. Enzyme function less conserved than anticipated, J. Mol. Biol., 318 595-608, (2000).

DOI: 10.1016/s0022-2836(02)00016-5

Google Scholar

[5] Troyanskaya, O., Cantor, M., Sherlock, G., Brown, P., Hastie, T., Tibshirani, R., Botstein, D. and Altman, R. Missing value estimation methods for DNA microarray, Bioinformatics, 17 520-525, (2000).

DOI: 10.1093/bioinformatics/17.6.520

Google Scholar

[6] Brown, M., Grundy, W. N., Lin, D., Cristianini, N., Sugnet, C. W., Furey, T. S., Ares, M. Jr. and Haussler, D. Knowledge-based analysis of microarray gene expression data by using support vector machines, Proc. Natl. Acad. Sci., 97 262-267, (2000).

DOI: 10.1073/pnas.97.1.262

Google Scholar

[7] Li, L., Shiga, M., Ching, W. and Mamitsuka, H. Annotating gene functions with integrative spectral clustering on microarray expressions and sequences, Genome Informatics, 22 95-120, (2009).

DOI: 10.1142/9781848165786_0009

Google Scholar

[8] Shi, J. and Malik, J. Normalized cuts and image segmentation, IEEE Trans. on Pattern Analysis and Machine Intelligence, 22 888–905, (2000).

DOI: 10.1109/34.868688

Google Scholar

[9] Guimera, R. and Nunes Amaral, L. A. Functional cartographyof complex metabolic networks, Nature, 433(7028): 895–900, (2005).

DOI: 10.1038/nature03288

Google Scholar

[10] Newman, M. E. J. and Girvan, M. Finding and evaluating community structure in networks Phys. Rev. E, 69: 026-113, (2004).

Google Scholar

[11] Watts, D. and Strogatz, S. Collective dynamics of small-world, networks, Nature 393, 440-442, (1998).

DOI: 10.1038/30918

Google Scholar

[12] Barabasi, A. and Albert, R. Emergence of scaling in randomnetworks, Science, 286 509 - 512, (1999).

Google Scholar

[13] Song, C., Havlin, S. and Makse, H. Self-similarity of complexnetworks, Nature 433, 392-395, (2005).

DOI: 10.1038/nature03248

Google Scholar

[14] http: /hkumath. hku. hk/∼wkc/data/gene. PDBid. txt.

Google Scholar

[15] http: /hkumath. hku. hk/∼wkc/data/gene. struc. similarity. mat.

Google Scholar

[16] http: /hkumath. hku. hk/∼wkc/data/gene. goset. mat.

Google Scholar

[17] http: /hkumath. hku. hk/∼wkc/data/gene. func. similarity. mat.

Google Scholar

[18] http: /hkumath. hku. hk/∼wkc/data/go. distance. mat.

Google Scholar