Algorithm Based on Multiple Features for Recognition of E.coli Promoter Region

Article Preview

Abstract:

A promoter plays an important role in the transcription of DNA sequences; its recognition is very significant. This paper concerns about the problem of extracting information from a given DNA region and to make predictions whether it contains a promoter. We incorporate three types of information to access the significance of a promoter, information on content and information on structure, and then these information are used in a trained nonlinear classifier. The results show that our method performs well and achieve 13.4% average error rate on positive and negative data from non-coding regions and 17.0% average error rate on positive and negative data from coding regions.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

164-169

Citation:

Online since:

February 2011

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2011 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Leo Gordon, Alexey Ya. Chervonenkis, Alex J. Gammerman et al, Sequence alignment kernel for recognition of promoter region, Bioinformatics, Vol. 19 no. 15 2003, P1964-(1971).

DOI: 10.1093/bioinformatics/btg265

Google Scholar

[2] T.A. Brown, Genome[M], Beijing:Science Press, 2002, P201-P209.

Google Scholar

[3] Anason S. Haless, Dmitriy Leyfer et al. PromSer: a large-scale mammalian promoter and transcription start site identification service. Nucleic Acids Research, 2003, Vol 31, No. 13.

DOI: 10.1093/nar/gkg549

Google Scholar

[4] Bailey, T. L. and Elkan, C., Unsupervised learning of multiple motifs in biopolymers using expectation maximization. Mach. Learning, 1995, 21, 51–80.

DOI: 10.1007/bf00993379

Google Scholar

[5] Harley, C. and Reynolds, R., Analysis of E. coli promoter sequences. Nucleic Acids Res., 1987, 15, 2343–2361.

Google Scholar

[6] Demeler, B. and Zhou, G., Neural network optimization for E. coli promoter prediction. Nucleic Acids Res., 1991, 19, 1993-(1999).

Google Scholar

[7] O'Neill, M., Escherichia coli promoters: neural networks develop distinct descriptions in learning to search for promoters of different spacing classes. Nucleic Acids Res., 1992, 20, 3471–3477.

DOI: 10.1093/nar/20.13.3471

Google Scholar

[8] Pedersen, A., Baldi, P., Brunak, S. and Chauvin, Y., Characterization of prokaryotic and eukaryotic promoters using hidden markov models. In Proceedings of the 1996 Conference on Intelligent Systems for Molecular Biology, P182–191.

Google Scholar

[9] Simon Handley. Predicting Whether or Not a Nucleic Acid Sequences is an E. coli Promoter Region Using Genetic Programming. IEEE, 1995: 122~127.

Google Scholar

[10] Heladia Salgado, Socorro Gama-Castro, Agustino Martinez-Antonio et al, RegulonDB(version 4. 0): transcriptional regulation, operon organization and growth conditions in Escherichia coli K-12, Nucleic Acids Research, 2004, Vol 32, Database issue D303-D306.

DOI: 10.1093/nar/gkh140

Google Scholar

[11] Blattner, F., Plunkett, G., Bloch, C., et al., The complete genome sequence of Escherichia coli k-12. Science, 1997, 277, 1453–1462. Genbank /EMBL/DDBJ record: U00096.

Google Scholar