Bacteria Genome Compression Based on the Weighted Context Model

Article Preview

Abstract:

The genome compression algorithm based on the weighted context model is present in this paper. In this algorithm, the method to construct the weighted context model is discussed. The adaptive code length of the context models are used to determine the values of the corresponding weights of models. In the coding process, the values of these weights are modified in order to achieve the minimum code length and to enhance the compression efficiency. At last, the proposed algorithm is used to the bacteria genome compression. The experiments results indicate that the algorithm present could produce better compression result than the results by other algorithms.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 989-994)

Pages:

1561-1565

Citation:

Online since:

July 2014

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] S. Grumbach and F. Tahi, Compression of DNA sequences, in Proc. of the Data Compression Conf. DCC-93, Snowbird, Utah, p.340–350, (1993).

DOI: 10.1109/dcc.1993.253115

Google Scholar

[2] S. Grumbach and F. Tahi. A new challenge for compression algorithms: Genetic sequences. Inf. Process. Manage, 30(6): 866–875, (1994).

DOI: 10.1016/0306-4573(94)90014-0

Google Scholar

[3] X. Chen, S. Kwong, and M. Li. A compression algorithm for DNA sequences and its applicationsin genome comparison. RECOMB, page 107, (2000).

Google Scholar

[4] T. Matsumoto, K. Sadakane, and H. Imai, Biological sequence compression algorithms, GenomeInformatics, Vols. 11, p.43–52, (2000).

Google Scholar

[5] M. Cao, T.I. Dix, L. Allison, and C. Mears, A simple statistical algorithm for biological sequence com-pression, in Proc. of the Data Compression Conf, DCC-2007, Snowbird, Utah, (2007).

DOI: 10.1109/dcc.2007.7

Google Scholar

[6] Armando J. Pinho etc, Bacteria DNA Sequence Compression using a mixture of finite-context models, IEEE Statistical Signal Processing Workshop, Portugal, pp.125-128, (2011).

DOI: 10.1109/ssp.2011.5967637

Google Scholar

[7] Min Chen, Jianhua Chen, Affinity propagation for the Context quantization, Advanced Materials Research, Vols. 791, pp.1533-1536, (2013).

DOI: 10.4028/www.scientific.net/amr.791-793.1533

Google Scholar

[8] Min Chen, Fuyan Wang, Context quantization based on the modified K-means clustering, Advanced Materials Research, Vols. 756, pp.4068-4072, (2013).

DOI: 10.4028/www.scientific.net/amr.756-759.4068

Google Scholar