Incomplete Nucleic Acid Sequences Visualization: A Case Study in Virus Sequences

Article Preview

Abstract:

Dengue virus (DENV) is one of the most widespread infectious diseases in the world, especially in the South East Asian regions. Transmitting the virus through mosquitoes, Dengue is an infectious viral borne disease. The virus sequences are assembled as series of nucleic acid, making the task of diagnosing virus sequences burdensome. Graphical representations are then proposed to represent Dengue virus to sustain the studies in virus sequences diagnosis. However, graphically representing sequences remained a crucified task especially for the incomplete genome sequences due to the missing nucleic acids. Although a number of studies provide methodologies on virus sequence visualization, in Dengue virus researches, those methodologies provide the visualization solely for complete genome sequences while neglecting the incomplete genome sequences. With the unaccommodating availabilities of research inputs, our study proposes a methodology for graphically representing the incomplete Dengue virus sequences, as well as complete virus sequences, by imputing in the incomplete part of a sequence with created reference sequences. The proposed methodology employs the use of database technology and majority voting technique to create reference sequences for each serotype of Dengue. Experimental results show that incomplete sequences are visualized realistically according to its respective serotype, thus providing flexibilities in Dengue virus researches to compensate incomplete sequences as inputs.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

637-640

Citation:

Online since:

August 2015

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2015 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] World Health Organization (WHO), Dengue and severe dengue, 2014. [Online]. Available: http: /www. who. int/mediacentre/factsheets/fs117/en.

Google Scholar

[2] E. Hamori and J. Ruskin, H curves, a novel method of representation of nucleotide series especially suited for long DNA sequences, J. Biol. Chem., vol. 258, no. 2, p.1318–1327, (1983).

DOI: 10.1016/s0021-9258(18)33196-x

Google Scholar

[3] M. A. Gates, Simpler DNA sequence representations, Nature, vol. 316, no. 6025, p.219, (1985).

DOI: 10.1038/316219a0

Google Scholar

[4] A. Nandy, Two-dimensional graphical representation of DNA sequences and intron-exon discrimination in intron-rich sequences, Comput. Appl. Biosci., vol. 12, no. 1, p.55–62, (1996).

DOI: 10.1093/bioinformatics/12.1.55

Google Scholar

[5] Stephen S.T. Yau, Jiasong Wong, Amir Niknejad, Chaoxiao Lu, Ning Jin, and Yee-kin Ho, DNA sequence representation without degeneracy, in Nucleic Acids Res. 31 (2003), pp.3078-3080, (2003).

DOI: 10.1093/nar/gkg432

Google Scholar

[6] B. Viriyasaksathian, Y. Wongsawat, and P. Suriyaphol, CADViST: Visualization tool for BLAST Alignment of Dengue virus sequences, in 2010 4th International Conference on Bioinformatics and Biomedical Engineering, (iCBBE ), (2010).

DOI: 10.1109/icbbe.2010.5515779

Google Scholar

[7] C. Temiyasathit and P. Thaithatkul, The analysis of Dengue virus serotypes via UnitX representation, in The 7th INFORMS Workshop on Data Mining and Health Informatics (DM-HI ), (2012).

Google Scholar