Browse > Article

Sequence Alignment Algorithm using Quality Information  

Na, Joong-Chae (서울대학교 전기, 컴퓨터공학부)
Roh, Kang-Ho (서울대학교 전기, 컴퓨터공학부)
Park, Kun-Soo (서울대학교 전기, 컴퓨터공학부)
Abstract
In this Paper we consider the problem of sequence alignment with quality scores. DNA sequences produced by a base-calling program (as part of sequencing) have quality scores which represent the confidence level for individual bases. However, previous sequence alignment algorithms do not consider such quality scores. To solve sequence alignment with quality scores, we propose a measure of an alignment of two sequences with orality scores. We show that an optimal alignment in this measure can be found by dynamic programming.
Keywords
Sequence alignment; dynamic programming; bioinformatics; DNA sequence comparison;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Zhang, Z., Berman, P., Wiehe, T. and Miller, W., 'Post-processing long pairwise alignments,' Bioinformatics 15(2), pp. 1012-1019, 1999   DOI
2 Arslan, A., Egecioglu, O. and Pevzner P., 'A new approach to sequence comparison: Normalized sequence alignment,' Bioinformatics 17(4), pp. 327-337, 2001   DOI   ScienceOn
3 Smith, T.F. and Waterman, M.S., Identification of Common Molecular Biology, PWS Publishing Company, 1997
4 Crochemore M., Landau, G. and Ziv-Ukelson, M., 'A sub-quadratic sequence alignment algorithm for unrestricted cost matrices,' In 13th Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 679-688, 2002
5 Ewing, B., Hillier, L., Wendl, M.C. and Green, P., 'Base-calling of automated sequencer traces using phred. I. accuracy assessment,' Genome Research 8(3), pp. 175-185, 1998
6 Apostolico, A. and Giancarlo, R., 'Sequence Alignment in Molecular Biology,' Journal of Computational Biology 5(2), pp. 173-196, 1998   DOI   ScienceOn
7 Gusfield, D., 'Efficient methods for multiple sequence alignment with guaranteed error bounds,' Bulletin of Mathematical Biology 55, pp. 141-154, 1993   DOI
8 Pevzner, P., Computational Molecular Biology: An Algorithmic Approach, The MIT Press, 2000
9 Waterman, M.S., Introduction to Computational Biology, Champman and Hall, 1995
10 Gusfield, D., Algorithms on Strings, Trees and Sequences: Computer science and Computational Biology, Cambridge University Press, 1997
11 Hubbard, T., Lesk, A. and Tramontano, A., 'Gathering them into the fold,' Nature Structural Biology 4, pp 313, 1996   DOI   ScienceOn
12 Green, P., Documentation for phrap, Genome Center, University of Washington, http://www.phrap.org/phrap.docs/phrap.html
13 Batzoglou, S., Jaffe, D., Stanley, K., Butler, J., Gnerre, S., Mauceli, E., Berger, B., Mesirov, J. and Lander E., 'Arachne: A whole-genome shotgun assembler,' Genome Research 12, pp. 177-189, 2002   DOI   ScienceOn
14 Jaffe, D., Butler, J., Gnerre, S., Mauceli, E., Lindblan-Toh, K., Mesirov, J., Zody, M. and Lander E., 'Whole-genome sequence assembly for mammalian genomes: Arachne 2,' Genome Research 13, pp. 91-96, 2003   DOI   ScienceOn
15 Gotoh, O., 'An improved algorithm for matching biological sequences,' Journal of Molecular Biology 162, pp. 705-504, 1982   DOI
16 Needleman, S.B. and Wunsch, C.D., 'A general method applicable to the search for similarities in the amino acid sequences of two proteins,' Journal of Molecular Biology 48, pp. 443-453, 1970   DOI