Browse > Article

A Local Alignment Algorithm using Normalization by Functions  

Lee, Sun-Ho (서울대학교 컴퓨터공학부)
Park, Kun-Soo (서울대학교 컴퓨터공학부)
Abstract
A local alignment algorithm does comparing two strings and finding a substring pair with size l and similarity s. To find a pair with both sufficient size and high similarity, existing normalization approaches maximize the ratio of the similarity to the size. In this paper, we introduce normalization by functions that maximizes f(s)/g(l), where f and g are non-decreasing functions. These functions, f and g, are determined by experiments comparing DNA sequences. In the experiments, our normalization by functions finds appropriate local alignments. For the previous algorithm, which evaluates the similarity by using the longest common subsequence, we show that the algorithm can also maximize the score normalized by functions, f(s)/g(l) without loss of time.
Keywords
local alignment; normalization; normalization by functions; DNA sequence;
Citations & Related Records
연도 인용수 순위
  • Reference
1 P. van Emde Boas, R. Kass, and E. Zijlstra, Design and implementation of an efficient priority queue, Math. Systems Theory, 10, 99-127, 1977   DOI
2 D.B. Johnson, A priority queue in which initialization and queue operations take O(loglogD) time, Math. Systems Theory, 15, 295-309, 1982   DOI
3 P. Green, http://www.phrap.org
4 National Center for Biotechnology Information, http://www.ncbi.nlm.nih.gov
5 N. Efraty and G. M. Landau, Sparse normalized local alignment, In Proceedings of the 15th Annual Symposium on Combinatorial Pattern Matching (CPM), 333-346, 2004
6 D. Eppstein, Z. Galil, R. Giancarlo, and G. F. Italiano, Sparse dynamic programming I: linear cost functions, Journal of ACM, 39(3), 519-545, 1992   DOI
7 D. Gusfield, Algorithms on strings, trees, and sequences, Cambridge University Press, 1997
8 A.N. Arslan, O. Egecioglu, and P.A. Pevzner, A new approach to sequence comparison: normalized sequence alignment, Bioinformatics, 17(4), 327-337, 2001   DOI   ScienceOn