Finding approximate occurrence of a pattern that contains gaps by the bit-vector approach

  • Lee, In-Bok (School of Computer Science and Engineering, Seoul National University) ;
  • Park, Kun-Soo (School of Computer Science and Engineering, Seoul National University)
  • Published : 2003.10.31

Abstract

The application of finding occurrences of a pattern that contains gaps includes information retrieval, data mining, and computational biology. As the biological sequences may contain errors, it is important to find not only the exact occurrences of a pattern but also approximate ones. In this paper we present an O(mnk$_{max}$/w) time algorithm for the approximate gapped pattern matching problem, where m is the length of the text, H is the length of the pattern, w is the word size of the target machine, and k$_{max}$ is the greatest error bound for subpatterns.

Keywords