Circular pattern matching with k mismatches

作者:

Highlights:

摘要

We consider the circular pattern matching with k mismatches (k-CPM) problem in which one is to compute the minimal Hamming distance of every length-m substring of T and any cyclic rotation of P, if this distance is no more than k. It is a variation of the well-studied k-mismatch problem. A multitude of papers has been devoted to solving the k-CPM problem, but only average-case upper bounds are known. In this paper, we present the first non-trivial worst-case upper bounds for this problem. Specifically, we show an O(nk)-time algorithm and an O(n+nmk4)-time algorithm. The latter algorithm applies in an extended way a technique that was very recently developed for the k-mismatch problem Bringmann et al. (2019) [10].A preliminary version of this work appeared at FCT 2019 [35]. In this version we improve the time complexity of the second algorithm from O(n+nmk5) to O(n+nmk4).

论文关键词:Circular pattern matching,k-mismatch problem,Approximate pattern matching

论文评审过程:Received 1 December 2019, Revised 1 July 2020, Accepted 19 July 2020, Available online 29 July 2020, Version of Record 5 August 2020.

论文官网地址:https://doi.org/10.1016/j.jcss.2020.07.003