Pattern matching with address errors: Rearrangement distances

作者:

Highlights:

摘要

Historically, approximate pattern matching has mainly focused at coping with errors in the data, while the order of the text/pattern was assumed to be more or less correct. In this paper we consider a class of pattern matching problems where the content is assumed to be correct, while the locations may have shifted/changed. We formally define a broad class of problems of this type, capturing situations in which the pattern is obtained from the text by a sequence of rearrangements. We consider several natural rearrangement schemes, including the analogues of the ℓ1 and ℓ2 distances, as well as two distances based on interchanges. For these, we present efficient algorithms to solve the resulting string matching problems.

论文关键词:Approximate pattern matching,Swaps,Rearrangement errors,Address metrics

论文评审过程:Received 2 October 2007, Revised 26 February 2009, Available online 5 March 2009.

论文官网地址:https://doi.org/10.1016/j.jcss.2009.03.001