Using positional sequence patterns to estimate the selectivity of SQL LIKE queries

作者:

Highlights:

• We propose a new type of sequence pattern (i.e., positional sequence patterns).

• We introduce information content-based elimination of some patterns.

• We propose a slider-based partial pattern matching scheme.

• P-SPH decreases the error rate of selectivity estimations up to 20%.

摘要

•We propose a new type of sequence pattern (i.e., positional sequence patterns).•We introduce information content-based elimination of some patterns.•We propose a slider-based partial pattern matching scheme.•P-SPH decreases the error rate of selectivity estimations up to 20%.

论文关键词:Selectivity estimation,Histograms,Data management,Sequence pattern mining,Information content

论文评审过程:Received 3 March 2019, Revised 12 July 2020, Accepted 13 July 2020, Available online 19 July 2020, Version of Record 18 August 2020.

论文官网地址:https://doi.org/10.1016/j.eswa.2020.113762