Using positional sequence patterns to estimate the selectivity of SQL LIKE queries
作者:
Highlights:
• We propose a new type of sequence pattern (i.e., positional sequence patterns).
• We introduce information content-based elimination of some patterns.
• We propose a slider-based partial pattern matching scheme.
• P-SPH decreases the error rate of selectivity estimations up to 20%.
摘要
•We propose a new type of sequence pattern (i.e., positional sequence patterns).•We introduce information content-based elimination of some patterns.•We propose a slider-based partial pattern matching scheme.•P-SPH decreases the error rate of selectivity estimations up to 20%.
论文关键词:Selectivity estimation,Histograms,Data management,Sequence pattern mining,Information content
论文评审过程:Received 3 March 2019, Revised 12 July 2020, Accepted 13 July 2020, Available online 19 July 2020, Version of Record 18 August 2020.
论文官网地址:https://doi.org/10.1016/j.eswa.2020.113762