Detecting coherent explorations in SQL workloads

作者:

Highlights:

• We propose an approach for segmenting a SQL workload into meaningful, coherent database explorations.

• We describe each query of a SQL workload with a set of features that are intrinsic to a query or that relate the query to its neighbor in the workload.

• We define a set of similarity indexes, exploiting query features, that quantify the similarity between contiguous queries from several points of view.

• We proposed three methods for workload segmentation: unsupervised learning, supervised learning and weak supervision.

摘要

•We propose an approach for segmenting a SQL workload into meaningful, coherent database explorations.•We describe each query of a SQL workload with a set of features that are intrinsic to a query or that relate the query to its neighbor in the workload.•We define a set of similarity indexes, exploiting query features, that quantify the similarity between contiguous queries from several points of view.•We proposed three methods for workload segmentation: unsupervised learning, supervised learning and weak supervision.

论文关键词:

论文评审过程:Received 9 July 2019, Revised 19 October 2019, Accepted 24 November 2019, Available online 9 December 2019, Version of Record 10 June 2020.

论文官网地址:https://doi.org/10.1016/j.is.2019.101479