Query Size Estimation by Adaptive Sampling

作者:

Highlights:

摘要

We present an adaptive, random sampling algorithm for estimating the size of general queries. The algorithm can be used for any query D over a database D such that (1) for some n, the answer to L can be partitioned into n disjoint subsets L1, L2, ..., Ln, and (2) for 1 ≤ i ≤ n, the size of Li, is bounded by some function b(D, L), and (3) there is some algorithm by which we can compute the size of Li, where i is chosen randomly. We consider the performance of the algorithm on three special cases of the algorithm: join queries, transitive closure queries, and general recursive Datalog queries.

论文关键词:

论文评审过程:Available online 25 May 2002.

论文官网地址:https://doi.org/10.1006/jcss.1995.1050