Region clustering based evaluation of multiple top-N selection queries

作者：

Highlights：

•

摘要

In many database applications, there are opportunities for multiple top-N queries to be evaluated at the same time. Often it is more cost effective to evaluate multiple such queries collectively than individually. In this paper, we propose a new method for evaluating multiple top-N queries concurrently over a relational database. The basic idea of this method is region clustering that groups the search regions of individual top-N queries into larger regions and retrieves the tuples from the larger regions. This method avoids having the same region accessed multiple times and reduces the number of random I/O accesses to the underlying databases. Extensive experiments are carried out to measure the performance of this new strategy and the results indicate that it is significantly better than the naïve method of evaluating these queries one by one for both low-dimensional (2, 3, and 4) and high-dimensional (25, 50, and 104) data.

论文关键词：Top-N query,Multiple queries evaluation,Region clustering

论文评审过程：Received 1 September 2006, Revised 30 July 2007, Accepted 4 September 2007, Available online 17 September 2007.

论文官网地址：https://doi.org/10.1016/j.datak.2007.09.004