Histogram-by: A grouping operator for continuous domains

Abstract

In this paper, we propose a new operator, histogram-by, which provides grouping for continuous domains: it partitions records into groups according to given ranges of the target attributes. The histogram-by operator can be expressed as a histogram-by clause in an SQL statement and is readily amenable to query optimization. As an application of the operator, we introduce the multi-dimensional histogram query, which returns the aggregate values of all ranges specified by the histogram-by clause. To process this query efficiently, we propose algorithms based on aggregate R-trees. Our experimental results show that the algorithms perform reliably on both synthetic and real-world datasets.
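
To make the semantics of the multi-dimensional histogram query concrete, the following is a minimal sketch under stated assumptions, not the algorithms proposed in the paper. It scans the records linearly instead of using an aggregate R-tree, assumes SUM as the aggregate, and treats each range as half-open. The function name `histogram_by`, its parameters, and the attribute names `x`, `t`, and `sales` are hypothetical and chosen only for illustration.

```python
from bisect import bisect_right
from itertools import product


def histogram_by(records, boundaries, value_key):
    """Sum value_key over every cell formed by the per-attribute ranges.

    boundaries maps an attribute name to a sorted list of range edges;
    edges [e0, e1, e2] define the half-open ranges [e0, e1) and [e1, e2).
    Records outside the ranges of any attribute are ignored (an assumption).
    """
    # Build the ranges of each attribute, then every combination (cell).
    ranges = [list(zip(edges[:-1], edges[1:])) for edges in boundaries.values()]
    totals = {cell: 0.0 for cell in product(*ranges)}

    for rec in records:
        cell = []
        for attr, edges in boundaries.items():
            # Index of the range containing the value, via binary search.
            idx = bisect_right(edges, rec[attr]) - 1
            if idx < 0 or idx >= len(edges) - 1:
                cell = None  # value lies outside all ranges of this attribute
                break
            cell.append((edges[idx], edges[idx + 1]))
        if cell is not None:
            totals[tuple(cell)] += rec[value_key]
    return totals


# Hypothetical data: two grouping attributes (x, t) and one measure (sales).
records = [
    {"x": 1.5, "t": 0.2, "sales": 10.0},
    {"x": 1.9, "t": 0.7, "sales": 5.0},
    {"x": 7.0, "t": 1.5, "sales": 2.5},
    {"x": 12.0, "t": 0.5, "sales": 99.0},  # outside the x ranges, ignored
]
boundaries = {"x": [0, 5, 10], "t": [0, 1, 2]}
result = histogram_by(records, boundaries, "sales")
```

The sketch returns one aggregate per cell, including empty cells, which mirrors the statement that the query returns aggregate values of all specified ranges. Its cost grows with the number of records; the aggregate R-tree approach named in the abstract is meant to avoid exactly this full scan.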

Keywords: Histogram-by, Multi-dimensional histogram query, Aggregate R-tree, Data warehouse, Spatio-temporal database

Review history: Received 31 October 2005, Revised 31 October 2005, Accepted 20 March 2006, Available online 18 April 2006.

Paper URL: https://doi.org/10.1016/j.datak.2006.03.007