Space-efficient cubes for OLAP range-sum queries

Authors:

Abstract

Data cubes support a powerful data analysis method called the range-sum query. The range-sum query is widely used to find trends and to discover relationships among attributes in diverse database applications. A range-sum query computes aggregate information over an online analytical processing (OLAP) data cube within specified query ranges. Existing techniques for range-sum queries on data cubes use an additional cube, called the prefix sum cube (PC), to store the cumulative sums of the data, which causes a high space overhead. This space overhead not only leads to extra costs for storage devices, but also causes additional propagation of updates and longer access times on physical devices. In this paper, we present a new cube representation called ‘the PC Pool’, which drastically reduces the space of the PC in a large data warehouse. The PC Pool also decreases the update propagation caused by the dependency between values in cells of the PC. We develop an effective algorithm that finds dense sub-cubes in a large data cube. We perform extensive experiments with diverse data sets, and examine the space reduction and performance of our proposed method with respect to various dimensions of the data cube and query sizes. Experimental results show that our method reduces the space of the PC while maintaining reasonable query performance.
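
To make the prefix sum cube (PC) idea concrete, the following is a minimal two-dimensional sketch of the conventional technique the abstract refers to: a PC that stores cumulative sums, a range-sum query answered by inclusion-exclusion over PC cells, and a count of how many PC cells depend on a single data cell. It is not the PC Pool itself. The function names (`build_prefix_cube`, `range_sum`, `cells_touched_by_update`) and the sample data are assumptions made for illustration and do not come from the paper.

```python
def build_prefix_cube(cube):
    """Return P where P[i][j] is the sum of cube[0..i][0..j] (inclusive)."""
    rows, cols = len(cube), len(cube[0])
    prefix = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        running = 0  # cumulative sum along the current row
        for j in range(cols):
            running += cube[i][j]
            above = prefix[i - 1][j] if i > 0 else 0
            prefix[i][j] = running + above
    return prefix


def range_sum(prefix, lo, hi):
    """Sum of the original cells in the inclusive box lo=(r1, c1) .. hi=(r2, c2).

    Uses inclusion-exclusion over at most four PC cells.
    """
    (r1, c1), (r2, c2) = lo, hi

    def p(i, j):
        # Cells outside the cube (index -1) contribute nothing.
        return prefix[i][j] if i >= 0 and j >= 0 else 0

    return p(r2, c2) - p(r1 - 1, c2) - p(r2, c1 - 1) + p(r1 - 1, c1 - 1)


def cells_touched_by_update(shape, cell):
    """Number of PC cells whose value depends on the given data cell."""
    rows, cols = shape
    r, c = cell
    return (rows - r) * (cols - c)


# Hypothetical 3x3 data cube, for illustration only.
cube = [
    [3, 5, 1],
    [7, 3, 2],
    [2, 4, 2],
]
pc = build_prefix_cube(cube)
total = range_sum(pc, (1, 1), (2, 2))            # 3 + 2 + 4 + 2
fanout = cells_touched_by_update((3, 3), (0, 0))  # every PC cell depends on cell (0, 0)
```

In this sketch the PC has as many cells as the data cube even when most data cells are empty, and changing the cell at the origin forces every PC cell to be rewritten. These are the space overhead and update propagation that the abstract attributes to the PC.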

Keywords: Range-sum query, OLAP, Data cube, Clustering

Review history: Received 1 September 2002, Accepted 1 December 2002, Available online 11 February 2003.

Paper URL: https://doi.org/10.1016/S0167-9236(03)00003-4