A survey on summarizability issues in multidimensional modeling

作者:

Highlights:

摘要

The development of a data warehouse (DW) system is based on a conceptual multidimensional model, which provides a high level of abstraction in accurately and expressively describing real-world situations. Once this model is designed, the corresponding logical representation must be obtained as the basis of the implementation of the DW according to one specific technology. However, even though a good conceptual multidimensional model is designed underneath a DW, there is a semantic gap between this model and its logical representation. In particular, this gap complicates an adequate treatment of summarizability issues, which in turn may lead to erroneous results of data analysis tools. Research addressing this topic has produced only partial solutions, and individual terminology used by different parties hinders further progress. Consequently, based on a unifying vocabulary, this survey sheds light on (i) the weak and strong points of current approaches for modeling complex multidimensional structures that reflect real-world situations in a conceptual multidimensional model and (ii) existing mechanisms to avoid summarizability problems when conceptual multidimensional models are being implemented.

论文关键词:Multidimensional modeling,Summarizability,Data warehouse,Data analysis

论文评审过程:Received 25 October 2008, Revised 3 July 2009, Accepted 6 July 2009, Available online 14 July 2009.

论文官网地址:https://doi.org/10.1016/j.datak.2009.07.010