Multidimensional normal forms for data warehouse design

作者:

Highlights:

摘要

A data warehouse is an integrated and time-varying collection of data derived from operational data and primarily used in strategic decision making by means of OLAP techniques. Although it is generally agreed that warehouse design is a non-trivial problem and that multidimensional data models as well as star or snowflake schemata are relevant in this context, there exist neither methods for deriving such a schema from an operational database nor measures for evaluating a warehouse schema. In this paper, a sequence of multidimensional normal forms is established that allow reasoning about the quality of conceptual data warehouse schemata in a rigorous manner. These normal forms address traditional database design objectives such as faithfulness, completeness, and freedom of redundancies as well as the notion of summarizability, which is specific to multidimensional database schemata.

论文关键词:Data warehouse design,Multidimensional data,Normal form,Summarizability,Functional dependency

论文评审过程:Received 4 September 2001, Revised 16 April 2002, Accepted 22 April 2002, Available online 12 June 2002.

论文官网地址:https://doi.org/10.1016/S0306-4379(02)00024-8