Schema versioning in data warehouses: Enabling cross-version querying via schema augmentation

作者:

Highlights:

摘要

As several mature implementations of data warehousing systems are fully operational, a crucial role in preserving their up-to-dateness is played by the ability to manage the changes that the data warehouse (DW) schema undergoes over time in response to evolving business requirements. In this paper we propose an approach to schema versioning in DWs, where the designer may decide to undertake some actions on old data aimed at increasing the flexibility in formulating cross-version queries, i.e., queries spanning multiple schema versions. First, we introduce a representation of DW schemata as graphs of simple functional dependencies, and discuss its properties. Then, after defining an algebra of schema graph modification operations aimed at creating new schema versions, we discuss how augmented schemata can be introduced to increase flexibility in cross-version querying. Next, we show how a history of versions for DW schemata is managed and discuss the relationship between the temporal horizon spanned by a query and the schema on which it can consistently be formulated.

论文关键词:Data warehousing,Schema versioning,Cross-version querying,Schema augmentation

论文评审过程:Received 4 September 2005, Accepted 21 September 2005, Available online 18 October 2005.

论文官网地址:https://doi.org/10.1016/j.datak.2005.09.004