Contextualizing data warehouses with documents

作者:

Highlights:

摘要

Current data warehouse and OLAP technologies are applied to analyze the structured data that companies store in databases. The context that helps to understand data over time is usually described separately in text-rich documents. This paper proposes to integrate the traditional corporate data warehouse with a document warehouse, resulting in a contextualized warehouse. Thus, the user first selects an analysis context by supplying some keywords. Then, the analysis is performed on a novel type of OLAP cube, called an R-cube, which is materialized by retrieving and ranking the documents and corporate facts related to the selected context.

论文关键词:OLAP,Text-rich XML documents,Information retrieval

论文评审过程:Available online 7 February 2007.

论文官网地址:https://doi.org/10.1016/j.dss.2006.12.005