Tuple source relational model: A source-aware data model for multidatabases

Abstract

In some integration projects, complete integration of database instances may not be necessary. It may also be too costly, or even impossible, because of poor local data quality and insufficient instance-level knowledge. In this research, we study how multidatabases with global schemas should be represented and manipulated when the data instances from the local databases need not be fully integrated. We propose the tuple source (TS) relational model to represent multidatabases under such an integration requirement. This model extends the classical relational model by augmenting every relation with a source attribute that identifies the local database each tuple comes from. The source attribute can also be used to specify the right context in which to interpret global data instances. To manipulate TS relations, we have developed a set of tuple source relational algebraic operations and an extended SQL query language known as TS-SQL. With the TS relational model, flexible multidatabase queries that involve instances from different local databases can be formulated easily. In this paper, we also report our distributed query processing and optimization strategies and their implementation.
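
The idea of augmenting every relation with a source attribute can be illustrated with a minimal Python sketch. It is not the paper's TS algebra or TS-SQL: the function names `to_ts_relation` and `ts_select`, the attribute name `source`, the database names `DB1` and `DB2`, and the sample rows are all hypothetical, chosen only to show how unintegrated tuples from different local databases can coexist in one global relation and be filtered by origin.

```python
from typing import Callable, Dict, Iterable, List, Optional

Row = Dict[str, object]


def to_ts_relation(local_relations: Dict[str, List[Row]]) -> List[Row]:
    """Merge local relations into one global relation.

    Each tuple is copied and tagged with a hypothetical 'source'
    attribute naming the local database it came from. No attempt is
    made to reconcile or deduplicate tuples across sources.
    """
    ts_rows: List[Row] = []
    for source, rows in local_relations.items():
        for row in rows:
            tagged = dict(row)          # copy so local data is not mutated
            tagged["source"] = source   # the added source attribute
            ts_rows.append(tagged)
    return ts_rows


def ts_select(
    ts_rows: Iterable[Row],
    predicate: Callable[[Row], bool],
    sources: Optional[set] = None,
) -> List[Row]:
    """Select tuples satisfying the predicate, optionally restricted
    to a set of source databases. The source attribute is retained."""
    return [
        row
        for row in ts_rows
        if (sources is None or row["source"] in sources) and predicate(row)
    ]


# Illustrative data: the two local databases disagree about Alice, and
# the salary figures may follow different conventions in each source.
local = {
    "DB1": [{"name": "Alice", "salary": 5000}],
    "DB2": [{"name": "Alice", "salary": 65000}, {"name": "Bob", "salary": 48000}],
}

employees = to_ts_relation(local)
alice = ts_select(employees, lambda r: r["name"] == "Alice")
alice_db2 = ts_select(employees, lambda r: r["name"] == "Alice", sources={"DB2"})
```

In this sketch both Alice tuples survive in the global relation, and the source tag tells the user which local database's conventions to apply when reading each salary value. Restricting a query to particular sources is one simple way the tag can be exploited.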

Keywords: Database integration, Multidatabases, Relational data model, Distributed query processing

Review history: Received 7 October 1997, Revised 20 March 1998, Accepted 24 April 1998, Available online 1 March 1999.

Official URL: https://doi.org/10.1016/S0169-023X(99)00021-X