Constructing the Bayesian network structure from dependencies implied in multiple relational schemas

作者:

Highlights:

摘要

Relational models are the most common representation of structured data, and acyclic database theory is important in relational databases. In this paper, we propose the method for constructing the Bayesian network structure from dependencies implied in multiple relational schemas. Based on the acyclic database theory and its relationships with probabilistic networks, we are to construct the Bayesian network structure starting from implied independence information instead of mining database instances. We first give the method to find the maximum harmoniousness subset for the multi-valued dependencies on an acyclic schema, and thus the most information of conditional independencies can be retained. Further, aiming at multi-relational environments, we discuss the properties of join graphs of multiple 3NF database schemas, and thus the dependencies between separate relational schemas can be obtained. In addition, on the given cyclic join dependency, the transformation from cyclic to acyclic database schemas is proposed by virtue of finding a minimal acyclic augmentation. An applied example shows that our proposed methods are feasible.

论文关键词:Relational data model,Bayesian network,Acyclic database schema,Harmoniousness multi-valued dependency set,Join dependency

论文评审过程:Available online 16 December 2010.

论文官网地址:https://doi.org/10.1016/j.eswa.2010.12.053