Efficient derivation of numerical dependencies

作者:

Highlights:

摘要

Numerical dependencies (NDs) are database constraints that limit the number of distinct Y-values that can appear together with any X-value, where both X and Y are sets of attributes in a relation schema. While it is known that NDs are not finitely axiomatizable, there is no study on how to efficiently derive NDs using a set of sound (yet necessarily incomplete) rules. In this paper, after proving that solving the entailment problem for NDs using the chase procedure has exponential space complexity, we show that, given a set of inference rules similar to those used for functional dependencies, the membership problem for NDs is NP-hard. We then provide a graph-based characterization of NDs, which is exploited to design an efficient branch & bound algorithm for ND derivation. Our algorithm adopts several optimization strategies that provide considerable speed-up over a naïve approach, as confirmed by the results of extensive tests we made for efficiency and effectiveness using six different datasets.

论文关键词:Numerical dependency,Membership problem,Branch and bound algorithm,Projection cardinality estimation

论文评审过程:Received 10 October 2011, Revised 25 July 2012, Accepted 25 July 2012, Available online 9 August 2012.

论文官网地址:https://doi.org/10.1016/j.is.2012.07.007