Referential integrity quality metrics

Abstract

Referential integrity is an essential global constraint in a relational database that keeps the database in a complete and consistent state. In this work, we assume the database may violate referential integrity and that relations may be denormalized. We propose a set of quality metrics, defined at four granularity levels (database, relation, attribute and value), that measure referential completeness and consistency. The metrics are computed efficiently with standard SQL queries that incorporate two query optimizations: left outer joins on foreign keys and early foreign key grouping. Experiments evaluate the proposed metrics and SQL query optimizations on real and synthetic databases, showing that they can help in detecting and explaining referential errors.
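
As a rough illustration of the two query optimizations named in the abstract, the sketch below counts missing and dangling foreign key values for a single attribute. It groups the referencing table by the foreign key first and only then left outer joins the grouped rows to the referenced table. The tables `customer` and `orders`, the column names, the sample data and both function names are hypothetical, and the counts are illustrative only: the abstract does not give the paper's exact metric definitions.

```python
import sqlite3


def build_demo_db():
    """Create a small in-memory database with deliberate referential errors.

    Table and column names are hypothetical; no FOREIGN KEY constraint is
    declared, so invalid references can be stored.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT)")
    conn.execute("CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer_id INTEGER)")
    conn.executemany("INSERT INTO customer VALUES (?, ?)",
                     [(1, "Ann"), (2, "Bob"), (3, "Cho")])
    # customer_id 7 and 9 do not exist in customer; one reference is NULL.
    conn.executemany("INSERT INTO orders (customer_id) VALUES (?)",
                     [(1,), (1,), (2,), (None,), (7,), (7,), (9,)])
    return conn


def referential_error_counts(conn):
    """Return (null_refs, dangling_refs, total_rows) for orders.customer_id."""
    query = """
        SELECT
            COALESCE(SUM(CASE WHEN g.customer_id IS NULL THEN g.n ELSE 0 END), 0),
            COALESCE(SUM(CASE WHEN g.customer_id IS NOT NULL AND c.id IS NULL
                              THEN g.n ELSE 0 END), 0),
            COALESCE(SUM(g.n), 0)
        FROM (
            -- Early foreign key grouping: one row per distinct key value.
            SELECT customer_id, COUNT(*) AS n
            FROM orders
            GROUP BY customer_id
        ) AS g
        -- Left outer join keeps unmatched keys, with c.id set to NULL.
        LEFT OUTER JOIN customer AS c ON c.id = g.customer_id
    """
    return conn.execute(query).fetchone()


if __name__ == "__main__":
    print(referential_error_counts(build_demo_db()))  # (1, 3, 7)
```

Grouping before the join means the join processes one row per distinct foreign key value rather than one row per referencing tuple, which is presumably where the saving comes from when the foreign key has few distinct values.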

Keywords: Referential integrity, Foreign key, Metric, Database integration

Review history: Received 12 September 2006, Revised 7 May 2007, Accepted 18 June 2007, Available online 23 June 2007.

Official paper URL: https://doi.org/10.1016/j.dss.2007.06.004