A model-based evaluation of data quality activities in KDD

Authors:

Highlights:

• MRDQA: a model-based approach for supporting the data quality task in KDD.

• Evaluation of quality requirements of weakly-structured data via model-checking.

• A fine-grained quality analysis of the effectiveness of cleansing procedures.

• Automatic identification of error-patterns and interactive visualisation.

• Experiments conducted on a real-world scenario, with the data made publicly available.


Keywords: Data quality, Data cleansing, Model checking, Real-life application

Review history: Received 30 October 2013, Revised 12 May 2014, Accepted 24 July 2014, Available online 30 September 2014.

Official paper URL: https://doi.org/10.1016/j.ipm.2014.07.007