Predictability-based collective class association rule mining

作者:

Highlights:

摘要

Associative classification is rule-based involving candidate rules as criteria of classification that provide both highly accurate and easily interpretable results to decision makers. The important phase of associative classification is rule evaluation consisting of rule ranking and pruning in which bad rules are removed to improve performance. Existing association rule mining algorithms relied on frequency-based rule evaluation methods such as support and confidence, failing to provide sound statistical or computational measures for rule evaluation, and often suffer from many redundant rules. In this research we propose predictability-based collective class association rule mining based on cross-validation with a new rule evaluation step. We measure the prediction accuracy of each candidate rule in inner cross-validation steps. We split a training dataset into inner training sets and inner test sets and then evaluate candidate rules’ predictive performance. From several experiments, we show that the proposed algorithm outperforms some existing algorithms while maintaining a large number of useful rules in the classifier. Furthermore, by applying the proposed algorithm to a real-life healthcare dataset, we demonstrate that it is practical and has potential to reveal important patterns in the dataset.

论文关键词:Associative classification,Class association rules,Rule ranking,Rule pruning,Data mining

论文评审过程:Received 5 July 2016, Revised 12 February 2017, Accepted 13 February 2017, Available online 16 February 2017, Version of Record 21 February 2017.

论文官网地址:https://doi.org/10.1016/j.eswa.2017.02.024