Classification based on specific rules and inexact coverage

作者:

Highlights:

摘要

Association rule mining and classification are important tasks in data mining. Using association rules has proved to be a good approach for classification. In this paper, we propose an accurate classifier based on class association rules (CARs), called CAR-IC, which introduces a new pruning strategy for mining CARs, which allows building specific rules with high confidence. Moreover, we propose and prove three propositions that support the use of a confidence threshold for computing rules that avoids ambiguity at the classification stage. This paper also presents a new way for ordering the set of CARs based on rule size and confidence. Finally, we define a new coverage strategy, which reduces the number of non-covered unseen-transactions during the classification stage. Results over several datasets show that CAR-IC beats the best classifiers based on CARs reported in the literature.

论文关键词:Data mining,Supervised classification,Class association rules,Association rule mining

论文评审过程:Available online 1 April 2012.

论文官网地址:https://doi.org/10.1016/j.eswa.2012.03.057