The effect of threshold values on association rule based classification accuracy

作者:

Highlights:

摘要

Classification Association Rule Mining (CARM) systems operate by applying an Association Rule Mining (ARM) method to obtain classification rules from a training set of previously classified data. The rules thus generated will be influenced by the choice of ARM parameters employed by the algorithm (typically support and confidence threshold values). In this paper we examine the effect that this choice has on the predictive accuracy of CARM methods. We show that the accuracy can almost always be improved by a suitable choice of parameters, and describe a hill-climbing method for finding the best parameter settings. We also demonstrate that the proposed hill-climbing method is most effective when coupled with a fast CARM algorithm such as the TFPC algorithm which is also described.

论文关键词:KDD,Data mining,Classification rule mining,Classification association rule mining

论文评审过程:Received 30 September 2005, Revised 6 January 2006, Accepted 8 February 2006, Available online 20 March 2006.

论文官网地址:https://doi.org/10.1016/j.datak.2006.02.005