Parametric Algorithms for Mining Share Frequent Itemsets

作者:Brock Barber, HOWARD J. HAMILTON

摘要

Itemset share, the fraction of some numerical total contributed by items when they occur in itemsets, has been proposed as a measure of the importance of itemsets in association rule mining. The IAB and CAC algorithms are able to find share frequent itemsets that have infrequent subsets. These algorithms perform well, but they do not always find all possible share frequent itemsets. In this paper, we describe the incorporation of a threshold factor into these algorithms. The threshold factor can be used to increase the number of frequent itemsets found at a cost of an increase in the number of infrequent itemsets examined. The modified algorithms are tested on a large commercial database. Their behavior is examined using principles of classifier evaluation from machine learning.

论文关键词:knowledge discovery, data mining, itemsets, association rule mining, share based measures

论文评审过程:

论文官网地址:https://doi.org/10.1023/A:1011276003319