An efficient algorithm for mining top-rank-k frequent patterns

作者:Thu-Lan Dam, Kenli Li, Philippe Fournier-Viger, Quang-Huy Duong

摘要

Mining top-rank-k frequent patterns is a popular data mining task, which consists of discovering the patterns in a transaction database that belong to the k first ranks in terms of support. Although, several algorithms have been proposed for this task, it remains computationally expensive. To address this issue, this paper proposes a novel algorithm named BTK. It relies on a novel tree structure named TB-tree to store crucial information about frequent patterns. Moreover, BTK employs a new B-list structure to store information about patterns, and relies on subsume indexes to reduce the search space and speed up the discovery of top-rank-k frequent patterns. BTK also uses an early pruning strategy and an effective threshold raising mechanism. Additionally, BTK introduces two efficient procedures for respectively generating subsume indexes and intersecting B-lists. Extensive experiments were conducted on several datasets to evaluate the efficiency of the proposed algorithm. Results show that BTK is highly efficient and competitive.

论文关键词:Pattern mining, Top-rank-k frequent patterns, TB-tree, B-list, N-list

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10489-015-0748-9