CanTree: a canonical-order tree for incremental frequent-pattern mining

Authors: Carson Kai-Sang Leung, Quamrul I. Khan, Zhan Li, Tariqul Hoque

Abstract

Since its introduction, frequent-pattern mining has been the subject of numerous studies, including studies of incremental updating. Many existing incremental mining algorithms are Apriori-based and are not easily adapted to FP-tree-based frequent-pattern mining. In this paper, we propose a novel tree structure, called CanTree (canonical-order tree), that captures the content of the transaction database and orders tree nodes according to some canonical order. Because of this ordering, the CanTree can be easily maintained when database transactions are inserted, deleted, and/or modified. For example, the CanTree does not require adjustment, merging, and/or splitting of tree nodes during maintenance. No rescan of the entire updated database or reconstruction of a new tree is needed for incremental updating. Experimental results show the effectiveness of our CanTree in the incremental mining of frequent patterns. Moreover, the applicability of CanTrees is not confined to incremental mining; CanTrees are also applicable to other frequent-pattern mining tasks, including constrained mining and interactive mining.
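The maintenance idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes lexicographic item order as the canonical order (the abstract only says "some canonical order"), and the names `Node`, `CanTreeSketch`, `insert`, `delete`, and `snapshot` are hypothetical. Pruning zero-count nodes on deletion is a choice made for this sketch. Because the item order is fixed in advance and does not depend on item frequencies, each transaction maps to one fixed path, so an update only touches the counts along that path.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, Optional


@dataclass
class Node:
    """One tree node: an item, its count, and children keyed by item."""
    item: Optional[str] = None
    count: int = 0
    children: Dict[str, "Node"] = field(default_factory=dict)


class CanTreeSketch:
    """Prefix tree whose paths follow a fixed canonical item order.

    Assumption: the canonical order is lexicographic order of item names.
    """

    def __init__(self) -> None:
        self.root = Node()

    @staticmethod
    def _canonical(transaction: Iterable[str]) -> list:
        # Deduplicate, then sort into the assumed canonical order.
        return sorted(set(transaction))

    def insert(self, transaction: Iterable[str]) -> None:
        # Walk (or create) the transaction's path, incrementing counts.
        node = self.root
        for item in self._canonical(transaction):
            node = node.children.setdefault(item, Node(item))
            node.count += 1

    def delete(self, transaction: Iterable[str]) -> None:
        # Assumes the transaction was inserted earlier; raises KeyError
        # before changing anything if its path is missing.
        path = []
        node = self.root
        for item in self._canonical(transaction):
            child = node.children[item]
            path.append((node, child))
            node = child
        for _, child in path:
            child.count -= 1
        # Sketch-level choice: drop nodes whose count reached zero.
        for parent, child in reversed(path):
            if child.count == 0:
                del parent.children[child.item]

    def snapshot(self, node: Optional[Node] = None) -> dict:
        # Nested dict view of the tree: item -> (count, subtree).
        if node is None:
            node = self.root
        return {
            item: (child.count, self.snapshot(child))
            for item, child in node.children.items()
        }


# Usage: insert three transactions, then delete one of them.
tree = CanTreeSketch()
for t in (["c", "a", "b"], ["b", "a"], ["d", "b"]):
    tree.insert(t)
tree.delete(["a", "b"])
```

In this sketch, inserting the same transactions in a different arrival order yields the same tree, which is the property that makes a rescan or rebuild unnecessary after an update.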

Keywords: Knowledge discovery and data mining, Tree structure, Frequent sets, Incremental mining, Constrained mining, Interactive mining

Paper URL: https://doi.org/10.1007/s10115-006-0032-8