UBP-Miner: An efficient bit based high utility itemset mining algorithm

Authors:

Highlights:

Abstract

High utility itemset mining (HUIM) is a key problem in data mining. Its goal is to find itemsets that have high importance or profit in a database, yielding knowledge that can support decision-making. In recent years, many HUIM algorithms have been proposed. Among them, utility-list-based algorithms have become very popular because they are efficient and easy to extend. Although several improvements have been made, efficiency remains a critical issue. To address this problem, this paper improves the utility-list construction process, a key operation that has received little attention in prior work. A novel set of bitwise operations called BEO (Bit mErge cOnstruction) is proposed to speed up the construction process, and a novel data structure called UBP (Utility Bit Partition) is designed to support BEO. This structure is integrated into a novel algorithm, UBP-Miner, which also applies several search space reduction strategies. Experimental results show that UBP-Miner is faster than several state-of-the-art algorithms, such as HUI-Miner* and ULB-Miner, on common benchmark datasets.
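The abstract does not spell out how BEO or UBP work, so the following is only a general illustration of why bitwise operations can help utility-list construction, not the paper's method. It assumes a simplified utility list for single items, where a transaction-ID set is stored as an integer bitmask and a single bitwise AND finds the transactions shared by two itemsets. The names `BitUtilityList` and `join`, and the toy data, are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class BitUtilityList:
    """Hypothetical, simplified utility list (not the paper's UBP structure)."""
    tid_bits: int            # bit t is set if transaction t contains the itemset
    iutil: Dict[int, int]    # tid -> utility of the itemset in that transaction
    rutil: Dict[int, int]    # tid -> remaining utility of later items


def join(px: BitUtilityList, py: BitUtilityList) -> BitUtilityList:
    """Join the lists of two single items x < y into the list of {x, y}."""
    common = px.tid_bits & py.tid_bits   # one AND finds all shared transactions
    iutil: Dict[int, int] = {}
    rutil: Dict[int, int] = {}
    bits = common
    while bits:
        low = bits & -bits               # isolate the lowest set bit
        tid = low.bit_length() - 1       # convert the bit to a transaction ID
        iutil[tid] = px.iutil[tid] + py.iutil[tid]
        rutil[tid] = py.rutil[tid]       # remaining utility follows the later item
        bits ^= low                      # clear that bit and continue
    return BitUtilityList(common, iutil, rutil)


# Toy database (assumed for illustration), item order a < b < c:
#   T0: a=5, b=2, c=1    T1: a=10, c=6    T2: b=4, c=3
a = BitUtilityList(0b011, {0: 5, 1: 10}, {0: 3, 1: 6})
b = BitUtilityList(0b101, {0: 2, 2: 4}, {0: 1, 2: 3})
ab = join(a, b)
total_utility_ab = sum(ab.iutil.values())
```

In this sketch, the intersection step costs one integer AND instead of a merge over two sorted transaction-ID lists. A full implementation would also need to handle joins of larger itemsets that share a prefix, which is omitted here.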

Keywords: Pattern mining, High utility itemset, Utility list buffer, Bitwise operations

Article history: Received 14 October 2021, Revised 8 March 2022, Accepted 18 April 2022, Available online 25 April 2022, Version of Record 10 May 2022.

Official article URL: https://doi.org/10.1016/j.knosys.2022.108865