Efficient high-utility occupancy itemset mining algorithm on massive data

Authors:

Highlights:

• A suffix-partitioning-based SHO algorithm is proposed to mine high-utility occupancy itemsets (HUOIs) on massive data.

• Two optimization strategies are proposed to prune itemsets as early as possible.

• Four pruning strategies and two upper bounds are designed.

• Extensive experimental results show the high efficiency of the SHO algorithm.

Keywords: Massive data, High utility occupancy pattern mining, Suffix-based partitioning, LI strategy, RTI optimization strategy

Review history: Received 4 March 2022, Revised 4 July 2022, Accepted 29 July 2022, Available online 4 August 2022, Version of Record 12 August 2022.

Official paper URL: https://doi.org/10.1016/j.eswa.2022.118329