Extracting non-redundant correlated purchase behaviors by utility measure

作者:

Highlights:

摘要

From web search and data mining, users’ click and purchase behaviors contain valuable information, thus numerous approaches have been proposed to identify embedded useful knowledge from them. In these real-life situations, each user may perform the same action/event multiple times, and multiple accessed events product different profit. Many utility-oriented data mining approaches thus have been extensively studied. Previous studies have the limitation that the overall utility of traditional pattern is limited since they rarely consider the inherent correlation. For example, from the purchase behavior, the low-utility patterns sometimes with a very high-utility pattern will be considered as a valuable pattern even if this behavior may be not highly correlated. A more intelligent framework that provides non-redundant and correlated behavior based on utility measure is thus desired. In this paper, we first present a novel method to extract non-redundant correlated purchase behaviors considering the utility and correlation factors. The high qualified patterns can be derived with high profit and strong correlation, which can lead to higher recall and reveal better precision. In the proposed projection-based approach, an efficient projection mechanism and a sorted downward closure property are developed to reduce the database size. Several pruning strategies are further developed to efficiently and effectively discover the desired patterns. An extensive experimental study showed that the novel non-redundant correlated high-utility pattern has more effectiveness than the previous knowledge representation. Moreover, the proposed algorithm is efficient in terms of execution time and memory usage.

论文关键词:Purchase behavior,Utility,Inherent correlation,Projection

论文评审过程:Received 16 May 2017, Revised 4 December 2017, Accepted 5 December 2017, Available online 8 December 2017, Version of Record 3 February 2018.

论文官网地址:https://doi.org/10.1016/j.knosys.2017.12.003