An efficient method for mining sequential patterns with indices

作者:

Highlights:

摘要

In recent years, mining informative data and discovering hidden information have become increasingly in demand. One of the popular means to achieve this is sequential pattern mining, which is to find informative patterns stored in databases. Its applications cover different areas and many methods have been proposed. Recently, pseudo-IDLists were proposed to improve both runtime and memory usage in the mining process. However, the idea cannot be directly used for sequential pattern mining as it only works on clickstream patterns, a more distinct type of sequential pattern. We propose adaptations and changes to the original idea to introduce SUI (Sequential pattern mining Using Indices). Comparing SUI with two other state-of-the-art algorithms on six test databases, we show that SUI has effective and efficient performance and memory usage.

论文关键词:Pseudo-IDList,Data-IDList,Vertical format,Sequential pattern mining

论文评审过程:Received 2 August 2021, Revised 10 November 2021, Accepted 11 December 2021, Available online 23 December 2021, Version of Record 11 January 2022.

论文官网地址:https://doi.org/10.1016/j.knosys.2021.107946