New approach for the sequential pattern mining of high-dimensional sequence databases

作者:

Highlights:

摘要

In this paper a new algorithm, the Top-Down mining of Sequential patterns (TD-Seq), for mining sequential patterns from high-dimensional stock sequence databases is presented. Existing algorithms are limited by efficiency problems in dealing with high-dimensional sequence databases. To address this problem, a two-phase mining method is proposed, in which a top-down transposition-based searching strategy as well as a new support counting method are exploited. Three pruning rules were also developed to reduce the search space further. Experiments conducted on actual databases demonstrate the improved performance of TD-Seq over existing algorithms.

论文关键词:Sequential pattern mining,High-dimensional database,Data mining

论文评审过程:Received 20 March 2009, Revised 4 August 2010, Accepted 17 August 2010, Available online 21 August 2010.

论文官网地址:https://doi.org/10.1016/j.dss.2010.08.029