Efficient mining of traversal patterns

作者:

Highlights:

摘要

A new problem of mining traversal patterns from Web access logs is introduced. The traversal patterns are defined to keep duplicates as well as consecutive ordering in the sessions. Then an efficient algorithm is proposed. The algorithm is online, which allows the user to see the incremental results with respect to the scanned part of the database. The algorithm also adapts to large databases through dynamic compressions and effective pruning. Finally the algorithm is evaluated through experiments with real Web logs.

论文关键词:Data mining,Clickstream analysis,Traversal patterns,Suffix tree

论文评审过程:Received 24 July 2001, Revised 24 July 2001, Accepted 24 July 2001, Available online 3 October 2001.

论文官网地址:https://doi.org/10.1016/S0169-023X(01)00039-8