Validation and interpretation of Web users’ sessions clusters

作者:

Highlights:

摘要

Understanding users’ navigation on the Web is important towards improving the quality of information and the speed of accessing large-scale Web data sources. Clustering of users’ navigation into sessions has been proposed in order to identify patterns and similarities which are then managed in the context of Web users oriented applications (searching, e-commerce, etc.). This paper deals with the problem of assessing the quality of user session clusters in order to make inferences regarding the users’ navigation behavior. A common model-based clustering algorithm is used to result in clusters of Web users’ sessions. These clusters are validated by using a statistical test, which measures the distances of the clusters’ distributions to infer their dissimilarity and distinguishing level. Furthermore, a visualization method is proposed in order to interpret the relation between clusters. Using real data sets, we illustrate how the proposed analysis can be applied in popular application scenarios to reveal valuable associations among Web users’ navigation sessions.

论文关键词:Cluster validation,Web data clustering,Cluster interpretation,Cluster visualization,Web users’ sessions mining

论文评审过程:Received 25 May 2006, Revised 15 October 2006, Accepted 15 October 2006, Available online 21 December 2006.

论文官网地址:https://doi.org/10.1016/j.ipm.2006.10.010