Web log data warehousing and mining for intelligent web caching

作者:

Highlights:

摘要

We introduce intelligent web caching algorithms that employ predictive models of web requests; the general idea is to extend the least recently used (LRU) policy of web and proxy servers by making it sensitive to web access models extracted from web log data using data mining techniques. Two approaches have been studied in particular, frequent patterns and decision trees. The experimental results of the new algorithms show substantial improvement over existing LRU-based caching techniques, in terms of hit rate. We designed and developed a prototypical system, which supports data warehousing of web log data, extraction of data mining models and simulation of the web caching algorithms.

论文关键词:Web caching,Log data warehousing,Data mining,Frequent patterns,Association rules,Decision trees

论文评审过程:Received 19 June 2001, Revised 24 July 2001, Accepted 24 July 2001, Available online 3 October 2001.

论文官网地址:https://doi.org/10.1016/S0169-023X(01)00038-6