Frequent items in streaming data: An experimental evaluation of the state-of-the-art

作者:

Highlights:

摘要

The problem of detecting frequent items in streaming data is relevant to many different applications across many domains. Several algorithms, diverse in nature, have been proposed in the literature for the solution of the above problem. In this paper, we review these algorithms, and we present the results of the first extensive comparative experimental study of the most prominent algorithms in the literature. The algorithms were comprehensively tested using a common test framework on several real and synthetic datasets. Their performance with respect to the different parameters (i.e., parameters intrinsic to the algorithms, and data related parameters) was studied. We report the results, and insights gained through these experiments.

论文关键词:Frequent items,Data streams,Stream mining

论文评审过程:Received 28 June 2008, Revised 17 October 2008, Accepted 24 November 2008, Available online 6 December 2008.

论文官网地址:https://doi.org/10.1016/j.datak.2008.11.001