Event-based lossy compression for effective and efficient OLAP over data streams

作者:

Highlights:

摘要

An innovative event-based lossy compression model for effective and efficient OLAP over data streams, called ECM-DS, is presented and experimentally assessed in this paper. The main novelty of our compression approach with respect to traditional data stream compression techniques relies on exploiting the semantics of the reference application scenario in order to drive the compression process by means of the “degree of interestingness” of events occurring in the target stream. This finally improves the quality of retrieved approximate answers to OLAP queries over data streams, and, in turn, the quality of complex knowledge discovery tasks over data streams developed on top of ECM-DS, and implemented via ad-hoc data stream mining algorithms. Overall, the compression strategy we propose in this research puts the basis for a novel class of intelligent applications over data streams where the knowledge on actual streams is integrated-with and correlated-to the knowledge related to expired events that are considered critical for the target OLAP analysis scenario. Finally, a comprehensive experimental evaluation over several classes of data stream sets clearly confirms the benefits deriving from the event-based data stream compression approach proposed in ECM-DS.

论文关键词:Data stream query processing,Data stream compression methodologies and techniques,Knowledge discovery from data streams,OLAP over data streams,Event-based data stream processing,Event-based data stream compression

论文评审过程:Available online 20 February 2010.

论文官网地址:https://doi.org/10.1016/j.datak.2010.02.006