Efficient processing of probabilistic group subspace skyline queries in uncertain databases

作者:

Highlights:

摘要

Due to the pervasive data uncertainty in many real applications, efficient and effective query answering on uncertain data has recently gained much attention from the database community. In this paper, we propose a novel and important query in the context of uncertain databases, namely probabilistic group subspace skyline (PGSS) query, which is useful in applications like sensor data analysis. Specifically, a PGSS query retrieves those uncertain objects that are, with high confidence, not dynamically dominated by other objects, with respect to a group of query points in ad-hoc subspaces. In order to enable fast PGSS query answering, we propose effective pruning methods to reduce the PGSS search space, which are seamlessly integrated into an efficient PGSS query procedure. Furthermore, to achieve low query cost, we provide a cost model, in light of which uncertain data are pre-processed and indexed. Extensive experiments have been conducted to demonstrate the efficiency and effectiveness of our proposed approaches.

论文关键词:Uncertain database,Probabilistic group subspace skyline queries

论文评审过程:Received 16 February 2011, Revised 23 August 2012, Accepted 28 August 2012, Available online 26 September 2012.

论文官网地址:https://doi.org/10.1016/j.is.2012.08.006