Generating efficient safe query plans for probabilistic databases

作者:

Highlights:

摘要

Managing uncertain information using probabilistic databases has drawn much attention recently in many fields. Generating efficient safe plans is the key to evaluating queries whose data complexities are PTIME. In this paper, we propose a new approach generating efficient safe plans for queries. Our algorithm adopts effective preprocessing and multiway split techniques, thus the generating safe plans avoid unnecessary probabilistic cartesian-products and have the minimum number of probabilistic projections. Further, we extend existing transformation rules to allow the safe plans generated by the Safe-Plan algorithm [N. Dalvi, D. Suciu, Efficient query evaluation on probabilistic database, The VLDB Journal 16 (4) (2007) 523–544] and the proposed algorithm to be transformed by each other. Applying our approach through the TPC-H benchmark queries, the experiments show that the safe plans generated by our algorithm are more efficient than those generated by the Safe-Plan algorithm.

论文关键词:Probabilistic databases,Safe plan,Query evaluation

论文评审过程:Received 19 March 2008, Revised 6 August 2008, Accepted 7 August 2008, Available online 25 August 2008.

论文官网地址:https://doi.org/10.1016/j.datak.2008.08.002