Clustering factor estimation for totally clustered attributes

作者:

Highlights:

摘要

Cost models based on the clustering factor (CF) of the attributes have been proposed and shown to be attractive for block access estimation in databases, thanks to their accuracy and economy of use. While query optimizers can use the actual CFs, measured from the data, physical design methods and tools must rely on estimates before the data are stored.In this paper we present a CF estimation procedure which can be applied to totally clustered attributes (e.g. ordered attributes). Simple and accurate approximations of the derived formulas are also introduced.Simulations show the accuracy of the proposed CF estimates and the improvment in their behaviour compared to previously published estimates. Reliability for physical design of cost models based on the CF in the presence of a skewed data distribution is also discussed.

论文关键词:Databases,Clustering factor,Performance evaluation,Physical design,Relational database

论文评审过程:Received 4 February 1994, Revised 25 July 1994, Accepted 27 September 1994, Available online 22 December 1999.

论文官网地址:https://doi.org/10.1016/0169-023X(94)00026-B