On the design of multiple key hashing files for concurrent orthogonal range retrieval between two disks

作者:

Highlights:

摘要

This paper concerns the following problem: given a set of multi-attribute records, a fixed number of buckets and a two-disk system, arrange the records into the buckets and then store the buckets between the disks in such a way that, over all possible orthogonal range queries (ORQs), the disk access concurrency is maximized. We shall adopt the multiple key hashing (MKH) method for arranging records into buckets and use the disk modulo (DM) allocation method for storing buckets onto disks. Since the DM allocation method has been shown to be superior to any other allocation methods for allocating an MKH file onto a two-disk system for answering ORQs, the real issue is knowing how to determine an optimal way for organizing the records into buckets based upon the MKH concept.A performance formula that can be used to evaluate the average response time, over all possible ORQs, of an MKH file in a two-disk system using the DM allocation method is first presented. Based upon this formula, it is shown that our design problem is related to a notoriously difficult problem, namely the Prime Number Problem. Then a performance lower bound and an efficient algorithm for designing optimal MKH files in certain cases are presented. It is pointed out that in some cases the optimal MKH file for ORQs in a two-disk system using the DM allocation method is identical to the optimal MKH file for ORQs in a single-disk system and the optimal average response time in a two-disk system is slightly greater than one half of that in a single-disk system.

论文关键词:Multi-attribute file,record-to-bucket problem,bucket-to-disk problem,partial match query (PMQ),orthogonal range query (ORQ),multiple key hashing (MKH) file,disk modulo (DM) allocation method,strictly optimal allocation method,average response time,performance lower bound,minimum marginal increase (MMI) algorithm

论文评审过程:Received 11 June 1990, Revised 3 March 1991, Available online 17 June 2003.

论文官网地址:https://doi.org/10.1016/0306-4379(91)90022-2