SPY-TEC: An efficient indexing method for similarity search in high-dimensional data spaces

作者:

Highlights:

摘要

Most of all index structures based on the R-tree have failed to support efficient indexing mechanisms for similarity search in high-dimensional data spaces. This is due to the fact that most of the index structures commonly use balanced split strategy in order to guarantee storage utilization and the shape of queries for similarity search is a hypersphere in high-dimensional spaces. In this paper, we propose the Spherical Pyramid-Technique (SPY-TEC), an efficient indexing method for similarity search in high-dimensional data space. The SPY-TEC is based on a special partitioning strategy, which is to divide the d-dimensional data space first into 2d spherical pyramids, and then cut the single spherical pyramid into several spherical slices. This partition provides a transformation of d-dimensional space into one-dimensional space as the Pyramid-Technique [14] does. Thus, we are able to use a B+-tree to manage the transformed one-dimensional data. We also propose the algorithms to process hyperspherical range queries on the data space partitioned by this partitioning strategy. Finally, we show that the SPY-TEC clearly outperforms other related techniques including the Pyramid-Technique in processing hyperspherical range queries through various experiments using synthetic and real data.

论文关键词:Similarity search,High-dimensional index technique,Multimedia database

论文评审过程:Received 9 March 1999, Revised 10 September 1999, Accepted 3 February 2000, Available online 13 June 2000.

论文官网地址:https://doi.org/10.1016/S0169-023X(00)00009-4