On Approximate Nearest Neighbors under l∞ Norm

作者:

Highlights:

摘要

The nearest neighbor search (NNS) problem is the following: Given a set of n points P={p1, …, pn} in some metric space X, preprocess P so as to efficiently answer queries which require finding a point in P closest to a query point q∈X. The approximate nearest neighbor search (c-NNS) is a relaxation of NNS which allows to return any point within c times the distance to the nearest neighbor (called c-nearest neighbor). This problem is of major and growing importance to a variety of applications. In this paper, we give an algorithm for (4 ⌈log1+ρ log 4d⌉+1)-NNS algorithm in ld∞ with O(dn1+ρ logO(1) n) storage and O(d logO(1) n) query time. Moreover, we obtain an algorithm for 3-NNS for l∞ with nlog d+1 storage. The preprocessing time is close to linear in the size of the data structure. The algorithm can be also used (after simple modifications) to output the exact nearest neighbor in time bounded by O(d logO(1) n) plus the number of (4 ⌈log1+ρ log 4d⌉+1)-nearest neighbors of the query point. Building on this result, we also obtain an approximation algorithm for a general class of product metrics. Finally, we show that for any c<3 the c-NNS problem in l∞ is provably as hard as the subset query problem (also called the partial match problem). This indicates that obtaining a sublinear query time and subexponential (in d) space for c<3 might be hard.

论文关键词:

论文评审过程:Received 30 March 1999, Revised 1 August 2000, Available online 25 May 2002.

论文官网地址:https://doi.org/10.1006/jcss.2001.1781