ANN-Benchmarks: A benchmarking tool for approximate nearest neighbor algorithms

Abstract

This paper describes ANN-Benchmarks, a tool for evaluating the performance of in-memory approximate nearest neighbor algorithms. It provides a standard interface for measuring the performance and quality achieved by nearest neighbor algorithms on different standard data sets. It supports several different ways of integrating k-NN algorithms, and its configuration system automatically tests a range of parameter settings for each algorithm. Algorithms are compared with respect to many different (approximate) quality measures, and adding more is easy and fast; the included plotting front-ends can visualize these as images, LaTeX plots, and websites with interactive plots. ANN-Benchmarks aims to provide a constantly updated overview of the current state of the art of k-NN algorithms. In the short term, this overview allows users to choose the correct k-NN algorithm and parameters for their similarity search task; in the longer term, algorithm designers will be able to use this overview to test and refine automatic parameter tuning. The paper gives an overview of the system, evaluates the results of the benchmark, and points out directions for future work. Interestingly, very different approaches to k-NN search yield comparable quality-performance trade-offs. The system is available at http://ann-benchmarks.com.
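
To make the quality-performance trade-off mentioned in the abstract concrete, here is a minimal sketch of how such a measurement could look. It is not the ANN-Benchmarks interface: every name in it (`brute_force_knn`, `recall_at_k`, `make_prefix_search`, `evaluate`, `run_sweep`) is hypothetical, the "approximate" algorithm is a toy that scans only a prefix of the data, and quality is simplified to set-overlap recall against exact brute-force results.

```python
import math
import random
import time


def brute_force_knn(data, query, k):
    """Exact k nearest neighbors by Euclidean distance; returns point indices."""
    order = sorted(range(len(data)), key=lambda i: math.dist(data[i], query))
    return order[:k]


def recall_at_k(candidate_ids, exact_ids):
    """Fraction of the true nearest neighbors that the candidate list contains."""
    return len(set(candidate_ids) & set(exact_ids)) / len(exact_ids)


def make_prefix_search(data, fraction):
    """Toy approximate search (an assumption for illustration only):
    scan just the first `fraction` of the data set."""
    limit = max(1, int(len(data) * fraction))
    subset = data[:limit]  # prefix keeps indices aligned with `data`

    def search(query, k):
        return brute_force_knn(subset, query, k)

    return search


def evaluate(search_fn, data, queries, k):
    """Return (mean recall, queries per second) for one parameter setting."""
    truth = [brute_force_knn(data, q, k) for q in queries]
    start = time.perf_counter()
    results = [search_fn(q, k) for q in queries]
    elapsed = time.perf_counter() - start
    mean_recall = sum(recall_at_k(r, t) for r, t in zip(results, truth)) / len(queries)
    qps = len(queries) / elapsed if elapsed > 0 else float("inf")
    return mean_recall, qps


def run_sweep(fractions=(0.25, 0.5, 1.0), n=200, dim=4, n_queries=20, k=5, seed=0):
    """Try a range of parameter settings and collect (fraction, recall, qps)."""
    rng = random.Random(seed)
    data = [[rng.random() for _ in range(dim)] for _ in range(n)]
    queries = [[rng.random() for _ in range(dim)] for _ in range(n_queries)]
    rows = []
    for fraction in fractions:
        recall, qps = evaluate(make_prefix_search(data, fraction), data, queries, k)
        rows.append((fraction, recall, qps))
    return rows


if __name__ == "__main__":
    for fraction, recall, qps in run_sweep():
        print(f"fraction={fraction:.2f}  recall={recall:.3f}  qps={qps:.0f}")
```

Each row of the sweep is one point on a recall versus queries-per-second curve; plotting such points for several algorithms is the kind of comparison the abstract describes.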

Keywords: 97P30, Benchmarking, Nearest neighbor search, Evaluation

Review history: Received 9 July 2018, Revised 27 November 2018, Accepted 13 February 2019, Available online 21 February 2019, Version of Record 18 October 2019.

Paper URL: https://doi.org/10.1016/j.is.2019.02.006