New measurements for search engine evaluation proposed and tested

Authors:

Abstract

A set of measurements is proposed for evaluating Web search engine performance. Some measurements are adapted from the concepts of recall and precision, which are commonly used in evaluating traditional information retrieval systems. Others are newly developed to evaluate search engine stability, an issue unique to Web information retrieval systems. An experiment was conducted to test these new measurements by applying them to a performance comparison of three commercial search engines: Google, AltaVista, and Teoma. Twenty-four subjects ranked four sets of Web pages and their rankings were used as benchmarks against which to compare search engine performance. Results show that the proposed measurements are able to distinguish search engine performance very well.
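The abstract does not spell out the measurements themselves, so the following is only an illustrative sketch of the general idea, not the paper's method. It assumes two stand-in measures: Spearman rank correlation to compare an engine's ordering of pages against a human benchmark ranking, and a simple top-k overlap between two runs of the same query as a rough stability indicator. The function names `spearman_rho` and `top_k_overlap` and the sample page labels are hypothetical.

```python
from typing import Hashable, Sequence


def spearman_rho(benchmark: Sequence[Hashable], engine: Sequence[Hashable]) -> float:
    """Spearman rank correlation between two tie-free rankings of the same items.

    Assumption: this is a stand-in for comparing an engine's ranking with a
    human benchmark ranking; the paper's actual measurements may differ.
    """
    n = len(benchmark)
    if n < 2:
        raise ValueError("need at least two items")
    if len(engine) != n or len(set(benchmark)) != n or set(benchmark) != set(engine):
        raise ValueError("rankings must be permutations of the same distinct items")
    # Position of each item in the engine's ranking.
    engine_pos = {item: i for i, item in enumerate(engine)}
    # Sum of squared rank differences.
    d2 = sum((i - engine_pos[item]) ** 2 for i, item in enumerate(benchmark))
    return 1 - 6 * d2 / (n * (n ** 2 - 1))


def top_k_overlap(first: Sequence[Hashable], second: Sequence[Hashable], k: int) -> float:
    """Fraction of the top-k results shared by two runs of the same query.

    Assumption: a hypothetical stability indicator, where 1.0 means the two
    runs returned the same set of top-k results (order ignored).
    """
    if k <= 0:
        raise ValueError("k must be positive")
    return len(set(first[:k]) & set(second[:k])) / k


if __name__ == "__main__":
    human = ["page_a", "page_b", "page_c", "page_d"]   # hypothetical benchmark ranking
    engine = ["page_b", "page_a", "page_c", "page_d"]  # hypothetical engine ranking
    print(spearman_rho(human, engine))

    week1 = ["page_a", "page_b", "page_c"]  # hypothetical results, first run
    week2 = ["page_b", "page_a", "page_d"]  # hypothetical results, later run
    print(top_k_overlap(week1, week2, k=3))
```

A correlation near 1 would indicate that the engine orders pages much as the human subjects did, and an overlap near 1 would indicate that repeated runs of the same query return largely the same top results.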

Keywords: Web search engines, Evaluation criteria, Information retrieval experiment

Review history: Received 22 January 2003, Accepted 12 May 2003, Available online 19 June 2003.

Official paper URL: https://doi.org/10.1016/S0306-4573(03)00043-8