Evaluating information retrieval system performance based on user preference

作者:Bing Zhou, Yiyu Yao

摘要

One of the challenges of modern information retrieval is to rank the most relevant documents at the top of the large system output. This calls for choosing the proper methods to evaluate the system performance. The traditional performance measures, such as precision and recall, are based on binary relevance judgment and are not appropriate for multi-grade relevance. The main objective of this paper is to propose a framework for system evaluation based on user preference of documents. It is shown that the notion of user preference is general and flexible for formally defining and interpreting multi-grade relevance. We review 12 evaluation methods and compare their similarities and differences. We find that the normalized distance performance measure is a good choice in terms of the sensitivity to document rank order and gives higher credits to systems for their ability to retrieve highly relevant documents.

论文关键词:Multi-grade relevance, Evaluation methods, User preference

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10844-009-0096-5