Evaluating epistemic uncertainty under incomplete assessments

作者:

Highlights:

摘要

The thesis of this study is to propose an extended methodology for laboratory based Information Retrieval evaluation under incomplete relevance assessments. This new methodology aims to identify potential uncertainty during system comparison that may result from incompleteness. The adoption of this methodology is advantageous, because the detection of epistemic uncertainty – the amount of knowledge (or ignorance) we have about the estimate of a system’s performance – during the evaluation process can guide and direct researchers when evaluating new systems over existing and future test collections. Across a series of experiments we demonstrate how this methodology can lead towards a finer grained analysis of systems. In particular, we show through experimentation how the current practice in Information Retrieval evaluation of using a measurement depth larger than the pooling depth increases uncertainty during system comparison.

论文关键词:Information Retrieval evaluation,Incompleteness,System comparison,Test collections

论文评审过程:Received 29 November 2006, Revised 2 April 2007, Accepted 3 April 2007, Available online 25 May 2007.

论文官网地址:https://doi.org/10.1016/j.ipm.2007.04.002