A probabilistic model of information retrieval: development and comparative experiments: Part 2

作者:

Highlights:

摘要

The paper combines a comprehensive account of the probabilistic model of retrieval with new systematic experiments on TREC Programme material. It presents the model from its foundations through its logical development to cover more aspects of retrieval data and a wider range of system functions. Each step in the argument is matched by comparative retrieval tests, to provide a single coherent account of a major line of research. The experiments demonstrate, for a large test collection, that the probabilistic model is effective and robust, and that it responds appropriately, with major improvements in performance, to key features of retrieval situations.Part 1 covers the foundations and the model development for document collection and relevance data, along with the test apparatus. Part 2 covers the further development and elaboration of the model, with extensive testing, and briefly considers other environment conditions and tasks, model training, concluding with comparisons with other approaches and an overall assessment.Data and results tables for both parts are given in Part 1. Key results are summarised in Part 2.

论文关键词:Information retrieval,Retrieval theory,Probabilistic model,Term weighting,Experiments

论文评审过程:Received 12 November 1999, Accepted 7 February 2000, Available online 28 July 2000.

论文官网地址:https://doi.org/10.1016/S0306-4573(00)00016-9