Assessing bias in search engines

作者:

Highlights:

摘要

This paper deals with the measurement of bias in search engines on the World Wide Web. Bias is taken to mean the balance and representativeness of items in a collection retrieved from a database for a set of queries. This calls for assessing the degree to which the distribution of items in a collection deviates from the ideal. Ascertaining this ideal poses problems similar to those associated with determining relevance in the measurement of recall and precision. Instead of enlisting subject experts or users to determine such an ideal, a family of comparable search engines is used to approximate it for a set of queries. The distribution is obtained by computing the frequencies of occurrence of the uniform resource locators (URLs) in the collection retrieved by several search engines for the given queries. Bias is assessed by measuring the deviation from the ideal of the distribution produced by a particular search engine.

论文关键词:Bias,Search engines,Retrieval performance,System measurement

论文评审过程:Received 18 August 2000, Accepted 25 January 2001, Available online 29 August 2001.

论文官网地址:https://doi.org/10.1016/S0306-4573(01)00020-6