On the effect of relevance scales in crowdsourcing relevance assessments for Information Retrieval evaluation

Authors:

Highlights:

• We collect relevance judgments for 4 crowdsourced scales.

• We compare the crowd judgments with two expert-labeled datasets.

• We study the effect on IR evaluation in terms of system effectiveness and topic ease.

• We release the data publicly.

Keywords: Relevance scales, Crowdsourcing, Information Retrieval evaluation, Relevance assessment

Review history: Received 6 April 2021, Revised 21 June 2021, Accepted 5 July 2021, Available online 28 July 2021, Version of Record 28 July 2021.

Paper URL: https://doi.org/10.1016/j.ipm.2021.102688