Optimizing microtask assignment on crowdsourcing platforms using Markov chain Monte Carlo

作者:

Highlights:

• Overcoming one-off quality estimation approach using Monte Carlo Markov Chain

• Overcoming after-work quality estimation using an iterative quality estimation approach

• Examining Monte Carlo Markov Chain functionality for iterative worker quality estimation in microtasking

摘要

Microtasking is a type of crowdsourcing, denoting the act of breaking a job into several tasks and allocating them to multiple workers to complete. The assignment of tasks to workers is a complex decision-making process, particularly when considering budget and quality constraints. While there is a growing body of knowledge on the development of task assignment algorithms, the current algorithms suffer from shortcomings including: after-worker quality estimation, meaning that workers need to complete all tasks after which point their quality can be estimated; and one-off quality estimation method which estimates workers' quality only at the start of microtasking using a set of pre-defined quality-control tasks. To address these shortcomings, we propose a Markov Chain Monte Carlo–based task assignment approach known as MCMC-TA which provides iterative estimations of workers' quality and dynamic task assignment. Specifically, we apply Gaussian mixture model (GMM) to estimate workers' quality and Markov Chain Monte Carlo to shortlist workers for task assignment. We use Google Fact Evaluation dataset to measure the performance of MCMC-TA and compare it against the state-of-the-art algorithms in terms of AUC and F-Score. The results show that the proposed MCMC-TA algorithm not only outperforms the rival algorithms, but also offers a spammer-resistant result that maximizes the learning of workers' quality with minimal budget.

论文关键词:Crowdsourcing,Task assignment,Markov chain,Crowd labeling,Quality estimation

论文评审过程:Received 25 March 2020, Revised 13 September 2020, Accepted 15 September 2020, Available online 18 September 2020, Version of Record 6 November 2020.

论文官网地址:https://doi.org/10.1016/j.dss.2020.113404