A unified statistical framework for crowd labeling

作者：Jafar Muhammadi, Hamid R. Rabiee, Abbas Hosseini

摘要

Recently, there has been a burst in the number of research projects on human computation via crowdsourcing. Multiple-choice (or labeling) questions could be referred to as a common type of problem which is solved by this approach. As an application, crowd labeling is applied to find true labels for large machine learning datasets. Since crowds are not necessarily experts, the labels they provide are rather noisy and erroneous. This challenge is usually resolved by collecting multiple labels for each sample and then aggregating them to estimate the true label. Although the mechanism leads to high-quality labels, it is not actually cost-effective. As a result, efforts are currently made to maximize the accuracy in estimating true labels, while fixing the number of acquired labels.

论文关键词：Crowdsourcing, Human computation, Mechanical turk , Labeling, Latent model, Inference

论文评审过程：

论文官网地址：https://doi.org/10.1007/s10115-014-0790-7