Large-Scale Live Active Learning: Training Object Detectors with Crawled Data and Crowds

Authors: Sudheendra Vijayanarasimhan, Kristen Grauman

Abstract

Active learning and crowdsourcing are promising ways to efficiently build up training sets for object recognition, but thus far techniques are tested in artificially controlled settings. Typically the vision researcher has already determined the dataset’s scope, the labels “actively” obtained are in fact already known, and/or the crowd-sourced collection process is iteratively fine-tuned. We present an approach for live learning of object detectors, in which the system autonomously refines its models by actively requesting crowd-sourced annotations on images crawled from the Web. To address the technical issues such a large-scale system entails, we introduce a novel part-based detector amenable to linear classifiers, and show how to identify its most uncertain instances in sub-linear time with a hashing-based solution. We demonstrate the approach with experiments of unprecedented scale and autonomy, and show it successfully improves the state-of-the-art for the most challenging objects in the PASCAL VOC benchmark. In addition, we show our detector competes well with popular nonlinear classifiers that are much more expensive to train.

Keywords: Object detection, Active learning, Large-scale learning, Hashing, Crowdsourcing, Image annotation

Paper URL: https://doi.org/10.1007/s11263-014-0721-9