Agnostic active learning

Authors:

Abstract

We state and analyze the first active learning algorithm that finds an ϵ-optimal hypothesis in any hypothesis class, when the underlying distribution has arbitrary forms of noise. The algorithm, A² (for Agnostic Active), relies only upon the assumption that it has access to a stream of unlabeled examples drawn i.i.d. from a fixed distribution. We show that A² achieves an exponential improvement (i.e., requires only O(ln(1/ϵ)) samples to find an ϵ-optimal classifier) over the usual sample complexity of supervised learning, for several settings considered before in the realizable case. These include learning threshold classifiers and learning homogeneous linear separators with respect to an input distribution which is uniform over the unit sphere.
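
The logarithmic label complexity mentioned in the abstract is easiest to see for threshold classifiers in the noise-free (realizable) case, where a binary search over a sorted unlabeled pool suffices. The sketch below illustrates only that intuition; it is not the agnostic algorithm of the paper and makes no attempt to handle noise. The names `active_learn_threshold`, `query_label`, and `demo` are hypothetical and chosen for this illustration.

```python
import math
import random


def active_learn_threshold(pool, query_label):
    """Binary-search a threshold over an unlabeled pool (noise-free case only).

    Assumes labels follow h(x) = 1 if x >= t else 0 for some unknown t.
    Returns (estimated_threshold, number_of_label_queries).
    """
    xs = sorted(pool)
    # Invariant: every xs[i] with i < lo is labeled 0,
    # and every xs[i] with i >= hi is labeled 1.
    lo, hi = 0, len(xs)
    queries = 0
    while lo < hi:
        mid = (lo + hi) // 2
        queries += 1
        if query_label(xs[mid]) == 1:
            hi = mid          # first positive point is at mid or earlier
        else:
            lo = mid + 1      # first positive point is after mid
    # lo is now the index of the first positive point (len(xs) if there is none).
    estimate = xs[lo] if lo < len(xs) else math.inf
    return estimate, queries


def demo(seed=0, n=1000, true_threshold=0.37):
    """Draw n unlabeled points uniformly from [0, 1] and learn the threshold."""
    rng = random.Random(seed)
    pool = [rng.random() for _ in range(n)]

    def oracle(x):
        # Hypothetical noise-free labeling oracle.
        return 1 if x >= true_threshold else 0

    return active_learn_threshold(pool, oracle)
```

Calling `demo()` returns a threshold that labels every point in the pool correctly while using at most floor(log2(n)) + 1 label queries, whereas a passive learner would label all n points. Since a pool of roughly 1/ϵ points (up to confidence terms) is enough for an ϵ-accurate threshold in this setting, the number of labels requested grows like ln(1/ϵ).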

Keywords: Active learning, Agnostic setting, Sample complexity, Linear separators

Article history: Received 1 June 2007, Revised 18 June 2008, Available online 18 July 2008.

Official paper URL: https://doi.org/10.1016/j.jcss.2008.07.003