Scalable greedy algorithms for transfer learning

作者:

Highlights:

摘要

In this paper we consider the binary transfer learning problem, focusing on how to select and combine sources from a large pool to yield a good performance on a target task. Constraining our scenario to real world, we do not assume the direct access to the source data, but rather we employ the source hypotheses trained from them. We propose an efficient algorithm that selects relevant source hypotheses and feature dimensions simultaneously, building on the literature on the best subset selection problem. Our algorithm achieves state-of-the-art results on three computer vision datasets, substantially outperforming both transfer learning and popular feature selection baselines in a small-sample setting. We also present a randomized variant that achieves the same results with the computational cost independent from the number of source hypotheses and feature dimensions. Also, we theoretically prove that, under reasonable assumptions on the source hypotheses, our algorithm can learn effectively from few examples.

论文关键词:

论文评审过程:Received 8 October 2015, Revised 14 June 2016, Accepted 7 September 2016, Available online 8 September 2016, Version of Record 14 February 2017.

论文官网地址:https://doi.org/10.1016/j.cviu.2016.09.003