Zero-shot learning by mutual information estimation and maximization

作者:

Highlights:

摘要

The key of zero-shot learning is to use the visual-semantic embedding to transfer the knowledge from seen classes to unseen classes. In this paper, we propose to build the visual-semantic embedding by maximizing the mutual information between visual features and corresponding attributes. Then, the mutual information between visual and semantic features can be utilized to guide the knowledge transfer from seen domain to unseen domain. Since we are primarily interested in maximizing mutual information, we introduce the noise-contrastive estimation to calculate lower-bound value of mutual information. Through the noise-contrastive estimation, we reformulate zero-shot learning as a binary classification problem, i.e., classifying the matching visual-semantic pairs (positive samples) and mismatching visual-semantic pairs (negative/noise samples). Experiments conducted on five datasets demonstrate that the proposed mutual information estimators outperforms current state-of-the-art methods both in conventional and generalized zero-shot learning settings.

论文关键词:Zero-shot learning,Mutual information,Noise-contrastive estimation,Visual-semantic embedding

论文评审过程:Received 1 July 2019, Revised 6 January 2020, Accepted 7 January 2020, Available online 14 January 2020, Version of Record 18 May 2020.

论文官网地址:https://doi.org/10.1016/j.knosys.2020.105490