Discriminative deep attributes for generalized zero-shot learning

作者:

Highlights:

摘要

We indirectly predict a class by deriving user-defined (i.e., existing) attributes (UA) from an image in generalized zero-shot learning (GZSL). High-quality attributes are essential for GZSL, but the existing UAs are sometimes not discriminative. We observe that the hidden units at each layer in a convolutional neural network (CNN) contain highly discriminative semantic information across a range of objects, parts, scenes, textures, materials, and color. The semantic information in CNN features is similar to the attributes that can distinguish each class. Motivated by this observation, we employ CNN features like novel class representative semantic data, i.e., deep attribute (DA). Precisely, we propose three objective functions (e.g., compatible, discriminative, and intra-independent) to inject the fundamental properties into the generated DA. We substantially outperform the state-of-the-art approaches on four challenging GZSL datasets, including CUB, FLO, AWA1, and SUN. Furthermore, the existing UA and our proposed DA are complementary and can be combined to enhance performance further.

论文关键词:Generalized zero-shot learning,Deep attribute,Discriminative latent attribute

论文评审过程:Received 30 May 2020, Revised 26 August 2021, Accepted 15 November 2021, Available online 19 November 2021, Version of Record 6 December 2021.

论文官网地址:https://doi.org/10.1016/j.patcog.2021.108435