SDE: A Novel Selective, Discriminative and Equalizing Feature Representation for Visual Recognition

作者:Guo-Sen Xie, Xu-Yao Zhang, Shuicheng Yan, Cheng-Lin Liu

摘要

Bag of Words (BoW) model and Convolutional Neural Network (CNN) are two milestones in visual recognition. Both BoW and CNN require a feature pooling operation for constructing the frameworks. Particularly, the max-pooling has been validated as an efficient and effective pooling method compared with other methods such as average pooling and stochastic pooling. In this paper, we first evaluate different pooling methods, and then propose a new feature pooling method termed as selective, discriminative and equalizing pooling (SDE). The SDE representation is a feature learning mechanism by jointly optimizing the pooled representations with the target of learning more selective, discriminative and equalizing features. We use bilevel optimization to solve the joint optimization problem. Experiments on seven benchmark datasets (including both single-label and multi-label ones) well validate the effectiveness of our framework. Particularly, we achieve the state-of-the-art fused results (mAP) of 93.21 and 93.97% on the PASCAL VOC2007 and VOC2012 datasets, respectively.

论文关键词:Convolutional Neural Network, Feature learning, Pooling, Bag of Words, Bilevel optimization

论文评审过程:

论文官网地址:https://doi.org/10.1007/s11263-017-1007-9