Output Layer Multiplication for Class Imbalance Problem in Convolutional Neural Networks

作者：Zhao Yang, Yuanxin Zhu, Tie Liu, Sai Zhao, Yunyan Wang, Dapeng Tao

摘要

Convolutional neural networks (CNNs) have demonstrated remarkable performance in the field of computer vision. However, they are prone to suffer from the class imbalance problem, in which the number of some classes is significantly higher or lower than that of other classes. Commonly, there are two main strategies to handle the problem, including dataset-level methods via resampling and algorithmic-level methods by modifying the existing learning frameworks. However, most of these methods need extra data resampling or elaborate algorithm design. In this work we provide an effective but extremely simple approach to tackle the imbalance problem in CNNs with cross-entropy loss. Specifically, we multiply a coefficient \( \alpha > 1 \) to output of the last layer in a CNN model. With this modification, the final loss function can dynamically adjust the contributions of examples from different classes during the imbalanced training procedure. Because of its simplicity, the proposed method can be easily applied in the off-the-shelf models with little change. To prove the effectiveness on imbalance problem, we design three experiments on classification tasks of increasing complexity. The experimental results show that our approach could improve the convergence rate in the training stage and/or increase accuracy for test.

论文关键词：Convolutional neural networks, Imbalance learning, Output layer multiplication

论文评审过程：

论文官网地址：https://doi.org/10.1007/s11063-020-10366-w