Inverse random under sampling for class imbalance problem and its application to multi-label classification

作者:

Highlights:

摘要

In this paper, a novel inverse random under sampling (IRUS) method is proposed for the class imbalance problem. The main idea is to severely under sample the majority class thus creating a large number of distinct training sets. For each training set we then find a decision boundary which separates the minority class from the majority class. By combining the multiple designs through fusion, we construct a composite boundary between the majority class and the minority class. The proposed methodology is applied on 22 UCI data sets and experimental results indicate a significant increase in performance when compared with many existing class-imbalance learning methods. We also present promising results for multi-label classification, a challenging research problem in many modern applications such as music, text and image categorization.

论文关键词:Class imbalance problem,Multi-label classification,Inverse random under sampling

论文评审过程:Received 21 October 2010, Revised 8 March 2012, Accepted 21 March 2012, Available online 13 April 2012.

论文官网地址:https://doi.org/10.1016/j.patcog.2012.03.014