Multi-class and feature selection extensions of Roughly Balanced Bagging for imbalanced data

作者:Mateusz Lango, Jerzy Stefanowski

摘要

Roughly Balanced Bagging is one of the most efficient ensembles specialized for class imbalanced data. In this paper, we study its basic properties that may influence its good classification performance. We experimentally analyze them with respect to bootstrap construction, deciding on the number of component classifiers, their diversity, and ability to deal with the most difficult types of the minority examples. Then, we introduce two generalizations of this ensemble for dealing with a higher number of attributes and for adapting it to handle multiple minority classes. Experiments with synthetic and real life data confirm usefulness of both proposals.

论文关键词:Class imbalance, Roughly balanced bagging, Types of minority examples, Feature selection, Multiple imbalanced classes

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10844-017-0446-7