Multi-class and feature selection extensions of Roughly Balanced Bagging for imbalanced data
作者:Mateusz Lango, Jerzy Stefanowski
摘要
Roughly Balanced Bagging is one of the most efficient ensembles specialized for class imbalanced data. In this paper, we study its basic properties that may influence its good classification performance. We experimentally analyze them with respect to bootstrap construction, deciding on the number of component classifiers, their diversity, and ability to deal with the most difficult types of the minority examples. Then, we introduce two generalizations of this ensemble for dealing with a higher number of attributes and for adapting it to handle multiple minority classes. Experiments with synthetic and real life data confirm usefulness of both proposals.
论文关键词:Class imbalance, Roughly balanced bagging, Types of minority examples, Feature selection, Multiple imbalanced classes
论文评审过程:
论文官网地址:https://doi.org/10.1007/s10844-017-0446-7