Classifier chains for multi-label classification

作者:Jesse Read, Bernhard Pfahringer, Geoff Holmes, Eibe Frank

摘要

The widely known binary relevance method for multi-label classification, which considers each label as an independent binary problem, has often been overlooked in the literature due to the perceived inadequacy of not directly modelling label correlations. Most current methods invest considerable complexity to model interdependencies between labels. This paper shows that binary relevance-based methods have much to offer, and that high predictive performance can be obtained without impeding scalability to large datasets. We exemplify this with a novel classifier chains method that can model label correlations while maintaining acceptable computational complexity. We extend this approach further in an ensemble framework. An extensive empirical evaluation covers a broad range of multi-label datasets with a variety of evaluation metrics. The results illustrate the competitiveness of the chaining method against related and state-of-the-art methods, both in terms of predictive performance and time complexity.

论文关键词:Multi-label classification, Problem transformation, Ensemble methods, Scalable methods

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10994-011-5256-5