Using the One-vs-One decomposition to improve the performance of class noise filters via an aggregation strategy in multi-class classification problems

作者:

Highlights:

摘要

Noise filters are preprocessing techniques designed to improve data quality in classification tasks by detecting and eliminating examples that contain errors or noise. However, filtering can also remove correct examples and examples containing valuable information, which could be useful for learning. This fact usually implies a margin of improvement on the noise detection accuracy for almost any noise filter. This paper proposes a scheme to improve the performance of noise filters in multi-class classification problems, based on decomposing the dataset into multiple binary subproblems. Decomposition strategies have proven to be successful in improving classification performance in multi-class problems by generating simpler binary subproblems. Similarly, we adapt the principles of the One-vs-One decomposition strategy to noise filtering, making the noise identification process simpler. In order to integrate the filtering results achieved in the binary subproblems, our proposal uses a soft voting approach considering a reliability level based on the aggregation of the noise degree prediction calculated for each binary classifier. The experimental results show that the One-vs-One decomposition strategy usually increases the performance of the noise filters studied, which can detect more accurately the noisy examples.

论文关键词:Noisy data,Class noise,Noise filters,Decomposition strategies,Classification

论文评审过程:Received 12 March 2015, Revised 25 August 2015, Accepted 22 September 2015, Available online 2 October 2015, Version of Record 8 November 2015.

论文官网地址:https://doi.org/10.1016/j.knosys.2015.09.023