Classification in the presence of class noise using a probabilistic Kernel Fisher method

作者:

Highlights:

摘要

In machine learning, class noise occurs frequently and deteriorates the classifier derived from the noisy data set. This paper presents two promising classifiers for this problem based on a probabilistic model proposed by Lawrence and Schölkopf (2001). The proposed algorithms are able to tolerate class noise, and extend the earlier work of Lawrence and Schölkopf in two ways. First, we present a novel incorporation of their probabilistic noise model in the Kernel Fisher discriminant; second, the distribution assumption previously made is relaxed in our work. The methods were investigated on simulated noisy data sets and a real world comparative genomic hybridization (CGH) data set. The results show that the proposed approaches substantially improve standard classifiers in noisy data sets, and achieve larger performance gain in non-Gaussian data sets and small size data sets.

论文关键词:Classification,Class noise,Labeling noise,Kernel Fisher discriminant

论文评审过程:Received 21 December 2006, Revised 1 May 2007, Accepted 4 May 2007, Available online 18 May 2007.

论文官网地址:https://doi.org/10.1016/j.patcog.2007.05.006