A non-parametric semi-supervised discretization method

作者:Alexis Bondu, Marc Boullé, Vincent Lemaire

摘要

Semi-supervised classification methods aim to exploit labeled and unlabeled examples to train a predictive model. Most of these approaches make assumptions on the distribution of classes. This article first proposes a new semi-supervised discretization method, which adopts very low informative prior on data. This method discretizes the numerical domain of a continuous input variable, while keeping the information relative to the prediction of classes. Then, an in-depth comparison of this semi-supervised method with the original supervised MODL approach is presented. We demonstrate that the semi-supervised approach is asymptotically equivalent to the supervised approach, improved with a post-optimization of the intervals bounds location.

论文关键词:Bayesian, Semi-supervised, Discretization

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10115-009-0230-2