Improving classification performance using unlabeled data: Naive Bayesian case

作者:

Highlights:

摘要

In many applications, an enormous amount of unlabeled data is available with little cost. Therefore, it is natural to ask whether we can take advantage of these unlabeled data in classification learning. In this paper, we analyzed the role of unlabeled data in the context of naive Bayesian learning. Experimental results show that including unlabeled data as part of training data can significantly improve the performance of classification accuracy.

论文关键词:Machine learning,Semi-supervised learning,Naive Bayesian,Classification

论文评审过程:Received 11 July 2005, Accepted 3 May 2006, Available online 8 September 2006.

论文官网地址:https://doi.org/10.1016/j.knosys.2006.05.014