Prior class dissimilarity based linear neighborhood propagation

作者:

Highlights:

摘要

The insufficiency of labeled training data for representing the distribution of entire dataset is a major obstacle in various practical data mining applications. Semi-supervised learning algorithms, which attempt to learn from both labeled and unlabeled data, provide possibilities to solve this problem. Graph-based semi-supervised learning has recently become one of the most active research areas. In this paper, a novel graph-based semi-supervised learning approach entitled Class Dissimilarity based Linear Neighborhood Propagation (CD-LNP) is proposed, which assumes that each data point can be linearly reconstructed from its neighborhood. The neighborhood graph of the input data is constructed according to a certain kind of dissimilarity between data points, which is specially designed to integrate the class information. Our algorithm can propagate the labels from the labeled points to entire data set using these linear neighborhoods with sufficient smoothness. Experiment results demonstrate that our approach outperforms other popular graph-based semi-supervised learning methods.

论文关键词:Semi-supervised learning,Classification,Graph-based method,Linear neighborhood propagation,Prior information,Dissimilarity

论文评审过程:Received 14 July 2014, Revised 28 February 2015, Accepted 13 March 2015, Available online 23 March 2015.

论文官网地址:https://doi.org/10.1016/j.knosys.2015.03.011