Semisupervised learning from different information sources

作者:Tao Li, Mitsunori Ogihara

摘要

This paper studies the use of a semisupervised learning algorithm from different information sources. We first offer a theoretical explanation as to why minimising the disagreement between individual models could lead to the performance improvement. Based on the observation, this paper proposes a semisupervised learning approach that attempts to minimise this disagreement by employing a co-updating method and making use of both labeled and unlabeled data. Three experiments to test the effectiveness of the approach are presented in this paper: (i) webpage classification from both content and hyperlinks; (ii) functional classification of gene using gene expression data and phylogenetic data and (iii) machine self-maintaining from both sensory and image data. The results show the effectiveness and efficiency of our approach and suggest its application potentials.

论文关键词:Decision tree, Minimise disagreement, Semisupervised, Support vector machines, Unlabelled data

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10115-004-0155-8