Clusterer ensemble

作者:

Highlights:

摘要

Ensemble methods that train multiple learners and then combine their predictions have been shown to be very effective in supervised learning. This paper explores ensemble methods for unsupervised learning. Here, an ensemble comprises multiple clusterers, each of which is trained by k-means algorithm with different initial points. The clusters discovered by different clusterers are aligned, i.e. similar clusters are assigned with the same label, by counting their overlapped data items. Then, four methods are developed to combine the aligned clusterers. Experiments show that clustering performance could be significantly improved by ensemble methods, where utilizing mutual information to select a subset of clusterers for weighted voting is a nice choice. Since the proposed methods work by analyzing the clustering results instead of the internal mechanisms of the component clusterers, they are applicable to diverse kinds of clustering algorithms.

论文关键词:Machine learning,Ensemble learning,Clustering,Unsupervised learning,Selective ensemble

论文评审过程:Received 7 October 2003, Accepted 10 December 2005, Available online 13 December 2005.

论文官网地址:https://doi.org/10.1016/j.knosys.2005.11.003