Clustering aggregation by probability accumulation

作者:

Highlights:

摘要

Since a large number of clustering algorithms exist, aggregating different clustered partitions into a single consolidated one to obtain better results has become an important problem. In Fred and Jain's evidence accumulation algorithm, they construct a co-association matrix on original partition labels, and then apply minimum spanning tree to this matrix for the combined clustering. In this paper, we will propose a novel clustering aggregation scheme, probability accumulation. In this algorithm, the construction of correlation matrices takes the cluster sizes of original clusterings into consideration. An alternate improved algorithm with additional pre- and post-processing is also proposed. Experimental results on both synthetic and real data-sets show that the proposed algorithms perform better than evidence accumulation, as well as some other methods.

论文关键词:Clustering aggregation,Evidence accumulation,Probability accumulation

论文评审过程:Received 11 May 2007, Revised 15 September 2008, Accepted 21 September 2008, Available online 8 October 2008.

论文官网地址:https://doi.org/10.1016/j.patcog.2008.09.013