Finding centroid clusterings with entropy-based criteria

作者:Tianming Hu, Sam Yuan Sung

摘要

We investigate the following problem: Given a set of candidate clusterings for a common set of objects, find a centroid clustering that is most compatible to the input set. First, we propose a series of entropy-based distance functions for comparing various clusterings. Such functions enable us to directly select the local centroid from the candidate set. Second, we present two combining methods for the global centroid. The selected/combined centroid clustering is likely to be a good choice, i.e., top or middle ranked in terms of closeness to the true clustering. Finally, we evaluate their effectiveness on both artificial and real data sets.

论文关键词:Cluster analysis, Centroid clustering, Consensus clustering, Conditional entropy, Metric distance function

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10115-006-0017-7