Performance guarantees for hierarchical clustering

作者:

Highlights:

摘要

We show that for any data set in any metric space, it is possible to construct a hierarchical clustering with the guarantee that for every k, the induced k-clustering has cost at most eight times that of the optimal k-clustering. Here the cost of a clustering is taken to be the maximum radius of its clusters. Our algorithm is similar in simplicity and efficiency to popular agglomerative heuristics for hierarchical clustering, and we show that these heuristics have unbounded approximation factors.

论文关键词:Hierarchical clustering,Complete linkage,k-Center

论文评审过程:Received 11 February 2003, Revised 5 December 2003, Available online 30 November 2004.

论文官网地址:https://doi.org/10.1016/j.jcss.2004.10.006