Modified global k-means algorithm for minimum sum-of-squares clustering problems

作者:

Highlights:

摘要

k-Means algorithm and its variations are known to be fast clustering algorithms. However, they are sensitive to the choice of starting points and inefficient for solving clustering problems in large data sets. Recently, a new version of the k-means algorithm, the global k-means algorithm has been developed. It is an incremental algorithm that dynamically adds one cluster center at a time and uses each data point as a candidate for the k-th cluster center. Results of numerical experiments show that the global k-means algorithm considerably outperforms the k-means algorithms. In this paper, a new version of the global k-means algorithm is proposed. A starting point for the k-th cluster center in this algorithm is computed by minimizing an auxiliary cluster function. Results of numerical experiments on 14 data sets demonstrate the superiority of the new algorithm, however, it requires more computational time than the global k-means algorithm.

论文关键词:Minimum sum-of-squares clustering,Nonsmooth optimization,k-Means algorithm,Global k-means algorithm

论文评审过程:Received 22 December 2005, Revised 20 August 2007, Accepted 4 April 2008, Available online 11 April 2008.

论文官网地址:https://doi.org/10.1016/j.patcog.2008.04.004