FCM-Based Model Selection Algorithms for Determining the Number of Clusters

作者:

Highlights:

摘要

Clustering is an important research topic that has practical applications in many fields. It has been demonstrated that fuzzy clustering, using algorithms such as the fuzzy C-means (FCM), has clear advantages over crisp and probabilistic clustering methods. Like most clustering algorithms, however, FCM and its derivatives need the number of clusters in the given data set as one of their initializing parameters. The main goal of this paper is to develop an effective fuzzy algorithm for automatically determining the number of clusters. After a brief review of the relevant literature, we present a new algorithm for determining the number of clusters in a given data set and a new validity index for measuring the “goodness” of clustering. Experimental results and comparisons are given to illustrate the performance of the new algorithm.

论文关键词:Clustering,Fuzzy C-means,Validity index,Overlapping clusters

论文评审过程:Received 16 December 2002, Revised 29 March 2004, Accepted 29 March 2004, Available online 17 June 2004.

论文官网地址:https://doi.org/10.1016/j.patcog.2004.03.012