A stability based validity method for fuzzy clustering

Abstract

An important goal in cluster analysis is the internal validation of results using an objective criterion. Of particular relevance in this respect is the estimation of the optimum number of clusters capturing the intrinsic structure of the data. This paper proposes a method to determine this optimum number based on the evaluation of fuzzy partition stability under bootstrap resampling. The method is first characterized on synthetic data with respect to hyper-parameters, such as the fuzzifier, and spatial clustering parameters, such as feature space dimensionality, degree of cluster overlap, and number of clusters. The method is then validated on experimental datasets. Furthermore, the performance of the proposed method is compared with that of a number of traditional fuzzy validity rules based on cluster compactness-to-separation criteria. The proposed method provides accurate and reliable results, and offers better generalization capabilities than the classical approaches.
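The general idea of scoring a candidate number of clusters by the stability of fuzzy partitions under bootstrap resampling can be illustrated with a minimal sketch. This is not the paper's index: the abstract does not give its definition, so the code below assumes a simple stand-in, the cosine similarity between co-membership matrices of a reference partition and partitions fitted on bootstrap resamples. The function names (`memberships`, `fuzzy_c_means`, `stability`), the synthetic data, and all parameter values are hypothetical choices made for illustration only.

```python
import numpy as np

def memberships(X, centers, m=2.0):
    """Fuzzy c-means membership of every point in X to every center."""
    d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
    d = np.fmax(d, 1e-12)                    # avoid division by zero
    inv = d ** (-2.0 / (m - 1.0))
    return inv / inv.sum(axis=1, keepdims=True)

def fuzzy_c_means(X, c, m=2.0, n_iter=100, tol=1e-5, rng=None):
    """Plain fuzzy c-means with fuzzifier m; returns (centers, memberships)."""
    rng = np.random.default_rng(rng)
    U = rng.dirichlet(np.ones(c), size=len(X))   # random rows summing to 1
    for _ in range(n_iter):
        W = U ** m
        centers = (W.T @ X) / W.sum(axis=0)[:, None]
        U_new = memberships(X, centers, m)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return centers, U

def stability(X, c, m=2.0, n_boot=20, seed=0):
    """Assumed stability score: mean cosine similarity between the
    co-membership matrix of a reference partition and those obtained
    from centers fitted on bootstrap resamples (not the paper's index)."""
    rng = np.random.default_rng(seed)
    _, U_ref = fuzzy_c_means(X, c, m, rng=rng)
    C_ref = U_ref @ U_ref.T                  # invariant to cluster relabeling
    scores = []
    for _ in range(n_boot):
        idx = rng.integers(0, len(X), size=len(X))   # sample with replacement
        centers, _ = fuzzy_c_means(X[idx], c, m, rng=rng)
        U_b = memberships(X, centers, m)     # extend partition to all points
        C_b = U_b @ U_b.T
        num = (C_ref * C_b).sum()
        scores.append(num / (np.linalg.norm(C_ref) * np.linalg.norm(C_b)))
    return float(np.mean(scores))

# Hypothetical synthetic data: three Gaussian blobs in two dimensions.
data_rng = np.random.default_rng(42)
X = np.vstack([data_rng.normal(loc, 0.5, size=(30, 2)) for loc in (0.0, 5.0, 10.0)])

# Score each candidate number of clusters.
scores = {c: stability(X, c, n_boot=10) for c in range(2, 6)}
for c, s in scores.items():
    print(f"c={c}: stability={s:.3f}")
```

In a sketch like this, one would typically inspect how the score varies with the candidate number of clusters and with the fuzzifier `m`, rather than trust a single maximum, since a stability score can also be high for partitions that are coarser than the true structure.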

Keywords: Fuzzy c-means, Cluster validity, Number of clusters, Cluster stability

Article history: Received 8 October 2008, Revised 25 September 2009, Accepted 2 October 2009, Available online 12 October 2009.

Official paper URL: https://doi.org/10.1016/j.patcog.2009.10.001