Automatic clustering using genetic algorithms

作者:

Highlights:

摘要

In face of the clustering problem, many clustering methods usually require the designer to provide the number of clusters as input. Unfortunately, the designer has no idea, in general, about this information beforehand. In this article, we develop a genetic algorithm based clustering method called automatic genetic clustering for unknown K (AGCUK). In the AGCUK algorithm, noising selection and division–absorption mutation are designed to keep a balance between selection pressure and population diversity. In addition, the Davies–Bouldin index is employed to measure the validity of clusters. Experimental results on artificial and real-life data sets are given to illustrate the effectiveness of the AGCUK algorithm in automatically evolving the number of clusters and providing the clustering partition.

论文关键词:Clustering,Genetic algorithms,Noising method,Davies–Bouldin index,K-means algorithm

论文评审过程:Available online 1 July 2011.

论文官网地址:https://doi.org/10.1016/j.amc.2011.06.007