A hybrid particle swarm optimization approach for clustering and classification of datasets

作者:

Highlights:

摘要

This paper introduces a new hybrid cluster validity method based on particle swarm optimization, for successfully solving one of the most popular clustering/classifying complex datasets problems. The proposed method for the solution of the clustering/classifying problem, designated as PSORS index method, combines a particle swarm optimization (PSO) algorithm, Rough Set (RS) theory and a modified form of the Huang index function. In contrast to the Huang index method which simply assigns a constant number of clusters to each attribute, this method could cluster the values of the individual attributes within the dataset and achieves both the optimal number of clusters and the optimal classification accuracy. The validity of the proposed approach is investigated by comparing the classification results obtained for a real-world dataset with those obtained by pseudo-supervised classification BPNN, decision-tree and Huang index methods. There is good evidence to show that the proposed PSORS index method not only has a superior clustering accomplishment than the considered methods, but also achieves better classification accuracy.

论文关键词:Particle swarm optimization,Rough Set,PSORS index method,Classification,Pseudo-supervised classification method

论文评审过程:Received 22 July 2010, Revised 2 December 2010, Accepted 4 December 2010, Available online 9 December 2010.

论文官网地址:https://doi.org/10.1016/j.knosys.2010.12.003