A k-populations algorithm for clustering categorical data

作者:

Highlights:

摘要

In this paper, the conventional k-modes-type algorithms for clustering categorical data are extended by representing the clusters of categorical data with k-populations instead of the hard-type centroids used in the conventional algorithms. Use of a population-based centroid representation makes it possible to preserve the uncertainty inherent in data sets as long as possible before actual decisions are made. The k-populations algorithm was found to give markedly better clustering results through various experiments.

论文关键词:Clustering,Categorical data,Hierarchical algorithm,k-Modes algorithm,Fuzzy k-modes algorithm

论文评审过程:Received 13 October 2004, Accepted 1 November 2004, Available online 1 February 2005.

论文官网地址:https://doi.org/10.1016/j.patcog.2004.11.017