Automatic aspect discrimination in data clustering

作者:

Highlights:

摘要

The attributes describing a data set may often be arranged in meaningful subsets, each of which corresponds to a different aspect of the data. An unsupervised algorithm (SCAD) that simultaneously performs fuzzy clustering and aspects weighting was proposed in the literature. However, SCAD may fail and halt given certain conditions. To fix this problem, its steps are modified and then reordered to reduce the number of parameters required to be set by the user. In this paper we prove that each step of the resulting algorithm, named ASCAD, globally minimizes its cost-function with respect to the argument being optimized. The asymptotic analysis of ASCAD leads to a time complexity which is the same as that of fuzzy c-means. A hard version of the algorithm and a novel validity criterion that considers aspect weights in order to estimate the number of clusters are also described. The proposed method is assessed over several artificial and real data sets.

论文关键词:Clustering,Aspect discrimination,Attribute weighting,Cluster validation

论文评审过程:Received 12 May 2011, Revised 9 May 2012, Accepted 14 May 2012, Available online 23 May 2012.

论文官网地址:https://doi.org/10.1016/j.patcog.2012.05.011