A tree-structured framework for purifying “complex” clusters with structural roles of individual data

作者:

Highlights:

摘要

How can we find a natural clustering of a “complex” dataset, which may contain an unknown number of overlapping clusters of arbitrary shape and be contaminated by noise? A tree-structured framework is proposed in this paper to purify such clusters by exploring the structural role of each data. In practice, each individual object within the internal organization of the data has its own specific role—“centroid”, hub or outlier—due to distinctive associations with their respective neighbors. Adjacent centroids always interact on each other and serve as mediate nodes of one tree being members of some cluster. Hubs closed to some centroid become leaf nodes responsible for the termination of the growth of trees. Outliers that weakly touch with any centroid are often discarded from any trees as global noise. All the data can thus be labeled by a specified criterion of “centroids”-connected structural consistency (CCSC). Free of domain-specific information, our framework with CCSC could widely adapt to many clustering-related applications. Theoretical and experimental contributions both confirm that our framework is easy to interpret and implement, efficient and effective in “complex” clustering.

论文关键词:Clustering,Image segmentation,k-neighborhood,Reverse k-neighborhood,Structural objects,Symmetric neighborhood

论文评审过程:Received 8 February 2009, Revised 20 May 2010, Accepted 21 May 2010, Available online 8 June 2010.

论文官网地址:https://doi.org/10.1016/j.patcog.2010.05.031