Fast multiscale clustering and manifold identification

作者:

Highlights:

摘要

We present a novel multiscale clustering algorithm inspired by algebraic multigrid techniques. Our method begins with assembling data points according to local similarities. It uses an aggregation process to obtain reliable scale-dependent global properties, which arise from the local similarities. As the aggregation process proceeds, these global properties affect the formation of coherent clusters. The global features that can be utilized are for example density, shape, intrinsic dimensionality and orientation. The last three features are a part of the manifold identification process which is performed in parallel to the clustering process. The algorithm detects clusters that are distinguished by their multiscale nature, separates between clusters with different densities, and identifies and resolves intersections between clusters. The algorithm is tested on synthetic and real data sets, its running time complexity is linear in the size of the data set.

论文关键词:Algebraic multigrid (AMG),Aggregation,Graph partitioning,Similarity-based clustering,Manifold,Data analysis,Astrophysical models

论文评审过程:Received 9 February 2006, Accepted 6 April 2006, Available online 21 June 2006.

论文官网地址:https://doi.org/10.1016/j.patcog.2006.04.007