PHA: A fast potential-based hierarchical agglomerative clustering method

Authors:

Highlights:

Abstract

A novel potential-based hierarchical agglomerative (PHA) clustering method is proposed. In this method, we first construct a hypothetical potential field from all the data points and show that this potential field is closely related to nonparametric estimation of the global probability density function of the data. Then, we propose a new similarity metric that combines the potential field, which represents global information about the data distribution, with the distance matrix, which represents local information. Finally, we develop an equivalent similarity metric based on an edge-weighted tree over all the data points, which leads to a fast agglomerative clustering algorithm with time complexity O(N^2). The proposed PHA method is evaluated by comparing it with six other typical agglomerative clustering methods on four synthetic data sets and two real data sets. Experiments show that it runs much faster than the other methods and produces the most satisfying results in most cases.
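The pipeline outlined in the abstract (potential field, edge-weighted tree, agglomeration) can be illustrated with a minimal sketch. The abstract does not give the formulas, so the details below are assumptions made for illustration only: the potential of a point is taken as a sum of -1/max(r, delta) over its distances r to the other points, each point is linked to its nearest neighbour of lower potential, and clusters are obtained by keeping only the shortest tree edges. The function name `pha_sketch` and the parameter `delta` are hypothetical and are not taken from the paper.

```python
import math


def pha_sketch(points, n_clusters, delta=1e-3):
    """Illustrative potential-based agglomerative clustering (assumed details)."""
    n = len(points)
    # Pairwise Euclidean distances: O(N^2) time and memory.
    dist = [[math.dist(p, q) for q in points] for p in points]

    # Assumed potential: a sum of -1/r terms, clipped at delta to avoid
    # division by zero. Dense regions receive low (very negative) potential.
    potential = [
        sum(-1.0 / max(dist[i][j], delta) for j in range(n) if j != i)
        for i in range(n)
    ]

    # Rank points from lowest to highest potential; ties broken by index.
    order = sorted(range(n), key=lambda i: potential[i])
    rank = {idx: r for r, idx in enumerate(order)}

    # Edge-weighted tree: every point except the lowest-potential one is
    # attached to its nearest point of lower rank; the weight is the distance.
    edges = []
    for i in order[1:]:
        parent = min(
            (j for j in range(n) if rank[j] < rank[i]),
            key=lambda j: dist[i][j],
        )
        edges.append((dist[i][parent], i, parent))

    # Agglomerate by merging along the shortest edges first (union-find),
    # stopping once n_clusters groups remain.
    root = list(range(n))

    def find(x):
        while root[x] != x:
            root[x] = root[root[x]]  # path halving
            x = root[x]
        return x

    edges.sort()
    for _, i, j in edges[: n - n_clusters]:
        root[find(i)] = find(j)

    # Relabel the components as consecutive integers.
    ids = {}
    return [ids.setdefault(find(i), len(ids)) for i in range(n)]


points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
labels = pha_sketch(points, n_clusters=2)
print(labels)
```

In this sketch, the distance matrix, the potentials, and the parent search each take O(N^2) time and the edge sort takes O(N log N), so the total cost matches the O(N^2) complexity stated in the abstract. On the toy input, the two well-separated groups of three points end up in different clusters.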

Keywords: Clustering, Algorithm, Pattern recognition, Potential field

Review history: Received 6 April 2012, Revised 27 October 2012, Accepted 15 November 2012, Available online 23 November 2012.

Official paper URL: https://doi.org/10.1016/j.patcog.2012.11.017