Density decay graph-based density peak clustering

作者:

Highlights:

摘要

In 2014, Rodriguez and Laio proposed a famous clustering algorithm based on a fast search and find density peaks dubbed as DPC (Rodriguez and Laio, 2014). DPC has been widely used in many fields because of its simplicity and effectiveness. However, DPC has two obvious drawbacks. First, initial cluster centers must be manually selected from the decision graph, which is extremely difficult and even impossible in some cases. Second, the result of DPC is affected by the so called “chain reaction” problem. To overcome these two drawbacks, we propose a density decay graph-based density peak clustering (DGDPC) algorithm. The key to DGDPC is the concept of density decay graph, which is inspired by the decay phenomenon (a common phenomenon in nature). In our algorithm, we form initial clusters according to density decay graphs and then merge clusters based on a simple method. As a result, there is no need to manually select cluster centers and the chain reaction problem is alleviated. Although DGDPC introduces an additional parameter m, m is robust and easy to determine in advance. Experiments on 10 synthetic datasets and 10 real-world datasets show that our algorithm outperforms DPC, DGB (Wu and Wilamowski, 2016), K-means (MacQueen et al., 1967), DBSCAN (Ester et al., 1996), and Single-link (Murtagh and Contreras, 2012) in most cases.

论文关键词:Clustering algorithm,Density peaks,Density Decay Graph,Decay effect

论文评审过程:Received 20 December 2020, Revised 21 March 2021, Accepted 20 April 2021, Available online 22 April 2021, Version of Record 27 April 2021.

论文官网地址:https://doi.org/10.1016/j.knosys.2021.107075