Parametric and non-parametric unsupervised cluster analysis

作者:

Highlights:

摘要

Much work has been published on methods for assessing the probable number of clusters or structures within unknown data sets. This paper aims to look in more detail at two methods, a broad parametric method, based around the assumption of Gaussian clusters and the other a non-parametric method which utilises methods of scale-space filtering to extract robust structures within a data set. It is shown that, whilst both methods are capable of determining cluster validity for data sets in which clusters tend towards a multivariate Gaussian distribution, the parametric method inevitably fails for clusters which have a non-Gaussian structure whilst the scale-space method is more robust.

论文关键词:Cluster analysis,Maximum likelihood methods,Scale-space filtering,Probability density estimation

论文评审过程:Received 12 October 1995, Revised 10 May 1996, Available online 7 June 2001.

论文官网地址:https://doi.org/10.1016/S0031-3203(96)00079-9