Maximum certainty data partitioning

作者:

Highlights:

摘要

Problems in data analysis often require the unsupervised partitioning of a dataset into clusters. Many methods exist for such partitioning but most have the weakness of being model-based (most assuming hyper-ellipsoidal clusters) or computationally infeasible in anything more than a three-dimensional data space. We re-consider the notion of cluster analysis in information-theoretic terms and show that minimisation of partition entropy can be used to estimate the number and structure of probable data generators.

论文关键词:Cluster analysis,Data partitioning,Information theory

论文评审过程:Received 5 August 1998, Accepted 18 March 1999, Available online 7 June 2001.

论文官网地址:https://doi.org/10.1016/S0031-3203(99)00086-2