Real-time and automatic two-class clustering by analytical formulas

Abstract

Several feature-preserving two-class clustering methods are investigated in this paper. By preserving certain features of the input data, formulas for calculating the two class representatives and their population percentages are derived. The derived formulas are expressed in general forms suitable for any dimensionality higher than two. For a data set of size N, the complexity of every investigated method is of order N, so the methods are much faster than any clustering method that uses an N × N dissimilarity matrix. Additionally, none of the investigated methods requires an initial guess. Experimental results compare the four investigated methods, and only two of them are recommended. Further comparisons with the k-means method and hierarchical clustering methods are also included. The proposed feature-preserving approach was found to be fast, automatic, and suitable for any field requiring fast high-dimensional two-class clustering.
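The abstract does not reproduce the paper's formulas, so the following is only an illustrative sketch of the general idea, not the authors' method. It uses the classical one-dimensional moment-preserving technique as a stand-in: the first three sample moments are the preserved features, and the two representatives and their population fractions follow in closed form after a single O(N) pass, with no initial guess. The function name `moment_preserving_two_class` and its return layout are assumptions made for this example.

```python
import math


def moment_preserving_two_class(xs):
    """Illustrative 1-D sketch (an assumption, not the paper's formulas).

    Finds representatives z0 <= z1 and fractions p0, p1 such that
        p0 + p1 = 1  and  p0*z0**k + p1*z1**k = m_k  for k = 1, 2, 3,
    where m_k is the k-th sample moment of xs.
    """
    n = len(xs)
    if n == 0:
        raise ValueError("empty input")

    # Single pass over the data: accumulate the first three raw moments.
    s1 = s2 = s3 = 0.0
    for x in xs:
        s1 += x
        s2 += x * x
        s3 += x * x * x
    m1, m2, m3 = s1 / n, s2 / n, s3 / n

    # z0 and z1 are the roots of z^2 + c1*z + c0 = 0, where c0 and c1 solve
    #   c0      + c1 * m1 = -m2
    #   c0 * m1 + c1 * m2 = -m3
    cd = m2 - m1 * m1  # sample variance; zero means all values are equal
    if cd <= 0.0:
        raise ValueError("data has zero variance; two classes are undefined")
    c0 = (m1 * m3 - m2 * m2) / cd
    c1 = (m1 * m2 - m3) / cd

    # Clamp tiny negative values caused by floating-point rounding.
    root = math.sqrt(max(c1 * c1 - 4.0 * c0, 0.0))
    z0 = 0.5 * (-c1 - root)
    z1 = 0.5 * (-c1 + root)
    if z1 == z0:
        raise ValueError("degenerate solution; representatives coincide")

    # Preserving the mean fixes the population fractions.
    p0 = (z1 - m1) / (z1 - z0)
    return z0, z1, p0, 1.0 - p0


if __name__ == "__main__":
    z0, z1, p0, p1 = moment_preserving_two_class([0.0, 0.0, 0.0, 10.0])
    print(z0, z1, p0, p1)
```

In this sketch the cost is dominated by the single accumulation loop, which is what makes an order-N, guess-free procedure possible. The paper's own formulas are stated for higher-dimensional data and may preserve different features.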

Keywords: Two-class clustering, High-dimensional space, Feature-preserving, Analytical fast clustering, General form, k-means, Hierarchical agglomerative clustering

Review history: Received 5 October 1995, Revised 28 February 1996, Available online 7 June 2001.

Paper URL: https://doi.org/10.1016/0031-3203(96)00033-7