Clustering techniques: The user's dilemma

Authors:

Abstract

Numerous papers on clustering techniques and their applications in engineering, medical, and biological areas have appeared in pattern recognition literature during the past decade. This paper attempts to set some guidelines for a potential user of a clustering technique. We examine eight clustering programs which are representative of the various available techniques and compare their performances from several points of view. A formal comparative analysis is also performed with a portion of Munson's handprinted character data set. We believe that an understanding of the intrinsic characteristics of a clustering technique is essential to the intelligent application of the technique. Further, the output of a clustering program, along with whatever information a user may have about the data set, should be used together to form hypotheses about the structure of the data set.

Keywords: Clustering technique, Patterns, Features, Squared error, Distance measures, Dendrogram, Similarity matrix, Hierarchical clustering, Minimum spanning tree, Admissibility criteria

Article history: Received 24 October 1975, Available online 19 May 2003.

Official URL: https://doi.org/10.1016/0031-3203(76)90045-5