Soft subspace clustering of interval-valued data with regularizations

作者:

Highlights:

摘要

Data analysis plays an indispensable role in understanding different phenomena. One of the vital means of handling these data is to group them into a set of clusters given a measure of similarity. Usually, clustering methods deal with objects described by single-valued variables. Nevertheless, this representation is too restrictive for representing complex data, such as lists, histograms, or even intervals. Furthermore, in some problems, many dimensions are irrelevant and can mask existing clusters. In this regard, new interval-valued data clustering methods with regularizations and adaptive distances are proposed. These approaches consider that the boundaries of the interval-valued variables have the same and different importance for the clustering process. The algorithms optimize an objective function alternating three steps for obtaining the representatives of each group, a fuzzy partition, and the relevance weights of the variables. Experiments on synthetic and real data sets corroborate the robustness and usefulness of the proposed methods.

论文关键词:Fuzzy clustering,Adaptive distances,Interval-valued data analysis,Regularization clustering

论文评审过程:Received 17 February 2021, Revised 28 April 2021, Accepted 2 June 2021, Available online 10 June 2021, Version of Record 10 June 2021.

论文官网地址:https://doi.org/10.1016/j.knosys.2021.107191