Semi-supervised consensus clustering based on closed patterns

作者:

Highlights:

摘要

Semi-supervised consensus clustering, also called semi-supervised ensemble clustering, is a recently emerged technique that integrates prior knowledge into consensus clustering in order to improve the quality of the clustering result. In this article, we propose a novel semi-supervised consensus clustering algorithm extending the previous work on the MultiCons multiple consensus clustering approach. By using closed pattern mining technique, the proposed Semi-MultiCons algorithm manages to generate a recommended consensus solution with a relevant inferred number of clusters k based on ensemble members with different k and pairwise constraints. Compared with other semi-supervised and/or consensus clustering approaches, Semi-MultiCons does not require the number of generated clusters k as an input parameter, and is able to alleviate the widely reported negative effect related to the integration of constraints into clustering. The experimental results demonstrate that the proposed method outperforms state of the art semi-supervised consensus clustering algorithms.

论文关键词:Semi-supervised learning,Semi-supervised clustering,Semi-supervised consensus clustering,Semi-supervised ensemble clustering,Frequent closed itemsets,Closed patterns

论文评审过程:Received 1 May 2021, Revised 7 September 2021, Accepted 10 October 2021, Available online 26 October 2021, Version of Record 11 November 2021.

论文官网地址:https://doi.org/10.1016/j.knosys.2021.107599