Multi-view document clustering via ensemble method

作者:Syed Fawad Hussain, Muhammad Mushtaq, Zahid Halim

摘要

Multi-view clustering has become an important extension of ensemble clustering. In multi-view clustering, we apply clustering algorithms on different views of the data to obtain different cluster labels for the same set of objects. These results are then combined in such a manner that the final clustering gives better result than individual clustering of each multi-view data. Multi view clustering can be applied at various stages of the clustering paradigm. This paper proposes a novel multi-view clustering algorithm that combines different ensemble techniques. Our approach is based on computing different similarity matrices on the individual datasets and aggregates these to form a combined similarity matrix, which is then used to obtain the final clustering. We tested our approach on several datasets and perform a comparison with other state-of-the-art algorithms. Our results show that the proposed algorithm outperforms several other methods in terms of accuracy while maintaining the overall complexity of the individual approaches.

论文关键词:Multi-view clustering, Ensemble clustering, Affinity matrix, Similarity matrices

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10844-014-0307-6