Orthogonal nonnegative matrix tri-factorization for co-clustering: Multiplicative updates on Stiefel manifolds

作者:

Highlights:

摘要

Matrix factorization-based methods become popular in dyadic data analysis, where a fundamental problem, for example, is to perform document clustering or co-clustering words and documents given a term-document matrix. Nonnegative matrix tri-factorization (NMTF) emerges as a promising tool for co-clustering, seeking a 3-factor decomposition X≈USV⊤ with all factor matrices restricted to be nonnegative, i.e., U⩾0,S⩾0,V⩾0. In this paper we develop multiplicative updates for orthogonal NMTF where X≈USV⊤ is pursued with orthogonality constraints, U⊤U=I, and V⊤V=I, exploiting true gradients on Stiefel manifolds. Experiments on various document data sets demonstrate that our method works well for document clustering and is useful in revealing polysemous words via co-clustering words and documents.

论文关键词:Co-clustering,Document clustering,Multiplicative updates,Nonnegative matrix factorization,Stiefel manifolds

论文评审过程:Received 1 October 2008, Revised 21 December 2009, Accepted 26 December 2009, Available online 27 January 2010.

论文官网地址:https://doi.org/10.1016/j.ipm.2009.12.007