Biased unconstrained non-negative matrix factorization for clustering

作者:

Highlights:

摘要

Clustering remains a challenging research hotspot in data mining. Non-negative matrix factorization (NMF) is an effective technique for clustering, which aims to find the product of two non-negative low-dimensional matrices that approximates the original matrix. Since the matrices must satisfy the non-negative constraints, the Karush–Kuhn–Tucker conditions need to be used to obtain the update rules for the matrices, which limits the choice of update methods. Moreover, this method has no learning rate and the updating process is completely dependent on the data itself. In addition, the two low-dimensional matrices in NMF are randomly initialized, and the clustering performance of the model is reduced. To address these problems, this paper proposes a biased unconstrained non-negative matrix factorization (BUNMF) model, which integrates the norm and adds bias. Specifically, BUNMF uses a non-linear activation function to make elements of the matrices to remain non-negative, and converts the constrained problem into an unconstrained problem. The matrices are renewed by sequentially updating the matrices’ elements using stochastic gradient descent to obtain an update rule with a learning rate. Furthermore, the BUNMF model is constructed by three different activation functions and their iteration update algorithms are given through detailed reasoning. Finally, experimental results on eight public datasets show the effectiveness of the proposed model.

论文关键词:Non-negative matrix factorization,Unconstrained regularization,Stochastic gradient descent,Clustering

论文评审过程:Received 10 August 2021, Revised 18 November 2021, Accepted 20 December 2021, Available online 29 December 2021, Version of Record 12 January 2022.

论文官网地址:https://doi.org/10.1016/j.knosys.2021.108040