Automatic generic document summarization based on non-negative matrix factorization

作者:

Highlights:

摘要

In existing unsupervised methods, Latent Semantic Analysis (LSA) is used for sentence selection. However, the obtained results are less meaningful, because singular vectors are used as the bases for sentence selection from given documents, and singular vector components can have negative values. We propose a new unsupervised method using Non-negative Matrix Factorization (NMF) to select sentences for automatic generic document summarization. The proposed method uses non-negative constraints, which are more similar to the human cognition process. As a result, the method selects more meaningful sentences for generic document summarization than those selected using LSA.

论文关键词:Generic summarization,NMF,LSA,Semantic feature,Semantic variable

论文评审过程:Received 20 August 2007, Revised 11 February 2008, Accepted 13 June 2008, Available online 8 August 2008.

论文官网地址:https://doi.org/10.1016/j.ipm.2008.06.002