Combining review-based collaborative filtering and matrix factorization: A solution to rating's sparsity problem

作者:

Highlights:

• Collaborative filtering suffers from the sparsity issue.

• The proposed method helps solve the overfitting problem of matrix factorization when rating data is extremely sparse.

• The Review-based Matrix Factorization framework gives a clear path for using review text in collaborative filtering.

• The proposed solution solves the sparsity problem from both model and data perspectives.

摘要

An important factor affecting the performance of collaborative filtering for recommendation systems is the sparsity of the rating matrix caused by insufficient rating data. Improving the recommendation model and introducing side information are two main research approaches to address the problem. We combine these two approaches and propose the Review-Based Matrix Factorization method in this paper. The method consists of two phases. The first phase is review-based collaborative filtering, where an item-topic rating matrix is constructed by the feature-level opinion mining of online review text. This rating matrix is used to derive item similarities, which can be used to infer unknown users' ratings of the items. The second phase consists of rating imputation, where we first fill some of the empty elements of the user-item rating matrix, then conduct matrix factorization to learn the latent user and item factors to generate recommendations. Experiments on two actual datasets show that our method improves the accuracy of recommendation compared with similar algorithms.

论文关键词:Collaborative filtering,Sparsity problem,Online reviews,Matrix factorization,Rating imputation

论文评审过程:Received 7 September 2021, Revised 30 December 2021, Accepted 28 January 2022, Available online 3 February 2022, Version of Record 20 March 2022.

论文官网地址:https://doi.org/10.1016/j.dss.2022.113748