Unsupervised tip-mining from customer reviews

作者:

Highlights:

• A completely unsupervised algorithm, which we refer to as TipSelector, for automated tip-mining from customer reviews

• A method for automatically evaluating the informativeness of a given set of tips without humans in the loop

• A user study design that evaluates both the usefulness and novelty of the tips returned by a tip-mining algorithm

摘要

In recent years, large review-hosting platforms have extended their functionality to allow their users to submit tips: short pieces of text that deliver valuable insight on a specific aspect of the reviewed business. These tips are meant to serve as a concise source of information that complements the often overwhelming number of customer reviews. Recent work has tackled the problem of automatically generating tips by mining review text. The motivation for this effort is to obtain tips for businesses or business aspects that have been overlooked by users. Another motivating factor is the quality of the user-submitted tips, which often provide trivial or redundant information. Existing tip-mining methods are limited by a reliance on training data, which is unlikely to be available and is also very costly to create for different domains. In this work, we present TipSelector, a completely unsupervised algorithm that delivers high quality-tips without the need for annotated training data. We verify the efficacy of TipSelector via an evaluation that includes real data from the hospitality industry and comparisons with the state-of-the-art. A secondary contribution of our work is a method for automatically evaluating tip-mining algorithms without humans in the loop. As we demonstrate in our experiments, this method can be used to enable large-scale evaluations and complement the user studies that are typically used for this purpose.

论文关键词:Online reputation,Reviews,Unsupervised learning,Tips

论文评审过程:Received 3 October 2017, Revised 30 December 2017, Accepted 30 January 2018, Available online 6 February 2018, Version of Record 6 March 2018.

论文官网地址:https://doi.org/10.1016/j.dss.2018.01.011