Cognitive-inspired domain adaptation of sentiment lexicons

作者:

Highlights:

摘要

Sentiment lexicons are essential tools for polarity classification and opinion mining. In contrast to machine learning methods that only leverage text features or raw text for sentiment analysis, methods that use sentiment lexicons embrace higher interpretability. Although a number of domain-specific sentiment lexicons are made available, it is impractical to build an ex ante lexicon that fully reflects the characteristics of the language usage in endless domains. In this article, we propose a novel approach to simultaneously train a vanilla sentiment classifier and adapt word polarities to the target domain. Specifically, we sequentially track the wrongly predicted sentences and use them as the supervision instead of addressing the gold standard as a whole to emulate the life-long cognitive process of lexicon learning. An exploration-exploitation mechanism is designed to trade off between searching for new sentiment words and updating the polarity score of one word. Experimental results on several popular datasets show that our approach significantly improves the sentiment classification performance for a variety of domains by means of improving the quality of sentiment lexicons. Case-studies also illustrate how polarity scores of the same words are discovered for different domains.

论文关键词:Sentiment lexicon,Domain adaptation,Exploration-exploitation,Word polarity,Knowledge engineering

论文评审过程:Received 26 July 2018, Revised 30 October 2018, Accepted 6 November 2018, Available online 9 January 2019, Version of Record 9 January 2019.

论文官网地址:https://doi.org/10.1016/j.ipm.2018.11.002