Using a contextual entropy model to expand emotion words and their intensity for the sentiment classification of stock market news

作者:

Highlights:

摘要

Sentiment classification of stock market news involves identifying positive and negative news articles, and is an emerging technique for making stock trend predictions which can facilitate investor decision making. In this paper, we propose the presence and intensity of emotion words as features to classify the sentiment of stock market news articles. To identify such words and their intensity, a contextual entropy model is developed to expand a set of seed words generated from a small corpus of stock market news articles with sentiment annotation. The contextual entropy model measures the similarity between two words by comparing their contextual distributions using an entropy measure, allowing for the discovery of words similar to the seed words. Experimental results show that the proposed method can discover more useful emotion words and their corresponding intensity, thus improving classification performance. Performance was further improved by the incorporation of intensity into the classification, and the proposed method outperformed the previously-proposed pointwise mutual information (PMI)-based expansion methods.

论文关键词:Sentiment classification,Contextual entropy model,Word expansion,Natural language processing,Stock market news

论文评审过程:Received 7 August 2012, Revised 30 December 2012, Accepted 2 January 2013, Available online 10 January 2013.

论文官网地址:https://doi.org/10.1016/j.knosys.2013.01.001