Enriching semantic knowledge bases for opinion mining in big data applications

作者：

Highlights：

•

摘要

This paper presents a novel method for contextualizing and enriching large semantic knowledge bases for opinion mining with a focus on Web intelligence platforms and other high-throughput big data applications. The method is not only applicable to traditional sentiment lexicons, but also to more comprehensive, multi-dimensional affective resources such as SenticNet. It comprises the following steps: (i) identify ambiguous sentiment terms, (ii) provide context information extracted from a domain-specific training corpus, and (iii) ground this contextual information to structured background knowledge sources such as ConceptNet and WordNet. A quantitative evaluation shows a significant improvement when using an enriched version of SenticNet for polarity classification. Crowdsourced gold standard data in conjunction with a qualitative evaluation sheds light on the strengths and weaknesses of the concept grounding, and on the quality of the enrichment process.

论文关键词：Web intelligence,Social Web,Big data,Knowledge extraction,Opinion mining,Sentiment analysis,Disambiguation,Contextualization,Common-sense knowledge,Concept grounding

论文评审过程：Available online 10 May 2014.

论文官网地址：https://doi.org/10.1016/j.knosys.2014.04.039