Active learning for sentiment analysis on data streams: Methodology and workflow implementation in the ClowdFlows platform

作者:

Highlights:

• We present a cloud based platform for data stream processing with workflows.

• The ClowdFlows platform enables processing of multiple concurrent data streams.

• We implement an active learning scenario for sentiment analysis on data streams.

• Machine learning methods are shown to be suitable for sentiment analysis.

• Active learning improves the accuracy of sentiment classification.

摘要

•We present a cloud based platform for data stream processing with workflows.•The ClowdFlows platform enables processing of multiple concurrent data streams.•We implement an active learning scenario for sentiment analysis on data streams.•Machine learning methods are shown to be suitable for sentiment analysis.•Active learning improves the accuracy of sentiment classification.

论文关键词:Active learning,Stream mining,Sentiment analysis,Stream-based active learning,Workflows,Data mining platform

论文评审过程:Received 30 October 2013, Revised 27 March 2014, Accepted 7 April 2014, Available online 30 April 2014.

论文官网地址:https://doi.org/10.1016/j.ipm.2014.04.001