Antisocial online behavior detection using deep learning

作者：

Highlights：

• Comprehensive benchmark on deep learning regimes for AOB detection.

• Usage of transformer-based language models.

• Interpretability module to understand the model's logic.

• Interpretability module to detect unintended bias.

摘要

Digitalization shifts human communication to online platforms, which has many benefits but also builds up a space for antisocial online behavior (AOB) such as harassment, insult and other forms of hateful textual content. Online platforms have good reasons to monitor and moderate such content. The paper examines the viability of automatic content monitoring using deep machine learning and natural language processing (NLP). More specifically, we consolidate prior work in the field of antisocial online behavior detection and compare relevant approaches to recent NLP models in an empirical study. Covering important methodological advancements in NLP including bidirectional encoding, attention, hierarchical text representations, and pre-trained transformer-based language models, and extending previous approaches by introducing a pseudo-sentence hierarchical attention network, the paper provides a comprehensive summary of the state-of-affairs in NLP-based AOB detection, clarifies the detection accuracy that is attainable with today's technology, discusses whether this degree is sufficient for deploying deep learning-based text screening systems, and approaches the interpretability topic.

论文关键词：Antisocial online behavior,Natural language processing,Text classification,Deep learning,Cyberbullying,Attention mechanism

论文评审过程：Received 17 January 2020, Revised 1 June 2020, Accepted 7 July 2020, Available online 15 July 2020, Version of Record 25 September 2020.

论文官网地址：https://doi.org/10.1016/j.dss.2020.113362