Collective behavior learning by differentiating personal preference from peer influence

作者:

Highlights:

摘要

Networked data, generated by social media, presents opportunities and challenges to the study of collective behaviors in a social networking environment. In this paper, we focus on multi-label classification on networked data, for which behaviors are represented as labels and an individual can have multiple labels. Existing relational learning methods exploit the connectivity of individuals and they have shown better performance than traditional multi-label classification methods. However, an individual’s behavior may be influenced by other factors, particularly personal preference. Hence, we propose a novel approach that integrates causal analysis into multi-label classification to learn collective behaviors. We employ propensity score matching and causal effect estimation to distinguish the contributions of peer influence and personal preference to collective behaviors and incorporate the findings into the design of the classifier. We further study behavior heterogeneity across subgroups in social networks, as people with different demographic features may behave differently due to different impacts of peer influence and personal preference. We estimate conditional average causal effects to analyze the impacts of peer influence and personal preference in different subgroups in social networks. Experiments on real-world datasets demonstrate that our proposed methods improve classification performance over existing methods.

论文关键词:Classification with networked data,Causal analysis,Collective behavior,Propensity score

论文评审过程:Received 18 June 2017, Revised 11 June 2018, Accepted 14 June 2018, Available online 31 July 2018, Version of Record 10 September 2018.

论文官网地址:https://doi.org/10.1016/j.knosys.2018.06.027