Leveraging unstructured call log data for customer churn prediction

作者:

Highlights:

摘要

Customer retention is important in the financial services industry. Machine learning has been incorporated into customer data analytics to predict client churn risks. Despite its success, existing approaches primarily use only structured data, e.g., demographics and account history. Data mining with unstructured data, e.g., customer interaction, can reveal more insights, which has not been adequately leveraged. In this research, we propose a customer churn prediction model utilizing the unstructured data, which is the spoken contents in phone communication. We collected a large-scale call center dataset with two million calls from more than two hundred thousand customers and conducted extensive experiments. The results show that our model can accurately predict the client churn risks and generate meaningful insights using interpretable machine learning with personality traits and customer segments. We discuss how these insights can help managers develop retention strategies customized for different customer segments.

论文关键词:Customer retention,Churn prediction,Personality mining,Call log analysis,Interpretable machine learning

论文评审过程:Received 12 March 2020, Revised 7 September 2020, Accepted 30 October 2020, Available online 4 December 2020, Version of Record 9 December 2020.

论文官网地址:https://doi.org/10.1016/j.knosys.2020.106586