Process data properties matter: Introducing gated convolutional neural networks (GCNN) and key-value-predict attention networks (KVP) for next event prediction with deep learning

Authors:

Highlights:

• Gated convolutional neural networks (GCNN) and key-value-predict attention networks (KVP) enable next event prediction

• Networks are evaluated in a comprehensive study based on 11 real-life benchmark datasets

• GCNN and KVP outperform state-of-the-art approaches in 34 out of 44 dataset-metric combinations

• Process data properties such as sparsity, variation, and repetitiveness determine the suitability of a network

Abstract

Predicting next events in predictive process monitoring enables companies to manage and control processes at an early stage and reduce their action distance. In recent years, approaches have steadily moved from classical statistical methods towards the application of deep neural network architectures, which outperform the former and enable analysis without explicit knowledge of the underlying process model. While the focus of prior research was on the long short-term memory network architecture, other deep learning architectures offer promising extensions that have proven useful for other applications of sequential data. In our work, we introduce a gated convolutional neural network and a key-value-predict attention network to the task of next event prediction. In a comprehensive evaluation study on 11 real-life benchmark datasets, we show that these two novel architectures surpass prior work in 34 out of 44 metric-dataset combinations. For our evaluation, we consider the effects of process data properties, such as sparsity, variation, and repetitiveness, and discuss their impact on the prediction quality of the different deep learning architectures. Similarly, we evaluate their classification properties in terms of generalization and handling class imbalance. Our results provide guidance for researchers and practitioners alike on how to select, validate, and comprehensively benchmark (novel) predictive process monitoring models. In particular, we highlight the importance of sufficiently diverse process data properties in event logs and the comprehensive reporting of multiple performance indicators to achieve meaningful results.

Keywords: Process mining, Predictive process monitoring, Machine learning, Deep learning, Gated convolutional neural network, Key-value-predict attention network

Article history: Received 7 July 2020, Revised 8 January 2021, Accepted 8 January 2021, Available online 11 January 2021, Version of Record 21 February 2021.

Paper URL: https://doi.org/10.1016/j.dss.2021.113494