Cluster sequence mining from event sequence data and its application to damage correlation analysis

作者:

Highlights:

摘要

We propose a novel mining algorithm called cluster sequence mining (CSM) to extract cluster pairs with occurrence correlation from event sequence data. CSM extracts patterns with a pair of clusters that satisfies space proximity of the individual clusters and temporal proximity between events from different clusters in time intervals. CSM extends a unique co-occurring cluster mining (CCM) algorithm by considering the order of event occurrences and distribution of time intervals. The probability density of time intervals is inferred using Bayesian inference for robustness against uncertainty. To improve inference accuracy of the density function of time intervals, we utilize the idea of dynamic programming (DP) matching to obtain the correspondence between multiple event occurrences. With an experiment using synthetic data, we confirm that CSM is capable of extracting clusters with a high F-measure and low estimation error of the time interval distribution even under uncertainty. In addition, we find that DP matching can improve the inference accuracy of the density function of time intervals. Finally, CSM is applied to a real-world acoustic emission event sequence data set to evaluate damage interactions in a fuel cell.

论文关键词:Pattern mining,Occurrence correlation,Bayesian inference,Fuel cell

论文评审过程:Received 7 September 2018, Revised 7 May 2019, Accepted 8 May 2019, Available online 13 May 2019, Version of Record 12 June 2019.

论文官网地址:https://doi.org/10.1016/j.knosys.2019.05.012