TA4L: Efficient temporal abstraction of multivariate time series

作者:

Highlights:

摘要

In this work, we introduce TA4L, a new efficient algorithm to transform multivariate time series into Lexicographical Symbolic Time Interval Sequences (LSTISs), that is, sequences ready to feed time-interval related pattern (TIRP) mining algorithms. The ultimate goal is to make explicit the embedded, ad-hoc pre-processes related to TIRP mining algorithms while offering an efficient solution for the required pre-processing. On the one hand, TA4L divides the signals into segments based on time duration (instead of the often-used practice based on the number of samples), which allows the construction of consistent time intervals. Concatenation of intervals is controlled by a maximum time gap constraint that reinforces the generated time intervals’ consistency. Moreover, different ways to parallelise the algorithm are explored that are accompanied by efficient data structures to speed up the pre-processing cost. TA4L has been experimentally evaluated with synthetic and real datasets, and the results show that TA4L requires significantly less computation time than other state-of-the-art approaches, revealing that it is an effective algorithm.

论文关键词:Multivariate time series,Temporal abstraction,Time interval sequences,Time interval related patterns

论文评审过程:Received 11 March 2021, Revised 15 February 2022, Accepted 9 March 2022, Available online 16 March 2022, Version of Record 21 March 2022.

论文官网地址:https://doi.org/10.1016/j.knosys.2022.108554