Finding the most unusual time series subsequence: algorithms and applications

作者:Eamonn Keogh, Jessica Lin, Sang-Hee Lee, Helga Van Herle

摘要

In this work we introduce the new problem of finding time seriesdiscords. Time series discords are subsequences of longer time series that are maximally different to all the rest of the time series subsequences. They thus capture the sense of the most unusual subsequence within a time series. While discords have many uses for data mining, they are particularly attractive as anomaly detectors because they only require one intuitive parameter (the length of the subsequence) unlike most anomaly detection algorithms that typically require many parameters. While the brute force algorithm to discover time series discords is quadratic in the length of the time series, we show a simple algorithm that is three to four orders of magnitude faster than brute force, while guaranteed to produce identical results. We evaluate our work with a comprehensive set of experiments on diverse data sources including electrocardiograms, space telemetry, respiration physiology, anthropological and video datasets.

论文关键词:Time series data mining, Anomaly detection, Clustering

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10115-006-0034-6