Pattern-based time-series subsequence clustering using radial distribution functions

作者:Anne M. Denton, Christopher A. Besemann, Dietmar H. Dorr

摘要

Clustering of time series subsequence data commonly produces results that are unspecific to the data set. This paper introduces a clustering algorithm, that creates clusters exclusively from those subsequences that occur more frequently in a data set than would be expected by random chance. As such, it partially adopts a pattern mining perspective into clustering. When subsequences are being labeled based on such clusters, they may remain without label. In fact, if the clustering was done on an unrelated time series it is expected that the subsequences should not receive a label. We show that pattern-based clusters are indeed specific to the data set for 7 out of 10 real-world sets we tested, and for window-lengths up to 128 time points. While kernel-density-based clustering can be used to find clusters with similar properties for window sizes of 8–16 time points, its performance degrades fast for increasing window sizes.

论文关键词:Density-based clustering, Time series subsequence clustering, Clustering noisy data, Noise elimination, Time series labeling

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10115-008-0125-7