Experiencing SAX: a novel symbolic representation of time series

作者:Jessica Lin, Eamonn Keogh, Li Wei, Stefano Lonardi

摘要

Many high level representations of time series have been proposed for data mining, including Fourier transforms, wavelets, eigenwaves, piecewise polynomial models, etc. Many researchers have also considered symbolic representations of time series, noting that such representations would potentiality allow researchers to avail of the wealth of data structures and algorithms from the text processing and bioinformatics communities. While many symbolic representations of time series have been introduced over the past decades, they all suffer from two fatal flaws. First, the dimensionality of the symbolic representation is the same as the original data, and virtually all data mining algorithms scale poorly with dimensionality. Second, although distance measures can be defined on the symbolic approaches, these distance measures have little correlation with distance measures defined on the original time series.

论文关键词:Time series, Data mining, Symbolic representation, Discretize

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10618-007-0064-z