Data Set A is a Pattern Matching Problem

作者:Jens Kohlmorgen, Klaus-Robert Müller

摘要

Several data sets have been proposed for benchmarking in time series prediction. A popular one is Data Set A from the Santa Fe Competition. This data set was the subject of analysis in many papers. In this note, it is shown that predicting the continuation of Data Set A is nothing else than a pattern matching problem. Looking at studies of this data set, it is remarkable that most of the very good forecasts of Data Set A used upsampled training data. We explain why upsampling is crucial for this data set. Finally, it is demonstrated that simple pattern matching performs as good as sophisticated prediction methods on Data Set A.

论文关键词:benchmarking, pattern matching, Santa Fe Competition, time series prediction

论文评审过程:

论文官网地址:https://doi.org/10.1023/A:1009684621686