Data complexity meta-features for regression problems

Authors: Ana C. Lorena, Aron I. Maciel, Péricles B. C. de Miranda, Ivan G. Costa, Ricardo B. C. Prudêncio

Abstract

In meta-learning, classification problems can be described by a variety of features, including complexity measures. These measures capture the complexity of the frontier that separates the classes. For regression problems, on the other hand, such measures are lacking. This paper presents and analyses measures devoted to estimating the complexity of the function that should be fitted to the data in regression problems. As case studies, the measures are employed as meta-features in three meta-learning setups: (i) the first predicts the regression function type of some synthetic datasets; (ii) the second is designed to tune the parameter values of support vector regressors; and (iii) the third aims to predict the performance of various regressors on a given dataset. The results show the suitability of the new measures for describing regression datasets and their utility in the meta-learning tasks considered. In cases (ii) and (iii), the results achieved are also similar to or better than those obtained with classical meta-features in meta-learning.
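To make the idea of a complexity meta-feature concrete, the following is a minimal sketch, assuming two simple stand-in descriptors: the largest absolute feature-target correlation and the residual error of a linear fit. The abstract does not name the paper's measures, so the function name `complexity_meta_features`, both descriptors, and the synthetic targets are assumptions made for illustration only.

```python
import numpy as np


def complexity_meta_features(X, y):
    """Return two illustrative complexity descriptors for a regression dataset.

    Hypothetical helper: these descriptors are simple stand-ins chosen for
    this sketch, not necessarily the measures defined in the paper.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)

    # Descriptor 1: largest absolute Pearson correlation between any single
    # feature and the target (a higher value suggests a simpler problem).
    correlations = [abs(np.corrcoef(X[:, j], y)[0, 1]) for j in range(X.shape[1])]
    max_abs_corr = float(max(correlations))

    # Descriptor 2: mean squared residual of an ordinary least-squares fit
    # with an intercept (a lower value suggests a more linear target).
    design = np.column_stack([X, np.ones(len(y))])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    residuals = y - design @ coef
    linear_mse = float(np.mean(residuals ** 2))

    return {"max_abs_corr": max_abs_corr, "linear_mse": linear_mse}


# Synthetic data (assumed for the example): one linear and one nonlinear target.
rng = np.random.default_rng(0)
X = rng.uniform(-1.0, 1.0, size=(200, 3))
linear_target = 2.0 * X[:, 0] - X[:, 1]
nonlinear_target = np.sin(6.0 * X[:, 0]) * X[:, 1] ** 2

linear_features = complexity_meta_features(X, linear_target)
nonlinear_features = complexity_meta_features(X, nonlinear_target)
```

In a meta-learning setup such as those described above, vectors like these would be computed once per dataset and then used as inputs to a meta-model, for example one that predicts the function type or the expected performance of a regressor.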

Keywords: Meta-learning, Meta-features, Complexity measures

Paper URL: https://doi.org/10.1007/s10994-017-5681-1