Meta-features for meta-learning

作者:

Highlights:

摘要

Meta-learning is increasingly used to support the recommendation of machine learning algorithms and their configurations. These recommendations are made based on meta-data, consisting of performance evaluations of algorithms and characterizations on prior datasets. These characterizations, also called meta-features, describe properties of the data which are predictive for the performance of machine learning algorithms trained on them. Unfortunately, despite being used in many studies, meta-features are not uniformly described, organized and computed, making many empirical studies irreproducible and hard to compare. This paper aims to deal with this by systematizing and standardizing data characterization measures for classification datasets used in meta-learning. Moreover, it presents an extensive list of meta-features and characterization tools, which can be used as a guide for new practitioners. By identifying particularities and subtle issues related to the characterization measures, this survey points out possible future directions that the development of meta-features for meta-learning can assume.

论文关键词:Meta-features,Characterization measures,Meta-learning,Classification problems

论文评审过程:Received 2 August 2021, Revised 15 December 2021, Accepted 30 December 2021, Available online 7 January 2022, Version of Record 22 January 2022.

论文官网地址:https://doi.org/10.1016/j.knosys.2021.108101