The minimum description length principle for pattern mining: a survey
作者:Esther Galbrun
摘要
Mining patterns is a core task in data analysis and, beyond issues of efficient enumeration, the selection of patterns constitutes a major challenge. The Minimum Description Length (MDL) principle, a model selection method grounded in information theory, has been applied to pattern mining with the aim to obtain compact high-quality sets of patterns. After giving an outline of relevant concepts from information theory and coding, we review MDL-based methods for mining different kinds of patterns from various types of data. Finally, we open a discussion on some issues regarding these methods.
论文关键词:Data mining, Pattern mining, Frequent itemset mining, Minimum description length principle, Information theory
论文评审过程:
论文官网地址:https://doi.org/10.1007/s10618-022-00846-z