Bayesian invariant measurements of generalization

作者：Huaiyu Zhu, Richard Rohwer

摘要

The problem of evaluating different learning rules and other statistical estimators is analysed. A new general theory of statistical inference is developed by combining Bayesian decision theory with information geometry. It is coherent and invariant. For each sample a unique ideal estimate exists and is given by an average over the posterior. An optimal estimate within a model is given by a projection of the ideal estimate. The ideal estimate is a sufficient statistic of the posterior, so practical learning rules are functions of the ideal estimator. If the sole purpose of learning is to extract information from the data, the learning rule must also approximate the ideal estimator. This framework is applicable to both Bayesian and non-Bayesian methods, with arbitrary statistical models, and to supervised, unsupervised and reinforcement learning schemes.

论文关键词：Neural Network, General Theory, Nonlinear Dynamics, Invariant Measurement, Statistical Estimator

论文评审过程：

论文官网地址：https://doi.org/10.1007/BF02309013