A new approach to filtering and adaptive control: stability results

作者:

Highlights:

摘要

A new approach to adaptive control is proposed. The principal distinguishing feature is the direct estimation and updating of the criterion function by means of a filtering operation on a vector of transitional pseudo-return functions. The data storage and computational problems often associated with explicit probability distribution updating via Bayes' rule are thus avoided. Convergence properties are established for a simple linear criterion function filter designed for a class of adaptive control problems typified by a well-known two-armed bandit problem. Optimality properties are established for the filter in a companion paper.

论文关键词:

论文评审过程:Available online 21 March 2002.

论文官网地址:https://doi.org/10.1016/0096-3003(78)90025-5