Self-improving reactive agents based on reinforcement learning, planning and teaching

作者:Long-Ji Lin

摘要

To date, reinforcement learning has mostly been studied solving simple learning tasks. Reinforcement learning methods that have been studied so far typically converge slowly. The purpose of this work is thus two-fold: 1) to investigate the utility of reinforcement learning in solving much more complicated learning tasks than previously studied, and 2) to investigate methods that will speed up reinforcement learning.

论文关键词:Reinforcement learning, planning, teaching, connectionist networks

论文评审过程:

论文官网地址:https://doi.org/10.1007/BF00992699