Elevator Group Control Using Multiple Reinforcement Learning Agents

作者：Robert H. Crites, Andrew G. Barto

摘要

Recent algorithmic and theoretical advances in reinforcement learning (RL) have attracted widespread interest. RL algorithms have appeared that approximate dynamic programming on an incremental basis. They can be trained on the basis of real or simulated experiences, focusing their computation on areas of state space that are actually visited during control, making them computationally tractable on very large problems. If each member of a team of agents employs one of these algorithms, a new collective learning algorithm emerges for the team as a whole. In this paper we demonstrate that such collective RL algorithms can be powerful heuristic methods for addressing large-scale control problems.

论文关键词：Reinforcement learning, multiple agents, teams, elevator group control, discrete event dynamic systems

论文评审过程：

论文官网地址：https://doi.org/10.1023/A:1007518724497