Shaping multi-agent systems with gradient reinforcement learning

Authors: Olivier Buffet, Alain Dutech, François Charpillet

Abstract

An original reinforcement learning (RL) methodology is proposed for the design of multi-agent systems. In the realistic setting of situated agents with local perception, the task of automatically building a coordinated system is of crucial importance. To that end, we design simple reactive agents in a decentralized way as independent learners. But to cope with the difficulties inherent to RL used in that framework, we have developed an incremental learning algorithm where agents face a sequence of progressively more complex tasks. We illustrate this general framework by computer experiments where agents have to coordinate to reach a global goal.

Keywords: Reinforcement learning, Multi-agent systems, Partially observable Markov decision processes, Shaping, Policy-gradient

Paper URL: https://doi.org/10.1007/s10458-006-9010-5