GAMA: Graph Attention Multi-agent reinforcement learning algorithm for cooperation

作者：Haoqiang Chen, Yadong Liu, Zongtan Zhou, Dewen Hu, Ming Zhang

摘要

Multi-agent reinforcement learning (MARL) is an important way to realize multi-agent cooperation. But there are still many challenges, including the scalability and the uncertainty of the environment that limit its application. In this paper, we explored to solve those problems through the graph network and the attention mechanism. Finally we succeeded in extending the existing algorithm and obtaining a new algorithm called GAMA. Specifically through the graph network, we made the environment information shared among agents. Meanwhile, the unimportant information was filtered out with the help of the attention mechanism, which helped to improve the communication efficiency. As a result, GAMA obtained the highest mean episode rewards compared to the baselines as well as excellent scalability. The reason why we choose the graph network is that understanding the relationship among agents plays a key role in solving multi-agent problems. And the graph network is very suitable for relational induction bias. Through the integration with the attention mechanism, it was shown that agents could figure out their relationship and focus on the influential environment factors in our experiment.

论文关键词：Multi-agent, Reinforcement learning, Graph network, Attention mechanism

论文评审过程：

论文官网地址：https://doi.org/10.1007/s10489-020-01755-8