Parallel learner: A practical deep reinforcement learning framework for multi-scenario games

作者:

Highlights:

摘要

Traditional reinforcement learning methods are only applicable to single-scenario tasks. When it comes to multi-scenario, the single-scenario agents fail to perform well. That is, the traditional reinforcement learning methods own the poor generalization when facing different tasks simultaneously. In this work, we propose a practical deep reinforcement learning framework that can perform on multiple 3D scenarios concurrently. We adopt the Actor–Learner framework to realize the parallelization of multiple scenarios and resolve the policy lag problem by generalizing Retrace() to a new value function. We prove its convergence theoretically. Besides, we design an auxiliary recognition task and an auxiliary control task inspired by the hard shared representation in multi-task learning to improve the performance of our multi-scenario agent. Experimental results show that our method outperforms state-of-the-art algorithms on DMLab-30, achieving more advantages on multi-scenario games. We verify the effectiveness of each part of our framework by the ablation experiments. We also find our parallel learner transferable by testing on the untrained scenarios.

论文关键词:Deep reinforcement learning,Incomplete information,Multi-scenario,Multi-task

论文评审过程:Received 9 July 2021, Revised 10 November 2021, Accepted 12 November 2021, Available online 27 November 2021, Version of Record 29 December 2021.

论文官网地址:https://doi.org/10.1016/j.knosys.2021.107753