Cooperative traffic signal control using Multi-step return and Off-policy Asynchronous Advantage Actor-Critic Graph algorithm

Authors:

Highlights:

• The MOA3CG algorithm is proposed for cooperative traffic signal control across multiple intersections.

• MOA3CG combines multi-step returns, off-policy learning, and a coordination graph.

• An AMTSPC for action selection is proposed and adopted in MOA3CG.

• Experiments show that the MOA3CG algorithm outperforms state-of-the-art algorithms.

Keywords: Cooperative traffic signal control, Coordination graph algorithm, Multiagent deep reinforcement learning, Transfer planning, Asynchronous Advantage Actor-Critic (A3C) algorithm

Review history: Received 20 March 2019, Revised 12 July 2019, Accepted 17 July 2019, Available online 22 July 2019, Version of Record 27 September 2019.

Official paper URL: https://doi.org/10.1016/j.knosys.2019.07.026