Cooperative traffic signal control using Multi-step return and Off-policy Asynchronous Advantage Actor-Critic Graph algorithm
作者:
Highlights:
• MOA3CG algorithm is proposed for multi-intersection cooperative traffic signal control.
• The MOA3CG algorithm is based on multi-step return, off-policy and a graph.
• An AMTSPC for selection of actions is proposed and adopted in MOA3CG.
• Experiments show that MOA3CG algorithm outperforms the state-of-the-art algorithms.
摘要
•MOA3CG algorithm is proposed for multi-intersection cooperative traffic signal control.•The MOA3CG algorithm is based on multi-step return, off-policy and a graph.•An AMTSPC for selection of actions is proposed and adopted in MOA3CG.•Experiments show that MOA3CG algorithm outperforms the state-of-the-art algorithms.
论文关键词:Cooperative traffic signal control,Coordination graph algorithm,Multiagent deep reinforcement learning,Transfer planning,Asynchronous Advantage Actor-Critic (A3C) algorithm
论文评审过程:Received 20 March 2019, Revised 12 July 2019, Accepted 17 July 2019, Available online 22 July 2019, Version of Record 27 September 2019.
论文官网地址:https://doi.org/10.1016/j.knosys.2019.07.026