Hierarchical traffic signal optimization using reinforcement learning and traffic prediction with long-short term memory

Abstract

Multi-agent systems can be used to model large-scale distributed systems in real-world applications. In an intelligent transportation system (ITS), many interacting entities influence the performance of the system. As part of an ITS, traffic signal control can be modelled as a multi-agent system. In this paper, a two-level hierarchical multi-agent system is employed to control traffic signals. At the first (physical) level, each traffic signal is controlled by its own agent. At the upper level, the traffic network is divided into a number of regions, each controlled by a region controller agent. The first-level agents use reinforcement learning to find the best policy and send their local information to the upper-level agents. This local information is used to train a long short-term memory (LSTM) neural network for traffic status prediction. The upper-level agents can then control the traffic signals by finding the best joint policy based on the predicted traffic information. Experimental results show the effectiveness of the proposed method in a traffic network of 16 intersections.
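
To make the first-level idea concrete, the following is a minimal sketch of a per-intersection agent that learns a signal policy by reinforcement learning. The abstract does not say which reinforcement learning algorithm, state encoding, or reward the paper uses, so everything here is an assumption for illustration: tabular Q-learning, a state given as a tuple of queue lengths, a reward equal to the negative total queue, and the hypothetical names `IntersectionAgent`, `act`, and `update`.

```python
import random
from collections import defaultdict


class IntersectionAgent:
    """Illustrative tabular Q-learning agent for one intersection.

    Assumption: states are hashable (e.g. a tuple of queue lengths) and
    actions are integer indices of signal phases. Not the paper's method.
    """

    def __init__(self, n_actions, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
        self.n_actions = n_actions
        self.alpha = alpha      # learning rate
        self.gamma = gamma      # discount factor
        self.epsilon = epsilon  # exploration probability
        self.rng = random.Random(seed)
        # Q-table: unseen states start with zero value for every action.
        self.q = defaultdict(lambda: [0.0] * n_actions)

    def act(self, state):
        """Pick a phase with an epsilon-greedy rule."""
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_actions)
        values = self.q[state]
        return values.index(max(values))

    def update(self, state, action, reward, next_state):
        """Apply the standard one-step Q-learning update."""
        target = reward + self.gamma * max(self.q[next_state])
        self.q[state][action] += self.alpha * (target - self.q[state][action])


# Hypothetical usage: two phases, queues on two approaches.
agent = IntersectionAgent(n_actions=2)
state = (3, 1)
action = agent.act(state)
next_state = (2, 1)
reward = -float(sum(next_state))  # assumed reward: negative total queue
agent.update(state, action, reward, next_state)
```

In the hierarchical scheme described above, each such agent would also report its local observations to its region controller, which is where the LSTM-based prediction and joint policy selection take place. That upper-level part is not sketched here, since the abstract gives no detail on its inputs or outputs.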

Keywords: Traffic signal control, Hierarchical multi-agent system, Reinforcement learning, Traffic prediction, Long short-term memory (LSTM)

Review history: Received 11 October 2020, Revised 29 November 2020, Accepted 4 January 2021, Available online 19 January 2021, Version of Record 29 January 2021.

Official paper URL: https://doi.org/10.1016/j.eswa.2021.114580