Deep Reinforcement Learning for Traffic Signal Control

Open Access
- Author:
- Wei, Hua
- Graduate Program:
- Information Sciences and Technology
- Degree:
- Doctor of Philosophy
- Document Type:
- Dissertation
- Date of Defense:
- October 12, 2020
- Committee Members:
- Zhenhui Li, Dissertation Advisor/Co-Advisor
Zhenhui Li, Committee Chair/Co-Chair
C Lee Giles, Committee Member
Xiang Zhang, Committee Member
Vikash Varun Gayah, Outside Member
Mary Beth Rosson, Program Head/Chair
- Keywords:
- reinforcement learning
traffic signal control
urban computing
- Abstract:
- Traffic congestion is a growing problem that continues to plague urban areas, with negative outcomes for both the traveling public and society as a whole. Signalized intersections are among the most prevalent bottlenecks in urban environments, so traffic signal control plays a large role in urban traffic management. Today's widely used traffic signal control systems (e.g., SCATS and SCOOT) are still based on manually designed signal plans. In this dissertation, we propose to use reinforcement learning (RL) to intelligently optimize signal timing plans in real time and reduce traffic congestion. Three key challenges arise. First, although some efforts using RL techniques to adjust traffic signals dynamically have been proposed, they rely on ad-hoc designs when formulating the traffic signal control problem and lack a principled approach to formulating it under the RL framework. Second, since RL learns directly from data via trial-and-error search, it requires a large number of interactions with the environment before the algorithms converge; in real-world problems, every interaction means real cost (e.g., traffic congestion, traffic accidents), so a more data-efficient method is necessary. Third, discrepancies between simulation and reality confine the application of RL in the real world, despite its massive success in domains like games. Most RL methods conduct experiments in a simulator, since a simulator can generate data more cheaply and quickly than real experimentation; addressing the performance gap of RL methods between simulation and the real world is therefore required before RL can be deployed. This dissertation presents how to utilize mobility data and RL-based methods for traffic signal control.
I have investigated the key challenges for RL-based traffic signal control methods, including how to formulate the objective function and how to improve learning efficiency for city-wide traffic signal control. I have also mitigated the performance gap of RL methods between simulation and the real world. The proposed methods achieve significant improvements over state-of-the-art and currently deployed methods, providing promising solutions to traffic signal control problems and implications for smart city applications.
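To make the RL formulation in the abstract concrete, the following is a minimal, self-contained sketch of tabular Q-learning on a toy single intersection. Everything here — the two-approach queue model, the arrival probabilities, the discharge rate, and the queue-length state discretization — is an illustrative assumption for exposition, not the dissertation's actual state/action/reward design (whose principled formulation is one of the dissertation's contributions).

```python
import random
from collections import defaultdict

random.seed(0)

# Toy single intersection with two approaches (e.g., N-S and E-W), each
# modeled as a queue. Action 0 gives green to approach 0, action 1 to
# approach 1. All constants below are illustrative assumptions.
ARRIVAL_P = (0.4, 0.3)   # per-step arrival probability on each approach
SATURATION = 2           # vehicles discharged per green step
MAX_Q = 5                # queue lengths are capped (discretized) here

def step(queues, action):
    """Advance the toy intersection one step; reward = -(total queue)."""
    q = list(queues)
    for i, p in enumerate(ARRIVAL_P):
        if random.random() < p:
            q[i] = min(MAX_Q, q[i] + 1)
    q[action] = max(0, q[action] - SATURATION)  # discharge the green approach
    return tuple(q), -sum(q)

def train(episodes=500, horizon=100, alpha=0.1, gamma=0.9, eps=0.1):
    """Tabular Q-learning over discretized queue-length states."""
    Q = defaultdict(float)
    for _ in range(episodes):
        # Random initial queues so all states get visited.
        s = (random.randrange(MAX_Q + 1), random.randrange(MAX_Q + 1))
        for _ in range(horizon):
            # Epsilon-greedy action selection.
            if random.random() < eps:
                a = random.randrange(2)
            else:
                a = max((0, 1), key=lambda x: Q[(s, x)])
            s2, r = step(s, a)
            best_next = max(Q[(s2, 0)], Q[(s2, 1)])
            Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])
            s = s2
    return Q

Q = train()
policy = lambda s: max((0, 1), key=lambda a: Q[(s, a)])
# The learned policy should give green to the congested approach.
print(policy((4, 0)), policy((0, 4)))
```

Even this toy example hints at the challenges the abstract raises: the state table already has 36 entries for two discretized queues, and the agent needs tens of thousands of environment interactions to converge — interactions that would mean real congestion if collected on live roads rather than in a simulator.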