Loading…

A Graph Deep Reinforcement Learning Traffic Signal Control for Multiple Intersections Considering Missing Data

Efficient traffic signal control (TSC) for multiple intersections is an important way to solve traffic congestion. With the development of deep reinforcement learning (DRL), an increasing number of DRL methods are applied to TSC. However, the prevailing methods in DRL-based TSC does not adequately a...

Full description

Saved in:
Bibliographic Details
Published in:IEEE transactions on vehicular technology 2024-12, Vol.73 (12), p.18307-18319
Main Authors: Xu, Dongwei, Yu, Zefeng, Liao, Xiangwang, Guo, Haifeng
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Efficient traffic signal control (TSC) for multiple intersections is an important way to solve traffic congestion. With the development of deep reinforcement learning (DRL), an increasing number of DRL methods are applied to TSC. However, the prevailing methods in DRL-based TSC does not adequately address the issue of missing data within the agent state space. Furthermore, these methods often insufficiently account for the intricate interactions and relationships among the agents involved. Therefore, a graph deep reinforcement learning traffic signal control for multiple intersections considering missing data is proposed in this paper. Firstly, we propose an agent state space estimation method based on wasserstein generative adversarial network (WGAN). This method is adept at addressing the issue of diverse types of missing data within the state space to ensure its integrity. Secondly, we propose a graph deep reinforcement learning based on two-stage attention network and GraphSage (TAGGRL) to improve TSC efficiency for multiple intersections. A dynamic interaction graph based on two-stage attention network is constructed to facilitate effective interactions among agents. GraphSAGE is constructed for aggregating multi-agent state features. Then, the decision network outputs Q-values based on the extracted features, which guide agents in executing phase-specific actions to enhance the smoothness of multiple intersections. Finally, the experimental results confirm that the agent state space estimation based on WGAN successfully solves the problem of missing data within the state space, thereby enhancing the robustness of TSC for multiple intersections. Furthermore, the TAGGRL model surpasses the baseline model in terms of TSC efficiency for multiple intersections.
ISSN:0018-9545
1939-9359
DOI:10.1109/TVT.2024.3444475