Loading…

Urban network geofencing with dynamic speed limit policy via deep reinforcement learning

•Urban network geofencing (UNG) is combined with dynamic speed limit policies to improve traffic efficiency and sustainability.•TD3 algorithm is employed to train the UNG controller with actor-critic framework for the first time.•Simulation-based case study on a real-world local network shows that T...

Full description

Saved in:
Bibliographic Details
Published in:Transportation research. Part A, Policy and practice Policy and practice, 2024-05, Vol.183, p.104067, Article 104067
Main Authors: Lu, Wenqi, Yi, Ziwei, Gidofalvi, Gyözö, Simoni, Michele D., Rui, Yikang, Ran, Bin
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•Urban network geofencing (UNG) is combined with dynamic speed limit policies to improve traffic efficiency and sustainability.•TD3 algorithm is employed to train the UNG controller with actor-critic framework for the first time.•Simulation-based case study on a real-world local network shows that TD3-UNG controller reduces the comprehensive emissions by 27.73%.•DRL-based UNG controllers can efficiently reduce total time spent, emissions, and fuel consumption in various scenarios.•The simulation-based platform with the TD3-UNG can be used as an economical way to make beneficial speed limit policies. Urban environment and mobility are threatened by traffic congestion due to the growth in the number of vehicles and urbanization. To address this problem, we propose a deep reinforcement learning-based (DRL) urban network geofencing (UNG) strategy for traffic management to improve traffic operations and sustainability. The proposed solution creates a real-time geofence that consists of several sub-networks where dynamic speed limit policies are implemented. The road links in each sub-network share the same speed limit policy in a control cycle. An actor-critic framework is developed to learn the discrete speed limits of sub-networks in a continuous action space, and a reward function is developed based on the average speeds of vehicles on the network. A twin delayed deep deterministic policy gradient (TD3) method is introduced for calibrating the actor-critic networks and solving the overestimation bias problem arising with the function approximation. Based on the traffic simulation of a real-world local network in Shanghai, the performance of the geofencing methods is investigated in various scenarios characterized by different levels of traffic demand and control settings. The findings suggest that the proposed TD3-UNG controller is capable of generating beneficial dynamic speed limit policies to reduce total time spent, emissions, and fuel consumption in various scenarios.
ISSN:0965-8564
1879-2375
DOI:10.1016/j.tra.2024.104067