Automated eco-driving in urban scenarios using deep reinforcement learning
Published in: | Transportation research. Part C, Emerging technologies, 2021-05, Vol.126, p.102967, Article 102967 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Summary: | • We demonstrate the use of reinforcement learning for eco-driving strategies. • Only minimal data on the traffic situation are provided to the agent. • No explicit prediction of the traffic situation is required. • The energy saving potential was determined to be up to 11% compared with a green light optimal speed advice system.
Urban settings are challenging environments in which to implement eco-driving strategies for automated vehicles. It is often assumed that sufficient information on the platoon of preceding vehicles is available to accurately predict the traffic situation. Because vehicle-to-vehicle communication was introduced only recently, this assumption will not hold until the technology has reached a sufficiently high penetration of the vehicle fleet. Thus, in the present study, we employed Reinforcement Learning (RL) to develop eco-driving strategies for cases where little data on the traffic situation are available.
An A-segment electric vehicle was simulated using detailed efficiency models to accurately determine its energy-saving potential. A probabilistic traffic environment featuring signalized urban roads and multiple preceding vehicles was integrated into the simulation model. Only information on the traffic light timing and minimal sensor data were provided to the control algorithm. A twin-delayed deep deterministic policy gradient (TD3) agent was implemented and trained to control the vehicle efficiently and safely in this environment.
Energy savings of up to 19% compared with a simulated human driver and up to 11% compared with a fine-tuned Green Light Optimal Speed Advice (GLOSA) algorithm were determined in a probabilistic traffic scenario reflecting real-world conditions. Overall, the RL agents showed a better travel time and energy consumption trade-off than the GLOSA reference. |
---|---|
ISSN: | 0968-090X 1879-2359 |
DOI: | 10.1016/j.trc.2021.102967 |