Loading…

A Balanced Collision Avoidance Algorithm for USVs in Complex Environment: A Deep Reinforcement Learning Approach

The collision avoidance in real-time is crucial for unmanned surface vehicles (USVs) in a complex environment. Traditional methods make it hard to ensure the balance of control decisions. To balance safety and practicality, a collision avoidance algorithm based on deep reinforcement learning (DRL) a...

Full description

Saved in:

Bibliographic Details
Published in:	IEEE transactions on intelligent transportation systems 2024-12, Vol.25 (12), p.21404-21415
Main Authors:	Lou, Mengmeng, Yang, Xiaofei, Hu, Jiabao, Zhu, Zhiyu, Shen, Hao, Xiang, Zhengrong, Zhang, Bin
Format:	Article
Language:	English
Subjects:	Collision avoidance deep deterministic policy gradient Deep reinforcement learning dynamic window algorithm Heuristic algorithms Marine vehicles Navigation obstacle avoidance Real-time systems Safety Training Trajectory Unmanned surface vehicles Vehicle dynamics velocity obstacle
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	The collision avoidance in real-time is crucial for unmanned surface vehicles (USVs) in a complex environment. Traditional methods make it hard to ensure the balance of control decisions. To balance safety and practicality, a collision avoidance algorithm based on deep reinforcement learning (DRL) and a two-level incentive reward based on the principle of complementarity is proposed. To address the vital sparse reward problem of Deep Deterministic Policy Gradient (DDPG), the trajectory evaluation function of the dynamic window algorithm (DWA) is referred to construct the primary reward strategy, and a secondary incentive reward is constructed based on velocity obstacle (VO) to eliminate potential collision risks. To improve the efficiency of training, the electronic chart (EC) and Unity3D are used to build an immersive simulation platform. Based on it, simulations are made to verify the performance. In addition, field experiments are first conducted in various encounter scenarios to verify the effectiveness. The results show that it can take safe collision avoidance actions and get practical paths in various situations.
ISSN:	1524-9050 1558-0016
DOI:	10.1109/TITS.2024.3478319