Loading…
Global path planning algorithm based on double DQN for multi-tasks amphibious unmanned surface vehicle
It is a key to making path planning for an amphibious unmanned surface vehicle (USV). A global path planning algorithm based on double deep Q networks (DDQN) is proposed. Firstly, an environment model is constructed by an electronic nautical chart and elevation map to train and verify the algorithm....
Saved in:
Published in: | Ocean engineering 2022-12, Vol.266, p.112809, Article 112809 |
---|---|
Main Authors: | , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | It is a key to making path planning for an amphibious unmanned surface vehicle (USV). A global path planning algorithm based on double deep Q networks (DDQN) is proposed. Firstly, an environment model is constructed by an electronic nautical chart and elevation map to train and verify the algorithm. Secondly, based on the kinematics of amphibious USV, a Markov decision process (MDP) framework is built, and various reward functions are designed for diverse tasks. During the training, obstacles and water depth information of the environment are used, the amphibious USV agent is guided to the target area. Meanwhile, based on the prior knowledge, an action mask approach is integrated to deal with the invalid actions generated by the amphibious USV. Path smoothing is also integrated to smooth the path. According to different criteria, reasonable paths can be generated and adjusted by the weights of the reward function. To verify our algorithm, a small-scale simulation environment is established, and two scenarios are introduced. The results show that our DDQN algorithm can generate reasonable global paths for diverse tasks. Moreover, compared with DQN, A*, and RRT algorithms, the paths generated by our method have better performance.
•A global path planning algorithm based on DDQN is proposed for our amphibious USV.•Based on the kinematics of the amphibious USV, a Markov decision process framework is built.•The action mask and path smoothing are integrated into our algorithm to generate final path.•Compared with DQN, A*, and RRT, the results show that our method outperforms them. |
---|---|
ISSN: | 0029-8018 1873-5258 |
DOI: | 10.1016/j.oceaneng.2022.112809 |