Loading…
Global and Local Awareness: Combine Reinforcement Learning and Model-Based Control for Collision Avoidance
In this research, we focus on developing an autonomous system for multiship collision avoidance. The proposed approach combines global path planning based on deep reinforcement learning (DRL) and local motion control to improve computational efficiency and alleviate the sensitivity to heading angle...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Online Access: | Request full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | |
container_end_page | |
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Zhao, Luman Li, Guoyuan Zhang, Houxiang |
description | In this research, we focus on developing an autonomous system for multiship collision avoidance. The proposed approach combines global path planning based on deep reinforcement learning (DRL) and local motion control to improve computational efficiency and alleviate the sensitivity to heading angle changes. To achieve this, firstly, DRL is used to learn a policy that maps observable states of target ships to a sequence of predicted waypoints. This learning task aims to generate a specific trajectory while avoiding collision with target ships complying with the international regulations for preventing collisions at sea (COLREGs). The learned policy is used as a global path planner during navigation. Secondly, the line-of-sight (LOS) guidance system is applied to calculate the desired course command based on the collision-free trajectory generated according to the policy. Lastly, a model-based control strategy is implemented to control the ship to the specific goal in collision-free space while satisfying the desired commands. We demonstrate the performance of the approach using an example of an autonomous surface vehicle. In comparison to other methods, our proposed control can provide a more stable and smoother maneuvering effect. |
format | article |
fullrecord | <record><control><sourceid>cristin_3HK</sourceid><recordid>TN_cdi_cristin_nora_11250_3141415</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>11250_3141415</sourcerecordid><originalsourceid>FETCH-cristin_nora_11250_31414153</originalsourceid><addsrcrecordid>eNqNiz0KwkAQRtNYiHqH8QABY0xjF4M_RWzEPkx2JzIymYHdoNd3EQ8gX_Fe8b559jyL9SiA6qE1l6x-YyClGPfQ2NizEtyIdbDgaCSdoCUMyvr4fq7mSfIDRvIp1ymYQEqTi3BkU6hfxh7V0TKbDSiRVj8usvXpeG8uuQscJ9ZOLWBXFNtq05XFLq0q_2k-1Q4_gQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Global and Local Awareness: Combine Reinforcement Learning and Model-Based Control for Collision Avoidance</title><source>NORA - Norwegian Open Research Archives</source><creator>Zhao, Luman ; Li, Guoyuan ; Zhang, Houxiang</creator><creatorcontrib>Zhao, Luman ; Li, Guoyuan ; Zhang, Houxiang</creatorcontrib><description>In this research, we focus on developing an autonomous system for multiship collision avoidance. The proposed approach combines global path planning based on deep reinforcement learning (DRL) and local motion control to improve computational efficiency and alleviate the sensitivity to heading angle changes. To achieve this, firstly, DRL is used to learn a policy that maps observable states of target ships to a sequence of predicted waypoints. This learning task aims to generate a specific trajectory while avoiding collision with target ships complying with the international regulations for preventing collisions at sea (COLREGs). The learned policy is used as a global path planner during navigation. Secondly, the line-of-sight (LOS) guidance system is applied to calculate the desired course command based on the collision-free trajectory generated according to the policy. Lastly, a model-based control strategy is implemented to control the ship to the specific goal in collision-free space while satisfying the desired commands. We demonstrate the performance of the approach using an example of an autonomous surface vehicle. In comparison to other methods, our proposed control can provide a more stable and smoother maneuvering effect.</description><language>eng</language><publisher>IEEE</publisher><creationdate>2024</creationdate><rights>info:eu-repo/semantics/openAccess</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,780,885,26567</link.rule.ids><linktorsrc>$$Uhttp://hdl.handle.net/11250/3141415$$EView_record_in_NORA$$FView_record_in_$$GNORA$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Zhao, Luman</creatorcontrib><creatorcontrib>Li, Guoyuan</creatorcontrib><creatorcontrib>Zhang, Houxiang</creatorcontrib><title>Global and Local Awareness: Combine Reinforcement Learning and Model-Based Control for Collision Avoidance</title><description>In this research, we focus on developing an autonomous system for multiship collision avoidance. The proposed approach combines global path planning based on deep reinforcement learning (DRL) and local motion control to improve computational efficiency and alleviate the sensitivity to heading angle changes. To achieve this, firstly, DRL is used to learn a policy that maps observable states of target ships to a sequence of predicted waypoints. This learning task aims to generate a specific trajectory while avoiding collision with target ships complying with the international regulations for preventing collisions at sea (COLREGs). The learned policy is used as a global path planner during navigation. Secondly, the line-of-sight (LOS) guidance system is applied to calculate the desired course command based on the collision-free trajectory generated according to the policy. Lastly, a model-based control strategy is implemented to control the ship to the specific goal in collision-free space while satisfying the desired commands. We demonstrate the performance of the approach using an example of an autonomous surface vehicle. In comparison to other methods, our proposed control can provide a more stable and smoother maneuvering effect.</description><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>3HK</sourceid><recordid>eNqNiz0KwkAQRtNYiHqH8QABY0xjF4M_RWzEPkx2JzIymYHdoNd3EQ8gX_Fe8b559jyL9SiA6qE1l6x-YyClGPfQ2NizEtyIdbDgaCSdoCUMyvr4fq7mSfIDRvIp1ymYQEqTi3BkU6hfxh7V0TKbDSiRVj8usvXpeG8uuQscJ9ZOLWBXFNtq05XFLq0q_2k-1Q4_gQ</recordid><startdate>2024</startdate><enddate>2024</enddate><creator>Zhao, Luman</creator><creator>Li, Guoyuan</creator><creator>Zhang, Houxiang</creator><general>IEEE</general><scope>3HK</scope></search><sort><creationdate>2024</creationdate><title>Global and Local Awareness: Combine Reinforcement Learning and Model-Based Control for Collision Avoidance</title><author>Zhao, Luman ; Li, Guoyuan ; Zhang, Houxiang</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-cristin_nora_11250_31414153</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><toplevel>online_resources</toplevel><creatorcontrib>Zhao, Luman</creatorcontrib><creatorcontrib>Li, Guoyuan</creatorcontrib><creatorcontrib>Zhang, Houxiang</creatorcontrib><collection>NORA - Norwegian Open Research Archives</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Zhao, Luman</au><au>Li, Guoyuan</au><au>Zhang, Houxiang</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Global and Local Awareness: Combine Reinforcement Learning and Model-Based Control for Collision Avoidance</atitle><date>2024</date><risdate>2024</risdate><abstract>In this research, we focus on developing an autonomous system for multiship collision avoidance. The proposed approach combines global path planning based on deep reinforcement learning (DRL) and local motion control to improve computational efficiency and alleviate the sensitivity to heading angle changes. To achieve this, firstly, DRL is used to learn a policy that maps observable states of target ships to a sequence of predicted waypoints. This learning task aims to generate a specific trajectory while avoiding collision with target ships complying with the international regulations for preventing collisions at sea (COLREGs). The learned policy is used as a global path planner during navigation. Secondly, the line-of-sight (LOS) guidance system is applied to calculate the desired course command based on the collision-free trajectory generated according to the policy. Lastly, a model-based control strategy is implemented to control the ship to the specific goal in collision-free space while satisfying the desired commands. We demonstrate the performance of the approach using an example of an autonomous surface vehicle. In comparison to other methods, our proposed control can provide a more stable and smoother maneuvering effect.</abstract><pub>IEEE</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | |
ispartof | |
issn | |
language | eng |
recordid | cdi_cristin_nora_11250_3141415 |
source | NORA - Norwegian Open Research Archives |
title | Global and Local Awareness: Combine Reinforcement Learning and Model-Based Control for Collision Avoidance |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T03%3A13%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-cristin_3HK&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Global%20and%20Local%20Awareness:%20Combine%20Reinforcement%20Learning%20and%20Model-Based%20Control%20for%20Collision%20Avoidance&rft.au=Zhao,%20Luman&rft.date=2024&rft_id=info:doi/&rft_dat=%3Ccristin_3HK%3E11250_3141415%3C/cristin_3HK%3E%3Cgrp_id%3Ecdi_FETCH-cristin_nora_11250_31414153%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |