Loading…

Global and Local Awareness: Combine Reinforcement Learning and Model-Based Control for Collision Avoidance

In this research, we focus on developing an autonomous system for multiship collision avoidance. The proposed approach combines global path planning based on deep reinforcement learning (DRL) and local motion control to improve computational efficiency and alleviate the sensitivity to heading angle...

Full description

Saved in:

Bibliographic Details
Main Authors:	Zhao, Luman, Li, Guoyuan, Zhang, Houxiang
Format:	Article
Language:	English
Online Access:	Request full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by
cites
container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Zhao, Luman Li, Guoyuan Zhang, Houxiang
description	In this research, we focus on developing an autonomous system for multiship collision avoidance. The proposed approach combines global path planning based on deep reinforcement learning (DRL) and local motion control to improve computational efficiency and alleviate the sensitivity to heading angle changes. To achieve this, firstly, DRL is used to learn a policy that maps observable states of target ships to a sequence of predicted waypoints. This learning task aims to generate a specific trajectory while avoiding collision with target ships complying with the international regulations for preventing collisions at sea (COLREGs). The learned policy is used as a global path planner during navigation. Secondly, the line-of-sight (LOS) guidance system is applied to calculate the desired course command based on the collision-free trajectory generated according to the policy. Lastly, a model-based control strategy is implemented to control the ship to the specific goal in collision-free space while satisfying the desired commands. We demonstrate the performance of the approach using an example of an autonomous surface vehicle. In comparison to other methods, our proposed control can provide a more stable and smoother maneuvering effect.
format	article
fullrecord	<record><control><sourceid>cristin_3HK</sourceid><recordid>TN_cdi_cristin_nora_11250_3141415</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>11250_3141415</sourcerecordid><originalsourceid>FETCH-cristin_nora_11250_31414153</originalsourceid><addsrcrecordid>eNqNiz0KwkAQRtNYiHqH8QABY0xjF4M_RWzEPkx2JzIymYHdoNd3EQ8gX_Fe8b559jyL9SiA6qE1l6x-YyClGPfQ2NizEtyIdbDgaCSdoCUMyvr4fq7mSfIDRvIp1ymYQEqTi3BkU6hfxh7V0TKbDSiRVj8usvXpeG8uuQscJ9ZOLWBXFNtq05XFLq0q_2k-1Q4_gQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Global and Local Awareness: Combine Reinforcement Learning and Model-Based Control for Collision Avoidance</title><source>NORA - Norwegian Open Research Archives</source><creator>Zhao, Luman ; Li, Guoyuan ; Zhang, Houxiang</creator><creatorcontrib>Zhao, Luman ; Li, Guoyuan ; Zhang, Houxiang</creatorcontrib><description>In this research, we focus on developing an autonomous system for multiship collision avoidance. The proposed approach combines global path planning based on deep reinforcement learning (DRL) and local motion control to improve computational efficiency and alleviate the sensitivity to heading angle changes. To achieve this, firstly, DRL is used to learn a policy that maps observable states of target ships to a sequence of predicted waypoints. This learning task aims to generate a specific trajectory while avoiding collision with target ships complying with the international regulations for preventing collisions at sea (COLREGs). The learned policy is used as a global path planner during navigation. Secondly, the line-of-sight (LOS) guidance system is applied to calculate the desired course command based on the collision-free trajectory generated according to the policy. Lastly, a model-based control strategy is implemented to control the ship to the specific goal in collision-free space while satisfying the desired commands. We demonstrate the performance of the approach using an example of an autonomous surface vehicle. In comparison to other methods, our proposed control can provide a more stable and smoother maneuvering effect.</description><language>eng</language><publisher>IEEE</publisher><creationdate>2024</creationdate><rights>info:eu-repo/semantics/openAccess</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,780,885,26567</link.rule.ids><linktorsrc>$$Uhttp://hdl.handle.net/11250/3141415$$EView_record_in_NORA$$FView_record_in_$$GNORA$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Zhao, Luman</creatorcontrib><creatorcontrib>Li, Guoyuan</creatorcontrib><creatorcontrib>Zhang, Houxiang</creatorcontrib><title>Global and Local Awareness: Combine Reinforcement Learning and Model-Based Control for Collision Avoidance</title><description>In this research, we focus on developing an autonomous system for multiship collision avoidance. The proposed approach combines global path planning based on deep reinforcement learning (DRL) and local motion control to improve computational efficiency and alleviate the sensitivity to heading angle changes. To achieve this, firstly, DRL is used to learn a policy that maps observable states of target ships to a sequence of predicted waypoints. This learning task aims to generate a specific trajectory while avoiding collision with target ships complying with the international regulations for preventing collisions at sea (COLREGs). The learned policy is used as a global path planner during navigation. Secondly, the line-of-sight (LOS) guidance system is applied to calculate the desired course command based on the collision-free trajectory generated according to the policy. Lastly, a model-based control strategy is implemented to control the ship to the specific goal in collision-free space while satisfying the desired commands. We demonstrate the performance of the approach using an example of an autonomous surface vehicle. In comparison to other methods, our proposed control can provide a more stable and smoother maneuvering effect.</description><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>3HK</sourceid><recordid>eNqNiz0KwkAQRtNYiHqH8QABY0xjF4M_RWzEPkx2JzIymYHdoNd3EQ8gX_Fe8b559jyL9SiA6qE1l6x-YyClGPfQ2NizEtyIdbDgaCSdoCUMyvr4fq7mSfIDRvIp1ymYQEqTi3BkU6hfxh7V0TKbDSiRVj8usvXpeG8uuQscJ9ZOLWBXFNtq05XFLq0q_2k-1Q4_gQ</recordid><startdate>2024</startdate><enddate>2024</enddate><creator>Zhao, Luman</creator><creator>Li, Guoyuan</creator><creator>Zhang, Houxiang</creator><general>IEEE</general><scope>3HK</scope></search><sort><creationdate>2024</creationdate><title>Global and Local Awareness: Combine Reinforcement Learning and Model-Based Control for Collision Avoidance</title><author>Zhao, Luman ; Li, Guoyuan ; Zhang, Houxiang</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-cristin_nora_11250_31414153</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><toplevel>online_resources</toplevel><creatorcontrib>Zhao, Luman</creatorcontrib><creatorcontrib>Li, Guoyuan</creatorcontrib><creatorcontrib>Zhang, Houxiang</creatorcontrib><collection>NORA - Norwegian Open Research Archives</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Zhao, Luman</au><au>Li, Guoyuan</au><au>Zhang, Houxiang</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Global and Local Awareness: Combine Reinforcement Learning and Model-Based Control for Collision Avoidance</atitle><date>2024</date><risdate>2024</risdate><abstract>In this research, we focus on developing an autonomous system for multiship collision avoidance. The proposed approach combines global path planning based on deep reinforcement learning (DRL) and local motion control to improve computational efficiency and alleviate the sensitivity to heading angle changes. To achieve this, firstly, DRL is used to learn a policy that maps observable states of target ships to a sequence of predicted waypoints. This learning task aims to generate a specific trajectory while avoiding collision with target ships complying with the international regulations for preventing collisions at sea (COLREGs). The learned policy is used as a global path planner during navigation. Secondly, the line-of-sight (LOS) guidance system is applied to calculate the desired course command based on the collision-free trajectory generated according to the policy. Lastly, a model-based control strategy is implemented to control the ship to the specific goal in collision-free space while satisfying the desired commands. We demonstrate the performance of the approach using an example of an autonomous surface vehicle. In comparison to other methods, our proposed control can provide a more stable and smoother maneuvering effect.</abstract><pub>IEEE</pub><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	eng
recordid	cdi_cristin_nora_11250_3141415
source	NORA - Norwegian Open Research Archives
title	Global and Local Awareness: Combine Reinforcement Learning and Model-Based Control for Collision Avoidance
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T03%3A13%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-cristin_3HK&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Global%20and%20Local%20Awareness:%20Combine%20Reinforcement%20Learning%20and%20Model-Based%20Control%20for%20Collision%20Avoidance&rft.au=Zhao,%20Luman&rft.date=2024&rft_id=info:doi/&rft_dat=%3Ccristin_3HK%3E11250_3141415%3C/cristin_3HK%3E%3Cgrp_id%3Ecdi_FETCH-cristin_nora_11250_31414153%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true