Optimal bipartite consensus control for heterogeneous unknown multi-agent systems via reinforcement learning

This study focuses on addressing optimal bipartite consensus control (OBCC) problems in heterogeneous multi-agent systems (MASs) without relying on the agents' dynamics. Motivated by the need for model-free and optimal consensus control in complex MASs, a novel distributed scheme utilizing rein...

Bibliographic Details
Published in:	Applied mathematics and computation, 2024-09, Vol.476, p.128785, Article 128785
Main Authors:	Meng, Hao; Pang, Denghao; Cao, Jinde; Guo, Yechen; Niazi, Azmat Ullah Khan
Format:	Article
Language:	English
Subjects:	Cooperative control; Heterogeneous multi-agent systems; Optimal bipartite consensus; Reinforcement learning

cited_by	cdi_FETCH-LOGICAL-c297t-79b41ff5c459f7aa54b09567e0ad31058e0f62b9a0a939545b9cbb897a1e8ebf3
cites	cdi_FETCH-LOGICAL-c297t-79b41ff5c459f7aa54b09567e0ad31058e0f62b9a0a939545b9cbb897a1e8ebf3
container_end_page
container_issue
container_start_page	128785
container_title	Applied mathematics and computation
container_volume	476
creator	Meng, Hao; Pang, Denghao; Cao, Jinde; Guo, Yechen; Niazi, Azmat Ullah Khan
description	This study focuses on addressing optimal bipartite consensus control (OBCC) problems in heterogeneous multi-agent systems (MASs) without relying on the agents' dynamics. Motivated by the need for model-free and optimal consensus control in complex MASs, a novel distributed scheme utilizing reinforcement learning (RL) is proposed to overcome these challenges. The MAS network is randomly partitioned into sub-networks where agents collaborate within each subgroup to attain tracking control and ensure convergence of positions and speeds to a common value. However, agents from distinct subgroups compete to achieve diverse tracking objectives. Furthermore, the heterogeneous MASs considered have unknown first and second-order dynamics, adding to the complexity of the problem. To address the OBCC issue, the policy iteration (PI) algorithm is used to acquire solutions for discrete-time Hamilton-Jacobi-Bellman (HJB) equations while implementing a data-driven actor-critic neural network (ACNN) framework. Ultimately, the accuracy of our proposed approach is confirmed through the presentation of numerical simulations. Highlights: • A novel approach is proposed to investigate the OBCC of heterogeneous MASs in competition-cooperation relationship. • To avoid the use of system dynamics, a model-free RL algorithm is proposed, utilizing available input-output data to construct a new system and achieve the OBCC of heterogeneous MASs. • The challenge of dimensional discrepancies in heterogeneous MASs is overcome by the incorporation of estimated velocities, thereby converting the heterogeneous systems into homogeneous systems. • A distributed actor-critic neural network based on PI is proposed to obtain the optimal control policy, and the convergence analysis of the PI algorithm is conducted.
doi_str_mv	10.1016/j.amc.2024.128785
format	article
fullrecord	<record><control><sourceid>elsevier_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1016_j_amc_2024_128785</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0096300324002492</els_id><sourcerecordid>S0096300324002492</sourcerecordid><originalsourceid>FETCH-LOGICAL-c297t-79b41ff5c459f7aa54b09567e0ad31058e0f62b9a0a939545b9cbb897a1e8ebf3</originalsourceid><addsrcrecordid>eNp9kM9OwzAMhyMEEmPwANzyAi1O2zSNOKGJf9KkXeAcpZkzMtpkSrKhvT2dxpmTLf_0WfZHyD2DkgFrH7alHk1ZQdWUrOpExy_IjHWiLnjbyEsyA5BtUQPU1-QmpS0AiJY1MzKsdtmNeqC92-mYXUZqgk_o0z6duhzDQG2I9AszxrBBj2FK9v7bhx9Px_2QXaGncabpmDKOiR6cphGdnyiD4ykZUEfv_OaWXFk9JLz7q3Py-fL8sXgrlqvX98XTsjCVFLkQsm-Ytdw0XFqhNW96kLwVCHpdM-Adgm2rXmrQspa84b00fd9JoRl22Nt6Tth5r4khpYhW7eL0ZDwqBuqkS23VpEuddKmzrol5PDM4HXZwGFUyDr3BtYtosloH9w_9C-D9dnY</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Optimal bipartite consensus control for heterogeneous unknown multi-agent systems via reinforcement learning</title><source>ScienceDirect Freedom Collection</source><source>Backfile Package - Computer Science (Legacy) [YCS]</source><source>Backfile Package - Mathematics (Legacy) [YMT]</source><creator>Meng, Hao ; Pang, Denghao ; Cao, Jinde ; Guo, Yechen ; Niazi, Azmat Ullah Khan</creator><creatorcontrib>Meng, Hao ; Pang, Denghao ; Cao, Jinde ; Guo, Yechen ; Niazi, Azmat Ullah Khan</creatorcontrib><description>This study focuses on addressing optimal bipartite consensus control (OBCC) problems in heterogeneous multi-agent systems (MASs) without relying on the agents' dynamics. Motivated by the need for model-free and optimal consensus control in complex MASs, a novel distributed scheme utilizing reinforcement learning (RL) is proposed to overcome these challenges. 
The MAS network is randomly partitioned into sub-networks where agents collaborate within each subgroup to attain tracking control and ensure convergence of positions and speeds to a common value. However, agents from distinct subgroups compete to achieve diverse tracking objectives. Furthermore, the heterogeneous MASs considered have unknown first and second-order dynamics, adding to the complexity of the problem. To address the OBCC issue, the policy iteration (PI) algorithm is used to acquire solutions for discrete-time Hamilton-Jacobi-Bellman (HJB) equations while implementing a data-driven actor-critic neural network (ACNN) framework. Ultimately, the accuracy of our proposed approach is confirmed through the presentation of numerical simulations. •A novel approach is proposed to investigate the OBCC of heterogeneous MASs in competition-cooperation relationship.•To avoid the use of system dynamics, a model-free RL algorithm is proposed, utilizing available input-output data to construct a new system and achieve the OBCC of heterogeneous MASs.•The challenge of dimensional discrepancies in heterogeneous MASs is overcome by the incorporation of estimated velocities, thereby converting the heterogeneous systems into homogeneous systems.•A distributed actor-critic neural network based on PI is proposed to obtain the optimal control policy, and the convergence analysis of the PI algorithm is conducted.</description><identifier>ISSN: 0096-3003</identifier><identifier>EISSN: 1873-5649</identifier><identifier>DOI: 10.1016/j.amc.2024.128785</identifier><language>eng</language><publisher>Elsevier Inc</publisher><subject>Cooperative control ; Heterogeneous multi-agent systems ; Optimal bipartite consensus ; Reinforcement learning</subject><ispartof>Applied mathematics and computation, 2024-09, Vol.476, p.128785, Article 128785</ispartof><rights>2024 Elsevier 
Inc.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c297t-79b41ff5c459f7aa54b09567e0ad31058e0f62b9a0a939545b9cbb897a1e8ebf3</citedby><cites>FETCH-LOGICAL-c297t-79b41ff5c459f7aa54b09567e0ad31058e0f62b9a0a939545b9cbb897a1e8ebf3</cites><orcidid>0000-0001-7669-3614</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0096300324002492$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3427,3562,27923,27924,45971,46002</link.rule.ids></links><search><creatorcontrib>Meng, Hao</creatorcontrib><creatorcontrib>Pang, Denghao</creatorcontrib><creatorcontrib>Cao, Jinde</creatorcontrib><creatorcontrib>Guo, Yechen</creatorcontrib><creatorcontrib>Niazi, Azmat Ullah Khan</creatorcontrib><title>Optimal bipartite consensus control for heterogeneous unknown multi-agent systems via reinforcement learning</title><title>Applied mathematics and computation</title><description>This study focuses on addressing optimal bipartite consensus control (OBCC) problems in heterogeneous multi-agent systems (MASs) without relying on the agents' dynamics. Motivated by the need for model-free and optimal consensus control in complex MASs, a novel distributed scheme utilizing reinforcement learning (RL) is proposed to overcome these challenges. The MAS network is randomly partitioned into sub-networks where agents collaborate within each subgroup to attain tracking control and ensure convergence of positions and speeds to a common value. However, agents from distinct subgroups compete to achieve diverse tracking objectives. Furthermore, the heterogeneous MASs considered have unknown first and second-order dynamics, adding to the complexity of the problem. 
To address the OBCC issue, the policy iteration (PI) algorithm is used to acquire solutions for discrete-time Hamilton-Jacobi-Bellman (HJB) equations while implementing a data-driven actor-critic neural network (ACNN) framework. Ultimately, the accuracy of our proposed approach is confirmed through the presentation of numerical simulations. •A novel approach is proposed to investigate the OBCC of heterogeneous MASs in competition-cooperation relationship.•To avoid the use of system dynamics, a model-free RL algorithm is proposed, utilizing available input-output data to construct a new system and achieve the OBCC of heterogeneous MASs.•The challenge of dimensional discrepancies in heterogeneous MASs is overcome by the incorporation of estimated velocities, thereby converting the heterogeneous systems into homogeneous systems.•A distributed actor-critic neural network based on PI is proposed to obtain the optimal control policy, and the convergence analysis of the PI algorithm is conducted.</description><subject>Cooperative control</subject><subject>Heterogeneous multi-agent systems</subject><subject>Optimal bipartite consensus</subject><subject>Reinforcement learning</subject><issn>0096-3003</issn><issn>1873-5649</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kM9OwzAMhyMEEmPwANzyAi1O2zSNOKGJf9KkXeAcpZkzMtpkSrKhvT2dxpmTLf_0WfZHyD2DkgFrH7alHk1ZQdWUrOpExy_IjHWiLnjbyEsyA5BtUQPU1-QmpS0AiJY1MzKsdtmNeqC92-mYXUZqgk_o0z6duhzDQG2I9AszxrBBj2FK9v7bhx9Px_2QXaGncabpmDKOiR6cphGdnyiD4ykZUEfv_OaWXFk9JLz7q3Py-fL8sXgrlqvX98XTsjCVFLkQsm-Ytdw0XFqhNW96kLwVCHpdM-Adgm2rXmrQspa84b00fd9JoRl22Nt6Tth5r4khpYhW7eL0ZDwqBuqkS23VpEuddKmzrol5PDM4HXZwGFUyDr3BtYtosloH9w_9C-D9dnY</recordid><startdate>20240901</startdate><enddate>20240901</enddate><creator>Meng, Hao</creator><creator>Pang, Denghao</creator><creator>Cao, Jinde</creator><creator>Guo, Yechen</creator><creator>Niazi, Azmat Ullah 
Khan</creator><general>Elsevier Inc</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0001-7669-3614</orcidid></search><sort><creationdate>20240901</creationdate><title>Optimal bipartite consensus control for heterogeneous unknown multi-agent systems via reinforcement learning</title><author>Meng, Hao ; Pang, Denghao ; Cao, Jinde ; Guo, Yechen ; Niazi, Azmat Ullah Khan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c297t-79b41ff5c459f7aa54b09567e0ad31058e0f62b9a0a939545b9cbb897a1e8ebf3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Cooperative control</topic><topic>Heterogeneous multi-agent systems</topic><topic>Optimal bipartite consensus</topic><topic>Reinforcement learning</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Meng, Hao</creatorcontrib><creatorcontrib>Pang, Denghao</creatorcontrib><creatorcontrib>Cao, Jinde</creatorcontrib><creatorcontrib>Guo, Yechen</creatorcontrib><creatorcontrib>Niazi, Azmat Ullah Khan</creatorcontrib><collection>CrossRef</collection><jtitle>Applied mathematics and computation</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Meng, Hao</au><au>Pang, Denghao</au><au>Cao, Jinde</au><au>Guo, Yechen</au><au>Niazi, Azmat Ullah Khan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Optimal bipartite consensus control for heterogeneous unknown multi-agent systems via reinforcement learning</atitle><jtitle>Applied mathematics and computation</jtitle><date>2024-09-01</date><risdate>2024</risdate><volume>476</volume><spage>128785</spage><pages>128785-</pages><artnum>128785</artnum><issn>0096-3003</issn><eissn>1873-5649</eissn><abstract>This study focuses on addressing optimal bipartite consensus control (OBCC) problems in heterogeneous 
multi-agent systems (MASs) without relying on the agents' dynamics. Motivated by the need for model-free and optimal consensus control in complex MASs, a novel distributed scheme utilizing reinforcement learning (RL) is proposed to overcome these challenges. The MAS network is randomly partitioned into sub-networks where agents collaborate within each subgroup to attain tracking control and ensure convergence of positions and speeds to a common value. However, agents from distinct subgroups compete to achieve diverse tracking objectives. Furthermore, the heterogeneous MASs considered have unknown first and second-order dynamics, adding to the complexity of the problem. To address the OBCC issue, the policy iteration (PI) algorithm is used to acquire solutions for discrete-time Hamilton-Jacobi-Bellman (HJB) equations while implementing a data-driven actor-critic neural network (ACNN) framework. Ultimately, the accuracy of our proposed approach is confirmed through the presentation of numerical simulations. •A novel approach is proposed to investigate the OBCC of heterogeneous MASs in competition-cooperation relationship.•To avoid the use of system dynamics, a model-free RL algorithm is proposed, utilizing available input-output data to construct a new system and achieve the OBCC of heterogeneous MASs.•The challenge of dimensional discrepancies in heterogeneous MASs is overcome by the incorporation of estimated velocities, thereby converting the heterogeneous systems into homogeneous systems.•A distributed actor-critic neural network based on PI is proposed to obtain the optimal control policy, and the convergence analysis of the PI algorithm is conducted.</abstract><pub>Elsevier Inc</pub><doi>10.1016/j.amc.2024.128785</doi><orcidid>https://orcid.org/0000-0001-7669-3614</orcidid></addata></record>
fulltext	fulltext
identifier	ISSN: 0096-3003
ispartof	Applied mathematics and computation, 2024-09, Vol.476, p.128785, Article 128785
issn	0096-3003; 1873-5649
language	eng
recordid	cdi_crossref_primary_10_1016_j_amc_2024_128785
source	ScienceDirect Freedom Collection; Backfile Package - Computer Science (Legacy) [YCS]; Backfile Package - Mathematics (Legacy) [YMT]
subjects	Cooperative control; Heterogeneous multi-agent systems; Optimal bipartite consensus; Reinforcement learning
title	Optimal bipartite consensus control for heterogeneous unknown multi-agent systems via reinforcement learning
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-12T08%3A12%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-elsevier_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Optimal%20bipartite%20consensus%20control%20for%20heterogeneous%20unknown%20multi-agent%20systems%20via%20reinforcement%20learning&rft.jtitle=Applied%20mathematics%20and%20computation&rft.au=Meng,%20Hao&rft.date=2024-09-01&rft.volume=476&rft.spage=128785&rft.pages=128785-&rft.artnum=128785&rft.issn=0096-3003&rft.eissn=1873-5649&rft_id=info:doi/10.1016/j.amc.2024.128785&rft_dat=%3Celsevier_cross%3ES0096300324002492%3C/elsevier_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c297t-79b41ff5c459f7aa54b09567e0ad31058e0f62b9a0a939545b9cbb897a1e8ebf3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true