Loading…

Optimal bipartite consensus control for heterogeneous unknown multi-agent systems via reinforcement learning

This study focuses on addressing optimal bipartite consensus control (OBCC) problems in heterogeneous multi-agent systems (MASs) without relying on the agents' dynamics. Motivated by the need for model-free and optimal consensus control in complex MASs, a novel distributed scheme utilizing rein...

Full description

Saved in:
Bibliographic Details
Published in:Applied mathematics and computation 2024-09, Vol.476, p.128785, Article 128785
Main Authors: Meng, Hao, Pang, Denghao, Cao, Jinde, Guo, Yechen, Niazi, Azmat Ullah Khan
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c297t-79b41ff5c459f7aa54b09567e0ad31058e0f62b9a0a939545b9cbb897a1e8ebf3
cites cdi_FETCH-LOGICAL-c297t-79b41ff5c459f7aa54b09567e0ad31058e0f62b9a0a939545b9cbb897a1e8ebf3
container_end_page
container_issue
container_start_page 128785
container_title Applied mathematics and computation
container_volume 476
creator Meng, Hao
Pang, Denghao
Cao, Jinde
Guo, Yechen
Niazi, Azmat Ullah Khan
description This study focuses on addressing optimal bipartite consensus control (OBCC) problems in heterogeneous multi-agent systems (MASs) without relying on the agents' dynamics. Motivated by the need for model-free and optimal consensus control in complex MASs, a novel distributed scheme utilizing reinforcement learning (RL) is proposed to overcome these challenges. The MAS network is randomly partitioned into sub-networks where agents collaborate within each subgroup to attain tracking control and ensure convergence of positions and speeds to a common value. However, agents from distinct subgroups compete to achieve diverse tracking objectives. Furthermore, the heterogeneous MASs considered have unknown first and second-order dynamics, adding to the complexity of the problem. To address the OBCC issue, the policy iteration (PI) algorithm is used to acquire solutions for discrete-time Hamilton-Jacobi-Bellman (HJB) equations while implementing a data-driven actor-critic neural network (ACNN) framework. Ultimately, the accuracy of our proposed approach is confirmed through the presentation of numerical simulations. •A novel approach is proposed to investigate the OBCC of heterogeneous MASs in competition-cooperation relationship.•To avoid the use of system dynamics, a model-free RL algorithm is proposed, utilizing available input-output data to construct a new system and achieve the OBCC of heterogeneous MASs.•The challenge of dimensional discrepancies in heterogeneous MASs is overcome by the incorporation of estimated velocities, thereby converting the heterogeneous systems into homogeneous systems.•A distributed actor-critic neural network based on PI is proposed to obtain the optimal control policy, and the convergence analysis of the PI algorithm is conducted.
doi_str_mv 10.1016/j.amc.2024.128785
format article
fullrecord <record><control><sourceid>elsevier_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1016_j_amc_2024_128785</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0096300324002492</els_id><sourcerecordid>S0096300324002492</sourcerecordid><originalsourceid>FETCH-LOGICAL-c297t-79b41ff5c459f7aa54b09567e0ad31058e0f62b9a0a939545b9cbb897a1e8ebf3</originalsourceid><addsrcrecordid>eNp9kM9OwzAMhyMEEmPwANzyAi1O2zSNOKGJf9KkXeAcpZkzMtpkSrKhvT2dxpmTLf_0WfZHyD2DkgFrH7alHk1ZQdWUrOpExy_IjHWiLnjbyEsyA5BtUQPU1-QmpS0AiJY1MzKsdtmNeqC92-mYXUZqgk_o0z6duhzDQG2I9AszxrBBj2FK9v7bhx9Px_2QXaGncabpmDKOiR6cphGdnyiD4ykZUEfv_OaWXFk9JLz7q3Py-fL8sXgrlqvX98XTsjCVFLkQsm-Ytdw0XFqhNW96kLwVCHpdM-Adgm2rXmrQspa84b00fd9JoRl22Nt6Tth5r4khpYhW7eL0ZDwqBuqkS23VpEuddKmzrol5PDM4HXZwGFUyDr3BtYtosloH9w_9C-D9dnY</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Optimal bipartite consensus control for heterogeneous unknown multi-agent systems via reinforcement learning</title><source>ScienceDirect Freedom Collection</source><source>Backfile Package - Computer Science (Legacy) [YCS]</source><source>Backfile Package - Mathematics (Legacy) [YMT]</source><creator>Meng, Hao ; Pang, Denghao ; Cao, Jinde ; Guo, Yechen ; Niazi, Azmat Ullah Khan</creator><creatorcontrib>Meng, Hao ; Pang, Denghao ; Cao, Jinde ; Guo, Yechen ; Niazi, Azmat Ullah Khan</creatorcontrib><description>This study focuses on addressing optimal bipartite consensus control (OBCC) problems in heterogeneous multi-agent systems (MASs) without relying on the agents' dynamics. Motivated by the need for model-free and optimal consensus control in complex MASs, a novel distributed scheme utilizing reinforcement learning (RL) is proposed to overcome these challenges. The MAS network is randomly partitioned into sub-networks where agents collaborate within each subgroup to attain tracking control and ensure convergence of positions and speeds to a common value. However, agents from distinct subgroups compete to achieve diverse tracking objectives. Furthermore, the heterogeneous MASs considered have unknown first and second-order dynamics, adding to the complexity of the problem. To address the OBCC issue, the policy iteration (PI) algorithm is used to acquire solutions for discrete-time Hamilton-Jacobi-Bellman (HJB) equations while implementing a data-driven actor-critic neural network (ACNN) framework. Ultimately, the accuracy of our proposed approach is confirmed through the presentation of numerical simulations. •A novel approach is proposed to investigate the OBCC of heterogeneous MASs in competition-cooperation relationship.•To avoid the use of system dynamics, a model-free RL algorithm is proposed, utilizing available input-output data to construct a new system and achieve the OBCC of heterogeneous MASs.•The challenge of dimensional discrepancies in heterogeneous MASs is overcome by the incorporation of estimated velocities, thereby converting the heterogeneous systems into homogeneous systems.•A distributed actor-critic neural network based on PI is proposed to obtain the optimal control policy, and the convergence analysis of the PI algorithm is conducted.</description><identifier>ISSN: 0096-3003</identifier><identifier>EISSN: 1873-5649</identifier><identifier>DOI: 10.1016/j.amc.2024.128785</identifier><language>eng</language><publisher>Elsevier Inc</publisher><subject>Cooperative control ; Heterogeneous multi-agent systems ; Optimal bipartite consensus ; Reinforcement learning</subject><ispartof>Applied mathematics and computation, 2024-09, Vol.476, p.128785, Article 128785</ispartof><rights>2024 Elsevier Inc.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c297t-79b41ff5c459f7aa54b09567e0ad31058e0f62b9a0a939545b9cbb897a1e8ebf3</citedby><cites>FETCH-LOGICAL-c297t-79b41ff5c459f7aa54b09567e0ad31058e0f62b9a0a939545b9cbb897a1e8ebf3</cites><orcidid>0000-0001-7669-3614</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0096300324002492$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3427,3562,27923,27924,45971,46002</link.rule.ids></links><search><creatorcontrib>Meng, Hao</creatorcontrib><creatorcontrib>Pang, Denghao</creatorcontrib><creatorcontrib>Cao, Jinde</creatorcontrib><creatorcontrib>Guo, Yechen</creatorcontrib><creatorcontrib>Niazi, Azmat Ullah Khan</creatorcontrib><title>Optimal bipartite consensus control for heterogeneous unknown multi-agent systems via reinforcement learning</title><title>Applied mathematics and computation</title><description>This study focuses on addressing optimal bipartite consensus control (OBCC) problems in heterogeneous multi-agent systems (MASs) without relying on the agents' dynamics. Motivated by the need for model-free and optimal consensus control in complex MASs, a novel distributed scheme utilizing reinforcement learning (RL) is proposed to overcome these challenges. The MAS network is randomly partitioned into sub-networks where agents collaborate within each subgroup to attain tracking control and ensure convergence of positions and speeds to a common value. However, agents from distinct subgroups compete to achieve diverse tracking objectives. Furthermore, the heterogeneous MASs considered have unknown first and second-order dynamics, adding to the complexity of the problem. To address the OBCC issue, the policy iteration (PI) algorithm is used to acquire solutions for discrete-time Hamilton-Jacobi-Bellman (HJB) equations while implementing a data-driven actor-critic neural network (ACNN) framework. Ultimately, the accuracy of our proposed approach is confirmed through the presentation of numerical simulations. •A novel approach is proposed to investigate the OBCC of heterogeneous MASs in competition-cooperation relationship.•To avoid the use of system dynamics, a model-free RL algorithm is proposed, utilizing available input-output data to construct a new system and achieve the OBCC of heterogeneous MASs.•The challenge of dimensional discrepancies in heterogeneous MASs is overcome by the incorporation of estimated velocities, thereby converting the heterogeneous systems into homogeneous systems.•A distributed actor-critic neural network based on PI is proposed to obtain the optimal control policy, and the convergence analysis of the PI algorithm is conducted.</description><subject>Cooperative control</subject><subject>Heterogeneous multi-agent systems</subject><subject>Optimal bipartite consensus</subject><subject>Reinforcement learning</subject><issn>0096-3003</issn><issn>1873-5649</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kM9OwzAMhyMEEmPwANzyAi1O2zSNOKGJf9KkXeAcpZkzMtpkSrKhvT2dxpmTLf_0WfZHyD2DkgFrH7alHk1ZQdWUrOpExy_IjHWiLnjbyEsyA5BtUQPU1-QmpS0AiJY1MzKsdtmNeqC92-mYXUZqgk_o0z6duhzDQG2I9AszxrBBj2FK9v7bhx9Px_2QXaGncabpmDKOiR6cphGdnyiD4ykZUEfv_OaWXFk9JLz7q3Py-fL8sXgrlqvX98XTsjCVFLkQsm-Ytdw0XFqhNW96kLwVCHpdM-Adgm2rXmrQspa84b00fd9JoRl22Nt6Tth5r4khpYhW7eL0ZDwqBuqkS23VpEuddKmzrol5PDM4HXZwGFUyDr3BtYtosloH9w_9C-D9dnY</recordid><startdate>20240901</startdate><enddate>20240901</enddate><creator>Meng, Hao</creator><creator>Pang, Denghao</creator><creator>Cao, Jinde</creator><creator>Guo, Yechen</creator><creator>Niazi, Azmat Ullah Khan</creator><general>Elsevier Inc</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0001-7669-3614</orcidid></search><sort><creationdate>20240901</creationdate><title>Optimal bipartite consensus control for heterogeneous unknown multi-agent systems via reinforcement learning</title><author>Meng, Hao ; Pang, Denghao ; Cao, Jinde ; Guo, Yechen ; Niazi, Azmat Ullah Khan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c297t-79b41ff5c459f7aa54b09567e0ad31058e0f62b9a0a939545b9cbb897a1e8ebf3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Cooperative control</topic><topic>Heterogeneous multi-agent systems</topic><topic>Optimal bipartite consensus</topic><topic>Reinforcement learning</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Meng, Hao</creatorcontrib><creatorcontrib>Pang, Denghao</creatorcontrib><creatorcontrib>Cao, Jinde</creatorcontrib><creatorcontrib>Guo, Yechen</creatorcontrib><creatorcontrib>Niazi, Azmat Ullah Khan</creatorcontrib><collection>CrossRef</collection><jtitle>Applied mathematics and computation</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Meng, Hao</au><au>Pang, Denghao</au><au>Cao, Jinde</au><au>Guo, Yechen</au><au>Niazi, Azmat Ullah Khan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Optimal bipartite consensus control for heterogeneous unknown multi-agent systems via reinforcement learning</atitle><jtitle>Applied mathematics and computation</jtitle><date>2024-09-01</date><risdate>2024</risdate><volume>476</volume><spage>128785</spage><pages>128785-</pages><artnum>128785</artnum><issn>0096-3003</issn><eissn>1873-5649</eissn><abstract>This study focuses on addressing optimal bipartite consensus control (OBCC) problems in heterogeneous multi-agent systems (MASs) without relying on the agents' dynamics. Motivated by the need for model-free and optimal consensus control in complex MASs, a novel distributed scheme utilizing reinforcement learning (RL) is proposed to overcome these challenges. The MAS network is randomly partitioned into sub-networks where agents collaborate within each subgroup to attain tracking control and ensure convergence of positions and speeds to a common value. However, agents from distinct subgroups compete to achieve diverse tracking objectives. Furthermore, the heterogeneous MASs considered have unknown first and second-order dynamics, adding to the complexity of the problem. To address the OBCC issue, the policy iteration (PI) algorithm is used to acquire solutions for discrete-time Hamilton-Jacobi-Bellman (HJB) equations while implementing a data-driven actor-critic neural network (ACNN) framework. Ultimately, the accuracy of our proposed approach is confirmed through the presentation of numerical simulations. •A novel approach is proposed to investigate the OBCC of heterogeneous MASs in competition-cooperation relationship.•To avoid the use of system dynamics, a model-free RL algorithm is proposed, utilizing available input-output data to construct a new system and achieve the OBCC of heterogeneous MASs.•The challenge of dimensional discrepancies in heterogeneous MASs is overcome by the incorporation of estimated velocities, thereby converting the heterogeneous systems into homogeneous systems.•A distributed actor-critic neural network based on PI is proposed to obtain the optimal control policy, and the convergence analysis of the PI algorithm is conducted.</abstract><pub>Elsevier Inc</pub><doi>10.1016/j.amc.2024.128785</doi><orcidid>https://orcid.org/0000-0001-7669-3614</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0096-3003
ispartof Applied mathematics and computation, 2024-09, Vol.476, p.128785, Article 128785
issn 0096-3003
1873-5649
language eng
recordid cdi_crossref_primary_10_1016_j_amc_2024_128785
source ScienceDirect Freedom Collection; Backfile Package - Computer Science (Legacy) [YCS]; Backfile Package - Mathematics (Legacy) [YMT]
subjects Cooperative control
Heterogeneous multi-agent systems
Optimal bipartite consensus
Reinforcement learning
title Optimal bipartite consensus control for heterogeneous unknown multi-agent systems via reinforcement learning
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-12T08%3A12%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-elsevier_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Optimal%20bipartite%20consensus%20control%20for%20heterogeneous%20unknown%20multi-agent%20systems%20via%20reinforcement%20learning&rft.jtitle=Applied%20mathematics%20and%20computation&rft.au=Meng,%20Hao&rft.date=2024-09-01&rft.volume=476&rft.spage=128785&rft.pages=128785-&rft.artnum=128785&rft.issn=0096-3003&rft.eissn=1873-5649&rft_id=info:doi/10.1016/j.amc.2024.128785&rft_dat=%3Celsevier_cross%3ES0096300324002492%3C/elsevier_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c297t-79b41ff5c459f7aa54b09567e0ad31058e0f62b9a0a939545b9cbb897a1e8ebf3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true