Loading…
Optimal bipartite consensus control for heterogeneous unknown multi-agent systems via reinforcement learning
This study focuses on addressing optimal bipartite consensus control (OBCC) problems in heterogeneous multi-agent systems (MASs) without relying on the agents' dynamics. Motivated by the need for model-free and optimal consensus control in complex MASs, a novel distributed scheme utilizing rein...
Saved in:
Published in: | Applied mathematics and computation 2024-09, Vol.476, p.128785, Article 128785 |
---|---|
Main Authors: | , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c297t-79b41ff5c459f7aa54b09567e0ad31058e0f62b9a0a939545b9cbb897a1e8ebf3 |
---|---|
cites | cdi_FETCH-LOGICAL-c297t-79b41ff5c459f7aa54b09567e0ad31058e0f62b9a0a939545b9cbb897a1e8ebf3 |
container_end_page | |
container_issue | |
container_start_page | 128785 |
container_title | Applied mathematics and computation |
container_volume | 476 |
creator | Meng, Hao Pang, Denghao Cao, Jinde Guo, Yechen Niazi, Azmat Ullah Khan |
description | This study focuses on addressing optimal bipartite consensus control (OBCC) problems in heterogeneous multi-agent systems (MASs) without relying on the agents' dynamics. Motivated by the need for model-free and optimal consensus control in complex MASs, a novel distributed scheme utilizing reinforcement learning (RL) is proposed to overcome these challenges. The MAS network is randomly partitioned into sub-networks where agents collaborate within each subgroup to attain tracking control and ensure convergence of positions and speeds to a common value. However, agents from distinct subgroups compete to achieve diverse tracking objectives. Furthermore, the heterogeneous MASs considered have unknown first and second-order dynamics, adding to the complexity of the problem. To address the OBCC issue, the policy iteration (PI) algorithm is used to acquire solutions for discrete-time Hamilton-Jacobi-Bellman (HJB) equations while implementing a data-driven actor-critic neural network (ACNN) framework. Ultimately, the accuracy of our proposed approach is confirmed through the presentation of numerical simulations.
•A novel approach is proposed to investigate the OBCC of heterogeneous MASs in competition-cooperation relationship.•To avoid the use of system dynamics, a model-free RL algorithm is proposed, utilizing available input-output data to construct a new system and achieve the OBCC of heterogeneous MASs.•The challenge of dimensional discrepancies in heterogeneous MASs is overcome by the incorporation of estimated velocities, thereby converting the heterogeneous systems into homogeneous systems.•A distributed actor-critic neural network based on PI is proposed to obtain the optimal control policy, and the convergence analysis of the PI algorithm is conducted. |
doi_str_mv | 10.1016/j.amc.2024.128785 |
format | article |
fullrecord | <record><control><sourceid>elsevier_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1016_j_amc_2024_128785</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0096300324002492</els_id><sourcerecordid>S0096300324002492</sourcerecordid><originalsourceid>FETCH-LOGICAL-c297t-79b41ff5c459f7aa54b09567e0ad31058e0f62b9a0a939545b9cbb897a1e8ebf3</originalsourceid><addsrcrecordid>eNp9kM9OwzAMhyMEEmPwANzyAi1O2zSNOKGJf9KkXeAcpZkzMtpkSrKhvT2dxpmTLf_0WfZHyD2DkgFrH7alHk1ZQdWUrOpExy_IjHWiLnjbyEsyA5BtUQPU1-QmpS0AiJY1MzKsdtmNeqC92-mYXUZqgk_o0z6duhzDQG2I9AszxrBBj2FK9v7bhx9Px_2QXaGncabpmDKOiR6cphGdnyiD4ykZUEfv_OaWXFk9JLz7q3Py-fL8sXgrlqvX98XTsjCVFLkQsm-Ytdw0XFqhNW96kLwVCHpdM-Adgm2rXmrQspa84b00fd9JoRl22Nt6Tth5r4khpYhW7eL0ZDwqBuqkS23VpEuddKmzrol5PDM4HXZwGFUyDr3BtYtosloH9w_9C-D9dnY</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Optimal bipartite consensus control for heterogeneous unknown multi-agent systems via reinforcement learning</title><source>ScienceDirect Freedom Collection</source><source>Backfile Package - Computer Science (Legacy) [YCS]</source><source>Backfile Package - Mathematics (Legacy) [YMT]</source><creator>Meng, Hao ; Pang, Denghao ; Cao, Jinde ; Guo, Yechen ; Niazi, Azmat Ullah Khan</creator><creatorcontrib>Meng, Hao ; Pang, Denghao ; Cao, Jinde ; Guo, Yechen ; Niazi, Azmat Ullah Khan</creatorcontrib><description>This study focuses on addressing optimal bipartite consensus control (OBCC) problems in heterogeneous multi-agent systems (MASs) without relying on the agents' dynamics. Motivated by the need for model-free and optimal consensus control in complex MASs, a novel distributed scheme utilizing reinforcement learning (RL) is proposed to overcome these challenges. The MAS network is randomly partitioned into sub-networks where agents collaborate within each subgroup to attain tracking control and ensure convergence of positions and speeds to a common value. However, agents from distinct subgroups compete to achieve diverse tracking objectives. Furthermore, the heterogeneous MASs considered have unknown first and second-order dynamics, adding to the complexity of the problem. To address the OBCC issue, the policy iteration (PI) algorithm is used to acquire solutions for discrete-time Hamilton-Jacobi-Bellman (HJB) equations while implementing a data-driven actor-critic neural network (ACNN) framework. Ultimately, the accuracy of our proposed approach is confirmed through the presentation of numerical simulations.
•A novel approach is proposed to investigate the OBCC of heterogeneous MASs in competition-cooperation relationship.•To avoid the use of system dynamics, a model-free RL algorithm is proposed, utilizing available input-output data to construct a new system and achieve the OBCC of heterogeneous MASs.•The challenge of dimensional discrepancies in heterogeneous MASs is overcome by the incorporation of estimated velocities, thereby converting the heterogeneous systems into homogeneous systems.•A distributed actor-critic neural network based on PI is proposed to obtain the optimal control policy, and the convergence analysis of the PI algorithm is conducted.</description><identifier>ISSN: 0096-3003</identifier><identifier>EISSN: 1873-5649</identifier><identifier>DOI: 10.1016/j.amc.2024.128785</identifier><language>eng</language><publisher>Elsevier Inc</publisher><subject>Cooperative control ; Heterogeneous multi-agent systems ; Optimal bipartite consensus ; Reinforcement learning</subject><ispartof>Applied mathematics and computation, 2024-09, Vol.476, p.128785, Article 128785</ispartof><rights>2024 Elsevier Inc.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c297t-79b41ff5c459f7aa54b09567e0ad31058e0f62b9a0a939545b9cbb897a1e8ebf3</citedby><cites>FETCH-LOGICAL-c297t-79b41ff5c459f7aa54b09567e0ad31058e0f62b9a0a939545b9cbb897a1e8ebf3</cites><orcidid>0000-0001-7669-3614</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0096300324002492$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3427,3562,27923,27924,45971,46002</link.rule.ids></links><search><creatorcontrib>Meng, Hao</creatorcontrib><creatorcontrib>Pang, Denghao</creatorcontrib><creatorcontrib>Cao, Jinde</creatorcontrib><creatorcontrib>Guo, Yechen</creatorcontrib><creatorcontrib>Niazi, Azmat Ullah Khan</creatorcontrib><title>Optimal bipartite consensus control for heterogeneous unknown multi-agent systems via reinforcement learning</title><title>Applied mathematics and computation</title><description>This study focuses on addressing optimal bipartite consensus control (OBCC) problems in heterogeneous multi-agent systems (MASs) without relying on the agents' dynamics. Motivated by the need for model-free and optimal consensus control in complex MASs, a novel distributed scheme utilizing reinforcement learning (RL) is proposed to overcome these challenges. The MAS network is randomly partitioned into sub-networks where agents collaborate within each subgroup to attain tracking control and ensure convergence of positions and speeds to a common value. However, agents from distinct subgroups compete to achieve diverse tracking objectives. Furthermore, the heterogeneous MASs considered have unknown first and second-order dynamics, adding to the complexity of the problem. To address the OBCC issue, the policy iteration (PI) algorithm is used to acquire solutions for discrete-time Hamilton-Jacobi-Bellman (HJB) equations while implementing a data-driven actor-critic neural network (ACNN) framework. Ultimately, the accuracy of our proposed approach is confirmed through the presentation of numerical simulations.
•A novel approach is proposed to investigate the OBCC of heterogeneous MASs in competition-cooperation relationship.•To avoid the use of system dynamics, a model-free RL algorithm is proposed, utilizing available input-output data to construct a new system and achieve the OBCC of heterogeneous MASs.•The challenge of dimensional discrepancies in heterogeneous MASs is overcome by the incorporation of estimated velocities, thereby converting the heterogeneous systems into homogeneous systems.•A distributed actor-critic neural network based on PI is proposed to obtain the optimal control policy, and the convergence analysis of the PI algorithm is conducted.</description><subject>Cooperative control</subject><subject>Heterogeneous multi-agent systems</subject><subject>Optimal bipartite consensus</subject><subject>Reinforcement learning</subject><issn>0096-3003</issn><issn>1873-5649</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kM9OwzAMhyMEEmPwANzyAi1O2zSNOKGJf9KkXeAcpZkzMtpkSrKhvT2dxpmTLf_0WfZHyD2DkgFrH7alHk1ZQdWUrOpExy_IjHWiLnjbyEsyA5BtUQPU1-QmpS0AiJY1MzKsdtmNeqC92-mYXUZqgk_o0z6duhzDQG2I9AszxrBBj2FK9v7bhx9Px_2QXaGncabpmDKOiR6cphGdnyiD4ykZUEfv_OaWXFk9JLz7q3Py-fL8sXgrlqvX98XTsjCVFLkQsm-Ytdw0XFqhNW96kLwVCHpdM-Adgm2rXmrQspa84b00fd9JoRl22Nt6Tth5r4khpYhW7eL0ZDwqBuqkS23VpEuddKmzrol5PDM4HXZwGFUyDr3BtYtosloH9w_9C-D9dnY</recordid><startdate>20240901</startdate><enddate>20240901</enddate><creator>Meng, Hao</creator><creator>Pang, Denghao</creator><creator>Cao, Jinde</creator><creator>Guo, Yechen</creator><creator>Niazi, Azmat Ullah Khan</creator><general>Elsevier Inc</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0001-7669-3614</orcidid></search><sort><creationdate>20240901</creationdate><title>Optimal bipartite consensus control for heterogeneous unknown multi-agent systems via reinforcement learning</title><author>Meng, Hao ; Pang, Denghao ; Cao, Jinde ; Guo, Yechen ; Niazi, Azmat Ullah Khan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c297t-79b41ff5c459f7aa54b09567e0ad31058e0f62b9a0a939545b9cbb897a1e8ebf3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Cooperative control</topic><topic>Heterogeneous multi-agent systems</topic><topic>Optimal bipartite consensus</topic><topic>Reinforcement learning</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Meng, Hao</creatorcontrib><creatorcontrib>Pang, Denghao</creatorcontrib><creatorcontrib>Cao, Jinde</creatorcontrib><creatorcontrib>Guo, Yechen</creatorcontrib><creatorcontrib>Niazi, Azmat Ullah Khan</creatorcontrib><collection>CrossRef</collection><jtitle>Applied mathematics and computation</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Meng, Hao</au><au>Pang, Denghao</au><au>Cao, Jinde</au><au>Guo, Yechen</au><au>Niazi, Azmat Ullah Khan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Optimal bipartite consensus control for heterogeneous unknown multi-agent systems via reinforcement learning</atitle><jtitle>Applied mathematics and computation</jtitle><date>2024-09-01</date><risdate>2024</risdate><volume>476</volume><spage>128785</spage><pages>128785-</pages><artnum>128785</artnum><issn>0096-3003</issn><eissn>1873-5649</eissn><abstract>This study focuses on addressing optimal bipartite consensus control (OBCC) problems in heterogeneous multi-agent systems (MASs) without relying on the agents' dynamics. Motivated by the need for model-free and optimal consensus control in complex MASs, a novel distributed scheme utilizing reinforcement learning (RL) is proposed to overcome these challenges. The MAS network is randomly partitioned into sub-networks where agents collaborate within each subgroup to attain tracking control and ensure convergence of positions and speeds to a common value. However, agents from distinct subgroups compete to achieve diverse tracking objectives. Furthermore, the heterogeneous MASs considered have unknown first and second-order dynamics, adding to the complexity of the problem. To address the OBCC issue, the policy iteration (PI) algorithm is used to acquire solutions for discrete-time Hamilton-Jacobi-Bellman (HJB) equations while implementing a data-driven actor-critic neural network (ACNN) framework. Ultimately, the accuracy of our proposed approach is confirmed through the presentation of numerical simulations.
•A novel approach is proposed to investigate the OBCC of heterogeneous MASs in competition-cooperation relationship.•To avoid the use of system dynamics, a model-free RL algorithm is proposed, utilizing available input-output data to construct a new system and achieve the OBCC of heterogeneous MASs.•The challenge of dimensional discrepancies in heterogeneous MASs is overcome by the incorporation of estimated velocities, thereby converting the heterogeneous systems into homogeneous systems.•A distributed actor-critic neural network based on PI is proposed to obtain the optimal control policy, and the convergence analysis of the PI algorithm is conducted.</abstract><pub>Elsevier Inc</pub><doi>10.1016/j.amc.2024.128785</doi><orcidid>https://orcid.org/0000-0001-7669-3614</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0096-3003 |
ispartof | Applied mathematics and computation, 2024-09, Vol.476, p.128785, Article 128785 |
issn | 0096-3003 1873-5649 |
language | eng |
recordid | cdi_crossref_primary_10_1016_j_amc_2024_128785 |
source | ScienceDirect Freedom Collection; Backfile Package - Computer Science (Legacy) [YCS]; Backfile Package - Mathematics (Legacy) [YMT] |
subjects | Cooperative control Heterogeneous multi-agent systems Optimal bipartite consensus Reinforcement learning |
title | Optimal bipartite consensus control for heterogeneous unknown multi-agent systems via reinforcement learning |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-12T08%3A12%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-elsevier_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Optimal%20bipartite%20consensus%20control%20for%20heterogeneous%20unknown%20multi-agent%20systems%20via%20reinforcement%20learning&rft.jtitle=Applied%20mathematics%20and%20computation&rft.au=Meng,%20Hao&rft.date=2024-09-01&rft.volume=476&rft.spage=128785&rft.pages=128785-&rft.artnum=128785&rft.issn=0096-3003&rft.eissn=1873-5649&rft_id=info:doi/10.1016/j.amc.2024.128785&rft_dat=%3Celsevier_cross%3ES0096300324002492%3C/elsevier_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c297t-79b41ff5c459f7aa54b09567e0ad31058e0f62b9a0a939545b9cbb897a1e8ebf3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |