Loading…
Multi-agent reinforcement learning for autonomous vehicles: a survey
In the near future, autonomous vehicles (AVs) may cohabit with human drivers in mixed traffic. This cohabitation raises serious challenges, both in terms of traffic flow and individual mobility, as well as from the road safety point of view. Mixed traffic may fail to fulfill expected security requir...
Saved in:
Published in: | Autonomous intelligent systems 2022-11, Vol.2 (1), p.1-12, Article 27 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c378z-e9b1a6e391766500f5a6287183b0bf890ea55799f5b74080c81ca4d60412b1693 |
---|---|
cites | cdi_FETCH-LOGICAL-c378z-e9b1a6e391766500f5a6287183b0bf890ea55799f5b74080c81ca4d60412b1693 |
container_end_page | 12 |
container_issue | 1 |
container_start_page | 1 |
container_title | Autonomous intelligent systems |
container_volume | 2 |
creator | Dinneweth, Joris Boubezoul, Abderrahmane Mandiau, René Espié, Stéphane |
description | In the near future, autonomous vehicles (AVs) may cohabit with human drivers in mixed traffic. This cohabitation raises serious challenges, both in terms of traffic flow and individual mobility, as well as from the road safety point of view. Mixed traffic may fail to fulfill expected security requirements due to the heterogeneity and unpredictability of human drivers, and autonomous cars could then monopolize the traffic. Using multi-agent reinforcement learning (MARL) algorithms, researchers have attempted to design autonomous vehicles for both scenarios, and this paper investigates their recent advances. We focus on articles tackling decision-making problems and identify four paradigms. While some authors address mixed traffic problems with or without social-desirable AVs, others tackle the case of fully-autonomous traffic. While the latter case is essentially a communication problem, most authors addressing the mixed traffic admit some limitations. The current human driver models found in the literature are too simplistic since they do not cover the heterogeneity of the drivers’ behaviors. As a result, they fail to generalize over the wide range of possible behaviors. For each paper investigated, we analyze how the authors formulated the MARL problem in terms of observation, action, and rewards to match the paradigm they apply. |
doi_str_mv | 10.1007/s43684-022-00045-z |
format | article |
fullrecord | <record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_4555466927294fd8845023658422666c</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_4555466927294fd8845023658422666c</doaj_id><sourcerecordid>2736515035</sourcerecordid><originalsourceid>FETCH-LOGICAL-c378z-e9b1a6e391766500f5a6287183b0bf890ea55799f5b74080c81ca4d60412b1693</originalsourceid><addsrcrecordid>eNp9kc1LwzAchosoOOb-AU8FTx6qv3wn3sb8hIkXBW8h7dLZ0TUzaQfbX29qxY-LpyQvz-8hyZskpwguEIC4DJRwSTPAOAMAyrL9QTLCgkDGEX89_LU_TiYhrCKEhSJU0lFy_djVbZWZpW3a1NuqKZ0v7Lo_1db4pmqWaYxS07WucWvXhXRr36qituEqNWno_NbuTpKj0tTBTr7WcfJye_M8u8_mT3cPs-k8K4iQ-8yqHBluiUKCcwZQMsOxFEiSHPJSKrCGMaFUyXJBQUIhUWHoggNFOEdckXHyMHgXzqz0xldr43famUp_Bs4vtfFtfzlNGWOUc4UFVrRcSEkZYMKZpBhzzovoOh9cb6b-o7qfznWfAaUMIaW2KLJnA7vx7r2zodUr1_kmPlXHr-UMMSAsUnigCu9C8Lb81iLQfVF6KErHovRnUXofh8gwFCLcLK3_Uf8z9QFFrZJs</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2736515035</pqid></control><display><type>article</type><title>Multi-agent reinforcement learning for autonomous vehicles: a survey</title><source>Publicly Available Content Database</source><source>Springer Nature - SpringerLink Journals - Fully Open Access</source><creator>Dinneweth, Joris ; Boubezoul, Abderrahmane ; Mandiau, René ; Espié, Stéphane</creator><creatorcontrib>Dinneweth, Joris ; Boubezoul, Abderrahmane ; Mandiau, René ; Espié, Stéphane</creatorcontrib><description>In the near future, autonomous vehicles (AVs) may cohabit with human drivers in mixed traffic. This cohabitation raises serious challenges, both in terms of traffic flow and individual mobility, as well as from the road safety point of view. Mixed traffic may fail to fulfill expected security requirements due to the heterogeneity and unpredictability of human drivers, and autonomous cars could then monopolize the traffic. Using multi-agent reinforcement learning (MARL) algorithms, researchers have attempted to design autonomous vehicles for both scenarios, and this paper investigates their recent advances. We focus on articles tackling decision-making problems and identify four paradigms. While some authors address mixed traffic problems with or without social-desirable AVs, others tackle the case of fully-autonomous traffic. While the latter case is essentially a communication problem, most authors addressing the mixed traffic admit some limitations. The current human driver models found in the literature are too simplistic since they do not cover the heterogeneity of the drivers’ behaviors. As a result, they fail to generalize over the wide range of possible behaviors. For each paper investigated, we analyze how the authors formulated the MARL problem in terms of observation, action, and rewards to match the paradigm they apply.</description><identifier>ISSN: 2730-616X</identifier><identifier>EISSN: 2730-616X</identifier><identifier>DOI: 10.1007/s43684-022-00045-z</identifier><language>eng</language><publisher>Singapore: Springer Nature Singapore</publisher><subject>Algorithms ; Artificial Intelligence ; Automation ; Autonomous cars ; Autonomous Vehicles ; Control and Systems Theory ; Decision making ; Driver behavior ; Engineering ; Engineering Sciences ; Heterogeneity ; Humans and Autonomous Entities in Sustainable Intelligent Transportation Solutions ; Intelligent systems ; Learning ; Machine Learning ; Multi-agent reinforcement learning ; Multiagent systems ; Review ; Robotics and Automation ; Simulation ; Taxonomy ; Traffic flow ; Traffic safety ; Unmanned aerial vehicles</subject><ispartof>Autonomous intelligent systems, 2022-11, Vol.2 (1), p.1-12, Article 27</ispartof><rights>The Author(s) 2022</rights><rights>The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>Distributed under a Creative Commons Attribution 4.0 International License</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c378z-e9b1a6e391766500f5a6287183b0bf890ea55799f5b74080c81ca4d60412b1693</citedby><cites>FETCH-LOGICAL-c378z-e9b1a6e391766500f5a6287183b0bf890ea55799f5b74080c81ca4d60412b1693</cites><orcidid>0000-0001-7722-9848 ; 0000-0002-3449-8279 ; 0000-0001-6417-733X ; 0000-0003-3967-1242</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2736515035/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2736515035?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,780,784,885,25753,27924,27925,37012,44590,75126</link.rule.ids><backlink>$$Uhttps://hal.science/hal-04451199$$DView record in HAL$$Hfree_for_read</backlink></links><search><creatorcontrib>Dinneweth, Joris</creatorcontrib><creatorcontrib>Boubezoul, Abderrahmane</creatorcontrib><creatorcontrib>Mandiau, René</creatorcontrib><creatorcontrib>Espié, Stéphane</creatorcontrib><title>Multi-agent reinforcement learning for autonomous vehicles: a survey</title><title>Autonomous intelligent systems</title><addtitle>Auton. Intell. Syst</addtitle><description>In the near future, autonomous vehicles (AVs) may cohabit with human drivers in mixed traffic. This cohabitation raises serious challenges, both in terms of traffic flow and individual mobility, as well as from the road safety point of view. Mixed traffic may fail to fulfill expected security requirements due to the heterogeneity and unpredictability of human drivers, and autonomous cars could then monopolize the traffic. Using multi-agent reinforcement learning (MARL) algorithms, researchers have attempted to design autonomous vehicles for both scenarios, and this paper investigates their recent advances. We focus on articles tackling decision-making problems and identify four paradigms. While some authors address mixed traffic problems with or without social-desirable AVs, others tackle the case of fully-autonomous traffic. While the latter case is essentially a communication problem, most authors addressing the mixed traffic admit some limitations. The current human driver models found in the literature are too simplistic since they do not cover the heterogeneity of the drivers’ behaviors. As a result, they fail to generalize over the wide range of possible behaviors. For each paper investigated, we analyze how the authors formulated the MARL problem in terms of observation, action, and rewards to match the paradigm they apply.</description><subject>Algorithms</subject><subject>Artificial Intelligence</subject><subject>Automation</subject><subject>Autonomous cars</subject><subject>Autonomous Vehicles</subject><subject>Control and Systems Theory</subject><subject>Decision making</subject><subject>Driver behavior</subject><subject>Engineering</subject><subject>Engineering Sciences</subject><subject>Heterogeneity</subject><subject>Humans and Autonomous Entities in Sustainable Intelligent Transportation Solutions</subject><subject>Intelligent systems</subject><subject>Learning</subject><subject>Machine Learning</subject><subject>Multi-agent reinforcement learning</subject><subject>Multiagent systems</subject><subject>Review</subject><subject>Robotics and Automation</subject><subject>Simulation</subject><subject>Taxonomy</subject><subject>Traffic flow</subject><subject>Traffic safety</subject><subject>Unmanned aerial vehicles</subject><issn>2730-616X</issn><issn>2730-616X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNp9kc1LwzAchosoOOb-AU8FTx6qv3wn3sb8hIkXBW8h7dLZ0TUzaQfbX29qxY-LpyQvz-8hyZskpwguEIC4DJRwSTPAOAMAyrL9QTLCgkDGEX89_LU_TiYhrCKEhSJU0lFy_djVbZWZpW3a1NuqKZ0v7Lo_1db4pmqWaYxS07WucWvXhXRr36qituEqNWno_NbuTpKj0tTBTr7WcfJye_M8u8_mT3cPs-k8K4iQ-8yqHBluiUKCcwZQMsOxFEiSHPJSKrCGMaFUyXJBQUIhUWHoggNFOEdckXHyMHgXzqz0xldr43famUp_Bs4vtfFtfzlNGWOUc4UFVrRcSEkZYMKZpBhzzovoOh9cb6b-o7qfznWfAaUMIaW2KLJnA7vx7r2zodUr1_kmPlXHr-UMMSAsUnigCu9C8Lb81iLQfVF6KErHovRnUXofh8gwFCLcLK3_Uf8z9QFFrZJs</recordid><startdate>20221116</startdate><enddate>20221116</enddate><creator>Dinneweth, Joris</creator><creator>Boubezoul, Abderrahmane</creator><creator>Mandiau, René</creator><creator>Espié, Stéphane</creator><general>Springer Nature Singapore</general><general>Springer Nature B.V</general><general>Springer</general><scope>C6C</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>L6V</scope><scope>M7S</scope><scope>P62</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>1XC</scope><scope>VOOES</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0001-7722-9848</orcidid><orcidid>https://orcid.org/0000-0002-3449-8279</orcidid><orcidid>https://orcid.org/0000-0001-6417-733X</orcidid><orcidid>https://orcid.org/0000-0003-3967-1242</orcidid></search><sort><creationdate>20221116</creationdate><title>Multi-agent reinforcement learning for autonomous vehicles: a survey</title><author>Dinneweth, Joris ; Boubezoul, Abderrahmane ; Mandiau, René ; Espié, Stéphane</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c378z-e9b1a6e391766500f5a6287183b0bf890ea55799f5b74080c81ca4d60412b1693</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Algorithms</topic><topic>Artificial Intelligence</topic><topic>Automation</topic><topic>Autonomous cars</topic><topic>Autonomous Vehicles</topic><topic>Control and Systems Theory</topic><topic>Decision making</topic><topic>Driver behavior</topic><topic>Engineering</topic><topic>Engineering Sciences</topic><topic>Heterogeneity</topic><topic>Humans and Autonomous Entities in Sustainable Intelligent Transportation Solutions</topic><topic>Intelligent systems</topic><topic>Learning</topic><topic>Machine Learning</topic><topic>Multi-agent reinforcement learning</topic><topic>Multiagent systems</topic><topic>Review</topic><topic>Robotics and Automation</topic><topic>Simulation</topic><topic>Taxonomy</topic><topic>Traffic flow</topic><topic>Traffic safety</topic><topic>Unmanned aerial vehicles</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Dinneweth, Joris</creatorcontrib><creatorcontrib>Boubezoul, Abderrahmane</creatorcontrib><creatorcontrib>Mandiau, René</creatorcontrib><creatorcontrib>Espié, Stéphane</creatorcontrib><collection>SpringerOpen</collection><collection>CrossRef</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies & Aerospace Database (1962 - current)</collection><collection>ProQuest Central Essentials</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>Hyper Article en Ligne (HAL)</collection><collection>Hyper Article en Ligne (HAL) (Open Access)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>Autonomous intelligent systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Dinneweth, Joris</au><au>Boubezoul, Abderrahmane</au><au>Mandiau, René</au><au>Espié, Stéphane</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Multi-agent reinforcement learning for autonomous vehicles: a survey</atitle><jtitle>Autonomous intelligent systems</jtitle><stitle>Auton. Intell. Syst</stitle><date>2022-11-16</date><risdate>2022</risdate><volume>2</volume><issue>1</issue><spage>1</spage><epage>12</epage><pages>1-12</pages><artnum>27</artnum><issn>2730-616X</issn><eissn>2730-616X</eissn><abstract>In the near future, autonomous vehicles (AVs) may cohabit with human drivers in mixed traffic. This cohabitation raises serious challenges, both in terms of traffic flow and individual mobility, as well as from the road safety point of view. Mixed traffic may fail to fulfill expected security requirements due to the heterogeneity and unpredictability of human drivers, and autonomous cars could then monopolize the traffic. Using multi-agent reinforcement learning (MARL) algorithms, researchers have attempted to design autonomous vehicles for both scenarios, and this paper investigates their recent advances. We focus on articles tackling decision-making problems and identify four paradigms. While some authors address mixed traffic problems with or without social-desirable AVs, others tackle the case of fully-autonomous traffic. While the latter case is essentially a communication problem, most authors addressing the mixed traffic admit some limitations. The current human driver models found in the literature are too simplistic since they do not cover the heterogeneity of the drivers’ behaviors. As a result, they fail to generalize over the wide range of possible behaviors. For each paper investigated, we analyze how the authors formulated the MARL problem in terms of observation, action, and rewards to match the paradigm they apply.</abstract><cop>Singapore</cop><pub>Springer Nature Singapore</pub><doi>10.1007/s43684-022-00045-z</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0001-7722-9848</orcidid><orcidid>https://orcid.org/0000-0002-3449-8279</orcidid><orcidid>https://orcid.org/0000-0001-6417-733X</orcidid><orcidid>https://orcid.org/0000-0003-3967-1242</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2730-616X |
ispartof | Autonomous intelligent systems, 2022-11, Vol.2 (1), p.1-12, Article 27 |
issn | 2730-616X 2730-616X |
language | eng |
recordid | cdi_doaj_primary_oai_doaj_org_article_4555466927294fd8845023658422666c |
source | Publicly Available Content Database; Springer Nature - SpringerLink Journals - Fully Open Access |
subjects | Algorithms Artificial Intelligence Automation Autonomous cars Autonomous Vehicles Control and Systems Theory Decision making Driver behavior Engineering Engineering Sciences Heterogeneity Humans and Autonomous Entities in Sustainable Intelligent Transportation Solutions Intelligent systems Learning Machine Learning Multi-agent reinforcement learning Multiagent systems Review Robotics and Automation Simulation Taxonomy Traffic flow Traffic safety Unmanned aerial vehicles |
title | Multi-agent reinforcement learning for autonomous vehicles: a survey |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T00%3A14%3A20IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Multi-agent%20reinforcement%20learning%20for%20autonomous%20vehicles:%20a%20survey&rft.jtitle=Autonomous%20intelligent%20systems&rft.au=Dinneweth,%20Joris&rft.date=2022-11-16&rft.volume=2&rft.issue=1&rft.spage=1&rft.epage=12&rft.pages=1-12&rft.artnum=27&rft.issn=2730-616X&rft.eissn=2730-616X&rft_id=info:doi/10.1007/s43684-022-00045-z&rft_dat=%3Cproquest_doaj_%3E2736515035%3C/proquest_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c378z-e9b1a6e391766500f5a6287183b0bf890ea55799f5b74080c81ca4d60412b1693%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2736515035&rft_id=info:pmid/&rfr_iscdi=true |