A Reinforcement Learning-based Adaptive Time-Delay Control and Its Application to Robot Manipulators
This study proposes an innovative reinforcement learning-based time-delay control (RL-TDC) scheme to provide more intelligent, timely, and aggressive control efforts than the existing simple-structured adaptive time-delay controls (ATDCs) that are well-known for achieving good tracking performances...
Main Authors: | Baek, Seungmin; Baek, Jongchan; Choi, Jinsuk; Han, Soohee |
---|---|
Format: | Conference Proceeding |
Language: | English |
Subjects: | Adaptation models; Delay effects; Estimation; Heuristic algorithms; Manipulator dynamics; Reliability; Stability analysis |
container_end_page | 2729 |
---|---|
container_start_page | 2722 |
container_title | 2022 American Control Conference (ACC) |
creator | Baek, Seungmin; Baek, Jongchan; Choi, Jinsuk; Han, Soohee |
description | This study proposes an innovative reinforcement learning-based time-delay control (RL-TDC) scheme that provides more intelligent, timely, and aggressive control efforts than existing simple-structured adaptive time-delay controls (ATDCs), which are well known for achieving good tracking performance in practical applications. The proposed control scheme adopts a state-of-the-art RL algorithm, soft actor-critic (SAC), with which the inertia gain matrix of the time-delay control is adjusted to maximize the expected return obtained from tracking errors over all future time periods. By learning the dynamics of the robot manipulator with a data-driven approach and capturing its intractable and complicated phenomena, the proposed RL-TDC is trained to effectively suppress the inherent time-delay estimation (TDE) errors arising from time-delay control, thereby ensuring the best tracking performance within the given control capacity limits. Simulation with a robot manipulator demonstrates that the proposed RL-TDC avoids conservatively small control actions when large ones are required, thereby maximizing tracking performance, and that the stability condition is fully exploited to provide more effective control actions. |
doi_str_mv | 10.23919/ACC53348.2022.9867835 |
format | conference_proceeding |
eisbn | 9781665451963; 1665451963 |
publisher | American Automatic Control Council |
publication_date | 2022-06-08 |
fulltext | fulltext_linktorsrc |
identifier | EISSN: 2378-5861 |
ispartof | 2022 American Control Conference (ACC), 2022, p.2722-2729 |
issn | 2378-5861 |
language | eng |
recordid | cdi_ieee_primary_9867835 |
source | IEEE Xplore All Conference Series |
subjects | Adaptation models; Delay effects; Estimation; Heuristic algorithms; Manipulator dynamics; Reliability; Stability analysis |
title | A Reinforcement Learning-based Adaptive Time-Delay Control and Its Application to Robot Manipulators |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T17%3A42%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=A%20Reinforcement%20Learning-based%20Adaptive%20Time-Delay%20Control%20and%20Its%20Application%20to%20Robot%20Manipulators&rft.btitle=2022%20American%20Control%20Conference%20(ACC)&rft.au=Baek,%20Seungmin&rft.date=2022-06-08&rft.spage=2722&rft.epage=2729&rft.pages=2722-2729&rft.eissn=2378-5861&rft_id=info:doi/10.23919/ACC53348.2022.9867835&rft.eisbn=9781665451963&rft.eisbn_list=1665451963&rft_dat=%3Cieee_CHZPO%3E9867835%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i133t-510c265647f54e5d041fee5d64012fc4b7e4d41c4db8762de608036ae6a0a4c23%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=9867835&rfr_iscdi=true |
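The abstract describes a time-delay control (TDC) law whose unknown dynamics are estimated by time-delay estimation (TDE) from the previous sample's input and acceleration, with the inertia gain matrix adjusted by a SAC agent. The sketch below illustrates only the TDC/TDE structure under stated assumptions: the function name `rl_tdc_torque`, the gain symbols `M_bar`, `Kp`, `Kd`, and the specific TDE form are illustrative, not the paper's exact formulation, and the SAC adjustment of `M_bar` is omitted (here it is just a parameter).

```python
import numpy as np

def rl_tdc_torque(qdd_des, e, e_dot, qdd_prev, u_prev, M_bar, Kp, Kd):
    """One step of time-delay control (illustrative sketch).

    TDE approximates the lumped unknown dynamics from one sample earlier:
    tde = u_prev - M_bar @ qdd_prev. In the paper's RL-TDC, a soft
    actor-critic agent would adjust M_bar online; that loop is not shown.
    """
    # desired closed-loop injection: reference acceleration plus error feedback
    v = qdd_des + Kd @ e_dot + Kp @ e
    # time-delay estimate of the unknown dynamics
    tde = u_prev - M_bar @ qdd_prev
    # control torque
    return M_bar @ v + tde
```

With zero tracking error and no prior input the commanded torque is zero, and a larger `M_bar` scales the feedback term — the quantity the RL agent is described as tuning against the TDE error.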