
Physical interaction as communication: Learning robot objectives online from human corrections

When a robot performs a task next to a human, physical interaction is inevitable: the human might push, pull, twist, or guide the robot. The state of the art treats these interactions as disturbances that the robot should reject or avoid. At best, these robots respond safely while the human interact...

Bibliographic Details
Published in:	The International journal of robotics research 2022-01, Vol.41 (1), p.20-44
Main Authors:	Losey, Dylan P.; Bajcsy, Andrea; O’Malley, Marcia K.; Dragan, Anca D.
Format:	Article
Language:	English
Subjects:	Human performance; Robot arms; Robot learning; Robots

cited_by	cdi_FETCH-LOGICAL-c355t-ee0eb4134bba7b254120a018b205fe834e0cf425a605882880223ac1d8b213c93
cites	cdi_FETCH-LOGICAL-c355t-ee0eb4134bba7b254120a018b205fe834e0cf425a605882880223ac1d8b213c93
container_end_page	44
container_issue	1
container_start_page	20
container_title	The International journal of robotics research
container_volume	41
creator	Losey, Dylan P.; Bajcsy, Andrea; O’Malley, Marcia K.; Dragan, Anca D.
description	When a robot performs a task next to a human, physical interaction is inevitable: the human might push, pull, twist, or guide the robot. The state of the art treats these interactions as disturbances that the robot should reject or avoid. At best, these robots respond safely while the human interacts; but after the human lets go, these robots simply return to their original behavior. We recognize that physical human–robot interaction (pHRI) is often intentional: the human intervenes on purpose because the robot is not doing the task correctly. In this article, we argue that when pHRI is intentional it is also informative: the robot can leverage interactions to learn how it should complete the rest of its current task even after the person lets go. We formalize pHRI as a dynamical system, where the human has in mind an objective function they want the robot to optimize, but the robot does not get direct access to the parameters of this objective: they are internal to the human. Within our proposed framework, human interactions become observations about the true objective. We introduce approximations to learn from and respond to pHRI in real time. We recognize that not all human corrections are perfect: often users interact with the robot noisily, and so we improve the efficiency of robot learning from pHRI by reducing unintended learning. Finally, we conduct simulations and user studies on a robotic manipulator to compare our proposed approach with the state of the art. Our results indicate that learning from pHRI leads to better task performance and improved human satisfaction.
doi_str_mv	10.1177/02783649211050958
format	article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2621188954</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sage_id>10.1177_02783649211050958</sage_id><sourcerecordid>2621188954</sourcerecordid><originalsourceid>FETCH-LOGICAL-c355t-ee0eb4134bba7b254120a018b205fe834e0cf425a605882880223ac1d8b213c93</originalsourceid><addsrcrecordid>eNp1kE1Lw0AQhhdRsFZ_gLcFz6kz-5FsvEnRKhT0oFfDZt20Kc1u3U2E_ns3VvAgngZmnucdeAm5RJghFsU1sELxXJQMESSUUh2RCRYCM45Ffkwm4z0bgVNyFuMGAHgO5YS8Pa_3sTV6S1vX26BN33pHdaTGd93g0mVc3NCl1cG1bkWDr31Pfb2xCf20kXq3bZ2lTfAdXQ-ddkkNwX4HxXNy0uhttBc_c0pe7-9e5g_Z8mnxOL9dZoZL2WfWgq0FclHXuqiZFMhAA6qagWys4sKCaQSTOgepFFMKGOPa4HsikJuST8nVIXcX_MdgY19t_BBcelmxPHWiVClFovBAmeBjDLapdqHtdNhXCNVYY_WnxuTMDk7UK_ub-r_wBdNgcpQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2621188954</pqid></control><display><type>article</type><title>Physical interaction as communication: Learning robot objectives online from human corrections</title><source>SAGE:Jisc Collections:SAGE Journals Read and Publish 2023-2024: Reading List</source><creator>Losey, Dylan P. ; Bajcsy, Andrea ; O’Malley, Marcia K. ; Dragan, Anca D.</creator><creatorcontrib>Losey, Dylan P. ; Bajcsy, Andrea ; O’Malley, Marcia K. ; Dragan, Anca D.</creatorcontrib><description>When a robot performs a task next to a human, physical interaction is inevitable: the human might push, pull, twist, or guide the robot. The state of the art treats these interactions as disturbances that the robot should reject or avoid. At best, these robots respond safely while the human interacts; but after the human lets go, these robots simply return to their original behavior. We recognize that physical human–robot interaction (pHRI) is often intentional: the human intervenes on purpose because the robot is not doing the task correctly. 
In this article, we argue that when pHRI is intentional it is also informative: the robot can leverage interactions to learn how it should complete the rest of its current task even after the person lets go. We formalize pHRI as a dynamical system, where the human has in mind an objective function they want the robot to optimize, but the robot does not get direct access to the parameters of this objective: they are internal to the human. Within our proposed framework human interactions become observations about the true objective. We introduce approximations to learn from and respond to pHRI in real-time. We recognize that not all human corrections are perfect: often users interact with the robot noisily, and so we improve the efficiency of robot learning from pHRI by reducing unintended learning. Finally, we conduct simulations and user studies on a robotic manipulator to compare our proposed approach with the state of the art. Our results indicate that learning from pHRI leads to better task performance and improved human satisfaction.</description><identifier>ISSN: 0278-3649</identifier><identifier>EISSN: 1741-3176</identifier><identifier>DOI: 10.1177/02783649211050958</identifier><language>eng</language><publisher>London, England: SAGE Publications</publisher><subject>Human performance ; Robot arms ; Robot learning ; Robots</subject><ispartof>The International journal of robotics research, 2022-01, Vol.41 (1), p.20-44</ispartof><rights>The Author(s) 
2021</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c355t-ee0eb4134bba7b254120a018b205fe834e0cf425a605882880223ac1d8b213c93</citedby><cites>FETCH-LOGICAL-c355t-ee0eb4134bba7b254120a018b205fe834e0cf425a605882880223ac1d8b213c93</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27922,27923,79134</link.rule.ids></links><search><creatorcontrib>Losey, Dylan P.</creatorcontrib><creatorcontrib>Bajcsy, Andrea</creatorcontrib><creatorcontrib>O’Malley, Marcia K.</creatorcontrib><creatorcontrib>Dragan, Anca D.</creatorcontrib><title>Physical interaction as communication: Learning robot objectives online from human corrections</title><title>The International journal of robotics research</title><description>When a robot performs a task next to a human, physical interaction is inevitable: the human might push, pull, twist, or guide the robot. The state of the art treats these interactions as disturbances that the robot should reject or avoid. At best, these robots respond safely while the human interacts; but after the human lets go, these robots simply return to their original behavior. We recognize that physical human–robot interaction (pHRI) is often intentional: the human intervenes on purpose because the robot is not doing the task correctly. In this article, we argue that when pHRI is intentional it is also informative: the robot can leverage interactions to learn how it should complete the rest of its current task even after the person lets go. We formalize pHRI as a dynamical system, where the human has in mind an objective function they want the robot to optimize, but the robot does not get direct access to the parameters of this objective: they are internal to the human. 
Within our proposed framework human interactions become observations about the true objective. We introduce approximations to learn from and respond to pHRI in real-time. We recognize that not all human corrections are perfect: often users interact with the robot noisily, and so we improve the efficiency of robot learning from pHRI by reducing unintended learning. Finally, we conduct simulations and user studies on a robotic manipulator to compare our proposed approach with the state of the art. Our results indicate that learning from pHRI leads to better task performance and improved human satisfaction.</description><subject>Human performance</subject><subject>Robot arms</subject><subject>Robot learning</subject><subject>Robots</subject><issn>0278-3649</issn><issn>1741-3176</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNp1kE1Lw0AQhhdRsFZ_gLcFz6kz-5FsvEnRKhT0oFfDZt20Kc1u3U2E_ns3VvAgngZmnucdeAm5RJghFsU1sELxXJQMESSUUh2RCRYCM45Ffkwm4z0bgVNyFuMGAHgO5YS8Pa_3sTV6S1vX26BN33pHdaTGd93g0mVc3NCl1cG1bkWDr31Pfb2xCf20kXq3bZ2lTfAdXQ-ddkkNwX4HxXNy0uhttBc_c0pe7-9e5g_Z8mnxOL9dZoZL2WfWgq0FclHXuqiZFMhAA6qagWys4sKCaQSTOgepFFMKGOPa4HsikJuST8nVIXcX_MdgY19t_BBcelmxPHWiVClFovBAmeBjDLapdqHtdNhXCNVYY_WnxuTMDk7UK_ub-r_wBdNgcpQ</recordid><startdate>202201</startdate><enddate>202201</enddate><creator>Losey, Dylan P.</creator><creator>Bajcsy, Andrea</creator><creator>O’Malley, Marcia K.</creator><creator>Dragan, Anca D.</creator><general>SAGE Publications</general><general>SAGE PUBLICATIONS, INC</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7TB</scope><scope>8FD</scope><scope>FR3</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>202201</creationdate><title>Physical interaction as communication: Learning robot objectives online from human corrections</title><author>Losey, Dylan P. 
; Bajcsy, Andrea ; O’Malley, Marcia K. ; Dragan, Anca D.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c355t-ee0eb4134bba7b254120a018b205fe834e0cf425a605882880223ac1d8b213c93</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Human performance</topic><topic>Robot arms</topic><topic>Robot learning</topic><topic>Robots</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Losey, Dylan P.</creatorcontrib><creatorcontrib>Bajcsy, Andrea</creatorcontrib><creatorcontrib>O’Malley, Marcia K.</creatorcontrib><creatorcontrib>Dragan, Anca D.</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Mechanical & Transportation Engineering Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>The International journal of robotics research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Losey, Dylan P.</au><au>Bajcsy, Andrea</au><au>O’Malley, Marcia K.</au><au>Dragan, Anca D.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Physical interaction as communication: Learning robot objectives online from human corrections</atitle><jtitle>The International journal of robotics 
research</jtitle><date>2022-01</date><risdate>2022</risdate><volume>41</volume><issue>1</issue><spage>20</spage><epage>44</epage><pages>20-44</pages><issn>0278-3649</issn><eissn>1741-3176</eissn><abstract>When a robot performs a task next to a human, physical interaction is inevitable: the human might push, pull, twist, or guide the robot. The state of the art treats these interactions as disturbances that the robot should reject or avoid. At best, these robots respond safely while the human interacts; but after the human lets go, these robots simply return to their original behavior. We recognize that physical human–robot interaction (pHRI) is often intentional: the human intervenes on purpose because the robot is not doing the task correctly. In this article, we argue that when pHRI is intentional it is also informative: the robot can leverage interactions to learn how it should complete the rest of its current task even after the person lets go. We formalize pHRI as a dynamical system, where the human has in mind an objective function they want the robot to optimize, but the robot does not get direct access to the parameters of this objective: they are internal to the human. Within our proposed framework human interactions become observations about the true objective. We introduce approximations to learn from and respond to pHRI in real-time. We recognize that not all human corrections are perfect: often users interact with the robot noisily, and so we improve the efficiency of robot learning from pHRI by reducing unintended learning. Finally, we conduct simulations and user studies on a robotic manipulator to compare our proposed approach with the state of the art. Our results indicate that learning from pHRI leads to better task performance and improved human satisfaction.</abstract><cop>London, England</cop><pub>SAGE Publications</pub><doi>10.1177/02783649211050958</doi><tpages>25</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 0278-3649
ispartof	The International journal of robotics research, 2022-01, Vol.41 (1), p.20-44
issn	0278-3649 1741-3176
language	eng
recordid	cdi_proquest_journals_2621188954
source	SAGE:Jisc Collections:SAGE Journals Read and Publish 2023-2024: Reading List
subjects	Human performance; Robot arms; Robot learning; Robots
title	Physical interaction as communication: Learning robot objectives online from human corrections
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-14T11%3A19%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Physical%20interaction%20as%20communication:%20Learning%20robot%20objectives%20online%20from%20human%20corrections&rft.jtitle=The%20International%20journal%20of%20robotics%20research&rft.au=Losey,%20Dylan%20P.&rft.date=2022-01&rft.volume=41&rft.issue=1&rft.spage=20&rft.epage=44&rft.pages=20-44&rft.issn=0278-3649&rft.eissn=1741-3176&rft_id=info:doi/10.1177/02783649211050958&rft_dat=%3Cproquest_cross%3E2621188954%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c355t-ee0eb4134bba7b254120a018b205fe834e0cf425a605882880223ac1d8b213c93%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2621188954&rft_id=info:pmid/&rft_sage_id=10.1177_02783649211050958&rfr_iscdi=true