Loading…
Bayesian Meta-Learning for Few-Shot Policy Adaptation Across Robotic Platforms
Reinforcement learning methods can achieve significant performance but require a large amount of training data collected on the same robotic platform. A policy trained with expensive data is rendered useless after making even a minor change to the robot hardware. In this paper, we address the challe...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | Conference Proceeding |
Language: | English |
Subjects: | |
Online Access: | Request full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | |
container_end_page | 1280 |
container_issue | |
container_start_page | 1274 |
container_title | |
container_volume | |
creator | Ghadirzadeh, Ali Chen, Xi Poklukar, Petra Finn, Chelsea Bjorkman, Marten Kragic, Danica |
description | Reinforcement learning methods can achieve significant performance but require a large amount of training data collected on the same robotic platform. A policy trained with expensive data is rendered useless after making even a minor change to the robot hardware. In this paper, we address the challenging problem of adapting a policy, trained to perform a task, to a novel robotic hardware platform given only few demonstrations of robot motion trajectories on the target robot. We formulate it as a few-shot meta-learning problem where the goal is to find a meta-model that captures the common structure shared across different robotic platforms such that data-efficient adaptation can be performed. We achieve such adaptation by introducing a learning framework consisting of a probabilistic gradient-based meta-learning algorithm that models the uncertainty arising from the few-shot setting with a low-dimensional latent variable. We experimentally evaluate our framework on a simulated reaching and a real-robot picking task using 400 simulated robots generated by varying the physical parameters of an existing set of robotic platforms. Our results show that the proposed method can successfully adapt a trained policy to different robotic platforms with novel physical parameters and the superiority of our meta-learning algorithm compared to state-of-the-art methods for the introduced few-shot policy adaptation problem. |
doi_str_mv | 10.1109/IROS51168.2021.9636628 |
format | conference_proceeding |
fullrecord | <record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_9636628</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9636628</ieee_id><sourcerecordid>9636628</sourcerecordid><originalsourceid>FETCH-LOGICAL-i241t-445cac528b8f09ec4593a1c37a770e9b090d66b5deebebdd60ddb9347aceadf83</originalsourceid><addsrcrecordid>eNotj81KAzEYAKMgWGufQJC8wNZk87PJsRarhdWWVs_lS_KtRtpN2QSkb69gT3OagSHknrMp58w-LDerreJcm2nNaj61Wmhdmwtyw7VWkjdcqksyqrkSFTNaX5NJzt-MMc4aa6wekbdHOGGO0NNXLFC1CEMf-0_apYEu8KfafqVC12kf_YnOAhwLlJh6OvNDyplukkslerreQ_kzDvmWXHWwzzg5c0w-Fk_v85eqXT0v57O2irXkpZJSefCqNs50zKKXygrgXjTQNAytY5YFrZ0KiA5dCJqF4KyQDXiE0BkxJnf_3YiIu-MQDzCcdud98QvUXVCm</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Bayesian Meta-Learning for Few-Shot Policy Adaptation Across Robotic Platforms</title><source>IEEE Xplore All Conference Series</source><creator>Ghadirzadeh, Ali ; Chen, Xi ; Poklukar, Petra ; Finn, Chelsea ; Bjorkman, Marten ; Kragic, Danica</creator><creatorcontrib>Ghadirzadeh, Ali ; Chen, Xi ; Poklukar, Petra ; Finn, Chelsea ; Bjorkman, Marten ; Kragic, Danica</creatorcontrib><description>Reinforcement learning methods can achieve significant performance but require a large amount of training data collected on the same robotic platform. A policy trained with expensive data is rendered useless after making even a minor change to the robot hardware. In this paper, we address the challenging problem of adapting a policy, trained to perform a task, to a novel robotic hardware platform given only few demonstrations of robot motion trajectories on the target robot. We formulate it as a few-shot meta-learning problem where the goal is to find a meta-model that captures the common structure shared across different robotic platforms such that data-efficient adaptation can be performed. We achieve such adaptation by introducing a learning framework consisting of a probabilistic gradient-based meta-learning algorithm that models the uncertainty arising from the few-shot setting with a low-dimensional latent variable. We experimentally evaluate our framework on a simulated reaching and a real-robot picking task using 400 simulated robots generated by varying the physical parameters of an existing set of robotic platforms. Our results show that the proposed method can successfully adapt a trained policy to different robotic platforms with novel physical parameters and the superiority of our meta-learning algorithm compared to state-of-the-art methods for the introduced few-shot policy adaptation problem.</description><identifier>EISSN: 2153-0866</identifier><identifier>EISBN: 1665417145</identifier><identifier>EISBN: 9781665417143</identifier><identifier>DOI: 10.1109/IROS51168.2021.9636628</identifier><language>eng</language><publisher>IEEE</publisher><subject>Adaptation models ; Hardware ; Probabilistic logic ; Reinforcement learning ; Robot motion ; Training data ; Uncertainty</subject><ispartof>2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021, p.1274-1280</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9636628$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,23909,23910,25118,27902,54530,54907</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/9636628$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Ghadirzadeh, Ali</creatorcontrib><creatorcontrib>Chen, Xi</creatorcontrib><creatorcontrib>Poklukar, Petra</creatorcontrib><creatorcontrib>Finn, Chelsea</creatorcontrib><creatorcontrib>Bjorkman, Marten</creatorcontrib><creatorcontrib>Kragic, Danica</creatorcontrib><title>Bayesian Meta-Learning for Few-Shot Policy Adaptation Across Robotic Platforms</title><title>2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)</title><addtitle>IROS</addtitle><description>Reinforcement learning methods can achieve significant performance but require a large amount of training data collected on the same robotic platform. A policy trained with expensive data is rendered useless after making even a minor change to the robot hardware. In this paper, we address the challenging problem of adapting a policy, trained to perform a task, to a novel robotic hardware platform given only few demonstrations of robot motion trajectories on the target robot. We formulate it as a few-shot meta-learning problem where the goal is to find a meta-model that captures the common structure shared across different robotic platforms such that data-efficient adaptation can be performed. We achieve such adaptation by introducing a learning framework consisting of a probabilistic gradient-based meta-learning algorithm that models the uncertainty arising from the few-shot setting with a low-dimensional latent variable. We experimentally evaluate our framework on a simulated reaching and a real-robot picking task using 400 simulated robots generated by varying the physical parameters of an existing set of robotic platforms. Our results show that the proposed method can successfully adapt a trained policy to different robotic platforms with novel physical parameters and the superiority of our meta-learning algorithm compared to state-of-the-art methods for the introduced few-shot policy adaptation problem.</description><subject>Adaptation models</subject><subject>Hardware</subject><subject>Probabilistic logic</subject><subject>Reinforcement learning</subject><subject>Robot motion</subject><subject>Training data</subject><subject>Uncertainty</subject><issn>2153-0866</issn><isbn>1665417145</isbn><isbn>9781665417143</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2021</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNotj81KAzEYAKMgWGufQJC8wNZk87PJsRarhdWWVs_lS_KtRtpN2QSkb69gT3OagSHknrMp58w-LDerreJcm2nNaj61Wmhdmwtyw7VWkjdcqksyqrkSFTNaX5NJzt-MMc4aa6wekbdHOGGO0NNXLFC1CEMf-0_apYEu8KfafqVC12kf_YnOAhwLlJh6OvNDyplukkslerreQ_kzDvmWXHWwzzg5c0w-Fk_v85eqXT0v57O2irXkpZJSefCqNs50zKKXygrgXjTQNAytY5YFrZ0KiA5dCJqF4KyQDXiE0BkxJnf_3YiIu-MQDzCcdud98QvUXVCm</recordid><startdate>20210927</startdate><enddate>20210927</enddate><creator>Ghadirzadeh, Ali</creator><creator>Chen, Xi</creator><creator>Poklukar, Petra</creator><creator>Finn, Chelsea</creator><creator>Bjorkman, Marten</creator><creator>Kragic, Danica</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>20210927</creationdate><title>Bayesian Meta-Learning for Few-Shot Policy Adaptation Across Robotic Platforms</title><author>Ghadirzadeh, Ali ; Chen, Xi ; Poklukar, Petra ; Finn, Chelsea ; Bjorkman, Marten ; Kragic, Danica</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i241t-445cac528b8f09ec4593a1c37a770e9b090d66b5deebebdd60ddb9347aceadf83</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Adaptation models</topic><topic>Hardware</topic><topic>Probabilistic logic</topic><topic>Reinforcement learning</topic><topic>Robot motion</topic><topic>Training data</topic><topic>Uncertainty</topic><toplevel>online_resources</toplevel><creatorcontrib>Ghadirzadeh, Ali</creatorcontrib><creatorcontrib>Chen, Xi</creatorcontrib><creatorcontrib>Poklukar, Petra</creatorcontrib><creatorcontrib>Finn, Chelsea</creatorcontrib><creatorcontrib>Bjorkman, Marten</creatorcontrib><creatorcontrib>Kragic, Danica</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Xplore</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Ghadirzadeh, Ali</au><au>Chen, Xi</au><au>Poklukar, Petra</au><au>Finn, Chelsea</au><au>Bjorkman, Marten</au><au>Kragic, Danica</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Bayesian Meta-Learning for Few-Shot Policy Adaptation Across Robotic Platforms</atitle><btitle>2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)</btitle><stitle>IROS</stitle><date>2021-09-27</date><risdate>2021</risdate><spage>1274</spage><epage>1280</epage><pages>1274-1280</pages><eissn>2153-0866</eissn><eisbn>1665417145</eisbn><eisbn>9781665417143</eisbn><abstract>Reinforcement learning methods can achieve significant performance but require a large amount of training data collected on the same robotic platform. A policy trained with expensive data is rendered useless after making even a minor change to the robot hardware. In this paper, we address the challenging problem of adapting a policy, trained to perform a task, to a novel robotic hardware platform given only few demonstrations of robot motion trajectories on the target robot. We formulate it as a few-shot meta-learning problem where the goal is to find a meta-model that captures the common structure shared across different robotic platforms such that data-efficient adaptation can be performed. We achieve such adaptation by introducing a learning framework consisting of a probabilistic gradient-based meta-learning algorithm that models the uncertainty arising from the few-shot setting with a low-dimensional latent variable. We experimentally evaluate our framework on a simulated reaching and a real-robot picking task using 400 simulated robots generated by varying the physical parameters of an existing set of robotic platforms. Our results show that the proposed method can successfully adapt a trained policy to different robotic platforms with novel physical parameters and the superiority of our meta-learning algorithm compared to state-of-the-art methods for the introduced few-shot policy adaptation problem.</abstract><pub>IEEE</pub><doi>10.1109/IROS51168.2021.9636628</doi><tpages>7</tpages></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | EISSN: 2153-0866 |
ispartof | 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021, p.1274-1280 |
issn | 2153-0866 |
language | eng |
recordid | cdi_ieee_primary_9636628 |
source | IEEE Xplore All Conference Series |
subjects | Adaptation models Hardware Probabilistic logic Reinforcement learning Robot motion Training data Uncertainty |
title | Bayesian Meta-Learning for Few-Shot Policy Adaptation Across Robotic Platforms |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-30T22%3A59%3A56IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Bayesian%20Meta-Learning%20for%20Few-Shot%20Policy%20Adaptation%20Across%20Robotic%20Platforms&rft.btitle=2021%20IEEE/RSJ%20International%20Conference%20on%20Intelligent%20Robots%20and%20Systems%20(IROS)&rft.au=Ghadirzadeh,%20Ali&rft.date=2021-09-27&rft.spage=1274&rft.epage=1280&rft.pages=1274-1280&rft.eissn=2153-0866&rft_id=info:doi/10.1109/IROS51168.2021.9636628&rft.eisbn=1665417145&rft.eisbn_list=9781665417143&rft_dat=%3Cieee_CHZPO%3E9636628%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i241t-445cac528b8f09ec4593a1c37a770e9b090d66b5deebebdd60ddb9347aceadf83%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=9636628&rfr_iscdi=true |