Loading…

Shared autonomy via hindsight optimization for teleoperation and teaming

In shared autonomy, a user and autonomous system work together to achieve shared goals. To collaborate effectively, the autonomous system must know the user’s goal. As such, most prior works follow a predict-then-act model, first predicting the user’s goal with high confidence, then assisting given...

Full description

Saved in:

Bibliographic Details
Published in:	The International journal of robotics research 2018-06, Vol.37 (7), p.717-742
Main Authors:	Javdani, Shervin, Admoni, Henny, Pellegrinelli, Stefania, Srinivasa, Siddhartha S., Bagnell, J. Andrew
Format:	Article
Language:	English
Subjects:	Autonomy Markov analysis Markov chains Optimization
Citations:	Items that this one cites Items that cite this one
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	In shared autonomy, a user and autonomous system work together to achieve shared goals. To collaborate effectively, the autonomous system must know the user’s goal. As such, most prior works follow a predict-then-act model, first predicting the user’s goal with high confidence, then assisting given that goal. Unfortunately, confidently predicting the user’s goal may not be possible until they have nearly achieved it, causing predict-then-act methods to provide little assistance. However, the system can often provide useful assistance even when confidence for any single goal is low (e.g. move towards multiple goals). In this work, we formalize this insight by modeling shared autonomy as a partially observable Markov decision process (POMDP), providing assistance that minimizes the expected cost-to-go with an unknown goal. As solving this POMDP optimally is intractable, we use hindsight optimization to approximate. We apply our framework to both shared-control teleoperation and human–robot teaming. Compared with predict-then-act methods, our method achieves goals faster, requires less user input, decreases user idling time, and results in fewer user–robot collisions.
ISSN:	0278-3649 1741-3176
DOI:	10.1177/0278364918776060