Loading…

Application of Soft Actor-Critic Reinforcement Learning to a Search and Rescue Task for Humanoid Robots

This paper proposes a novel maximum entropy based reinforcement learning for dealing with a robotic search and rescue task in a complex enclosed environment. The search and rescue task is described as a Markov Decision Process, under which an auxiliary reward function at multiple stages is designed...

Full description

Saved in:
Bibliographic Details
Main Authors: Ji, Hongxuan, Yin, Chenkun
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This paper proposes a novel maximum entropy based reinforcement learning for dealing with a robotic search and rescue task in a complex enclosed environment. The search and rescue task is described as a Markov Decision Process, under which an auxiliary reward function at multiple stages is designed for the robot and its interaction with the specified environment. A variant of the state-of-art reinforcement learning algorithm, goal-based Soft Actor-Critic (SAC), is developed to train a humanoid robot. Simulation results verify the effectiveness of the proposed goal-based SAC algorithm and its advantages comparing with the prototype of SAC algorithm for the same task.
ISSN:2688-0938
DOI:10.1109/CAC57257.2022.10056003