Back-stepping Experience Replay with Application to Model-free   Reinforcement Learning for a Soft Snake Robot

Xinda Qi; Dong Chen; Zhaojian Li; Xiaobo Tan

arXiv:2401.11372·cs.RO·September 25, 2024·1 cites

Back-stepping Experience Replay with Application to Model-free Reinforcement Learning for a Soft Snake Robot

Xinda Qi, Dong Chen, Zhaojian Li, Xiaobo Tan

PDF

Open Access

TL;DR

This paper introduces Back-stepping Experience Replay (BER), a novel method to improve learning efficiency in model-free reinforcement learning for soft robots by constructing reversed trajectories, demonstrated on a soft snake robot with significant speed improvements.

Contribution

The paper presents BER, a new off-policy RL technique compatible with arbitrary algorithms, tailored for systems with approximate reversibility, applied successfully to a soft snake robot.

Findings

01

Achieved 100% success rate in target reaching tasks.

02

Robot's average speed was 48% faster than baseline methods.

03

BER effectively enhances learning efficiency in soft robot locomotion.

Abstract

In this paper, we propose a novel technique, Back-stepping Experience Replay (BER), that is compatible with arbitrary off-policy reinforcement learning (RL) algorithms. BER aims to enhance learning efficiency in systems with approximate reversibility, reducing the need for complex reward shaping. The method constructs reversed trajectories using back-stepping transitions to reach random or fixed targets. Interpretable as a bi-directional approach, BER addresses inaccuracies in back-stepping transitions through a distillation of the replay experience during learning. Given the intricate nature of soft robots and their complex interactions with environments, we present an application of BER in a model-free RL approach for the locomotion and navigation of a soft snake robot, which is capable of serpentine motion enabled by anisotropic friction between the body and ground. In addition, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoft Robotics and Applications · Reinforcement Learning in Robotics · Robot Manipulation and Learning

MethodsExperience Replay · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings