Reinforcement Learning Controllers for Soft Robots using Learned   Environments

Uljad Berdica; Matthew Jackson; Niccol\`o Enrico Veronese; Jakob; Foerster; Perla Maiolino

arXiv:2410.18519·cs.RO·October 28, 2024

Reinforcement Learning Controllers for Soft Robots using Learned Environments

Uljad Berdica, Matthew Jackson, Niccol\`o Enrico Veronese, Jakob, Foerster, Perla Maiolino

PDF

1 Repo

TL;DR

This paper presents a reinforcement learning framework for soft robot control that uses learned environment models and safety-oriented exploration, enabling efficient policy learning without prior robot knowledge.

Contribution

It introduces a novel RL approach with learned synthetic environments and safety exploration protocols for soft robotic manipulators, improving control policy training.

Findings

01

Effective RL control policies learned in simulated environments

02

Safe actuation space exploration enhances real-world applicability

03

Framework enables benchmarking in soft robotics control

Abstract

Soft robotic manipulators offer operational advantage due to their compliant and deformable structures. However, their inherently nonlinear dynamics presents substantial challenges. Traditional analytical methods often depend on simplifying assumptions, while learning-based techniques can be computationally demanding and limit the control policies to existing data. This paper introduces a novel approach to soft robotic control, leveraging state-of-the-art policy gradient methods within parallelizable synthetic environments learned from data. We also propose a safety oriented actuation space exploration protocol via cascaded updates and weighted randomness. Specifically, our recurrent forward dynamics model is learned by generating a training dataset from a physically safe \textit{mean reverting} random walk in actuation space to explore the partially-observed state-space. We demonstrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

uljad/SoRoLEX
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.