TL;DR
This paper presents a novel reset-free trial-and-error learning algorithm enabling damaged robots to autonomously recover their locomotion abilities in complex environments without human intervention.
Contribution
The paper introduces RTE, a new RL algorithm that pre-generates behaviors and allows complex robots to recover from damage without resets or extensive training.
Findings
Robots successfully recover locomotion after damage in simulation and real-world tests.
RTE outperforms traditional RL methods in recovery speed and success rate.
Robots operate autonomously without human intervention during recovery.
Abstract
The high probability of hardware failures prevents many advanced robots (e.g., legged robots) from being confidently deployed in real-world situations (e.g., post-disaster rescue). Instead of attempting to diagnose the failures, robots could adapt by trial-and-error in order to be able to complete their tasks. In this situation, damage recovery can be seen as a Reinforcement Learning (RL) problem. However, the best RL algorithms for robotics require the robot and the environment to be reset to an initial state after each episode, that is, the robot is not learning autonomously. In addition, most of the RL methods for robotics do not scale well with complex robots (e.g., walking robots) and either cannot be used at all or take too long to converge to a solution (e.g., hours of learning). In this paper, we introduce a novel learning algorithm called "Reset-free Trial-and-Error" (RTE) that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
