Efficient Reinforcement Learning for Jumping Monopods
Riccardo Bussola, Michele Focchi, Andrea Del Prete, Daniele, Fontanelli, Luigi Palopoli

TL;DR
This paper presents an RL-based control method for a jumping monopod that incorporates physical knowledge to reduce learning time and improve adaptability, outperforming traditional optimization and end-to-end RL methods.
Contribution
The paper introduces a guided RL approach that injects physical knowledge into the learning process for complex monopod jumping tasks, enhancing efficiency and robustness.
Findings
Significantly reduces learning time compared to standard RL.
Improves the monopod’s ability to adapt to uneven terrain.
Outperforms optimization-based and end-to-end RL methods.
Abstract
In this work, we consider the complex control problem of making a monopod reach a target with a jump. The monopod can jump in any direction and the terrain underneath its foot can be uneven. This is a template of a much larger class of problems, which are extremely challenging and computationally expensive to solve using standard optimisation-based techniques. Reinforcement Learning (RL) could be an interesting alternative, but the application of an end-to-end approach in which the controller must learn everything from scratch, is impractical. The solution advocated in this paper is to guide the learning process within an RL framework by injecting physical knowledge. This expedient brings to widespread benefits, such as a drastic reduction of the learning time, and the ability to learn and compensate for possible errors in the low-level controller executing the motion. We demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Locomotion and Control · Virology and Viral Diseases · Protein Degradation and Inhibitors
