Meta Reinforcement Learning for Optimal Design of Legged Robots
\'Alvaro Belmonte-Baeza, Joonho Lee, Giorgio Valsecchi, Marco Hutter

TL;DR
This paper introduces a meta reinforcement learning framework for optimizing legged robot designs, enabling rapid adaptation and high performance without relying on predefined control rules or motion templates.
Contribution
The authors develop a model-free meta reinforcement learning approach for end-to-end design and control optimization of quadrupedal robots, surpassing traditional methods constrained by predefined motions.
Findings
Meta policy adapts quickly to different robot designs.
Achieves near-optimal control performance after adaptation.
Outperforms model-based baseline in design optimization.
Abstract
The process of robot design is a complex task and the majority of design decisions are still based on human intuition or tedious manual tuning. A more informed way of facing this task is computational design methods where design parameters are concurrently optimized with corresponding controllers. Existing approaches, however, are strongly influenced by predefined control rules or motion templates and cannot provide end-to-end solutions. In this paper, we present a design optimization framework using model-free meta reinforcement learning, and its application to the optimizing kinematics and actuator parameters of quadrupedal robots. We use meta reinforcement learning to train a locomotion policy that can quickly adapt to different designs. This policy is used to evaluate each design instance during the design optimization. We demonstrate that the policy can control robots of different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
