Meta Reinforcement Learning for Optimal Design of Legged Robots

\'Alvaro Belmonte-Baeza; Joonho Lee; Giorgio Valsecchi; Marco Hutter

arXiv:2210.02750·cs.RO·October 7, 2022

Meta Reinforcement Learning for Optimal Design of Legged Robots

\'Alvaro Belmonte-Baeza, Joonho Lee, Giorgio Valsecchi, Marco Hutter

PDF

TL;DR

This paper introduces a meta reinforcement learning framework for optimizing legged robot designs, enabling rapid adaptation and high performance without relying on predefined control rules or motion templates.

Contribution

The authors develop a model-free meta reinforcement learning approach for end-to-end design and control optimization of quadrupedal robots, surpassing traditional methods constrained by predefined motions.

Findings

01

Meta policy adapts quickly to different robot designs.

02

Achieves near-optimal control performance after adaptation.

03

Outperforms model-based baseline in design optimization.

Abstract

The process of robot design is a complex task and the majority of design decisions are still based on human intuition or tedious manual tuning. A more informed way of facing this task is computational design methods where design parameters are concurrently optimized with corresponding controllers. Existing approaches, however, are strongly influenced by predefined control rules or motion templates and cannot provide end-to-end solutions. In this paper, we present a design optimization framework using model-free meta reinforcement learning, and its application to the optimizing kinematics and actuator parameters of quadrupedal robots. We use meta reinforcement learning to train a locomotion policy that can quickly adapt to different designs. This policy is used to evaluate each design instance during the design optimization. We demonstrate that the policy can control robots of different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.