UniLegs: Universal Multi-Legged Robot Control through Morphology-Agnostic Policy Distillation
Weijie Xi, Zhanxiang Cao, Chenlin Ming, Jianying Zheng, Guyue Zhou

TL;DR
This paper presents a two-stage policy distillation framework using Transformers to create a universal controller for various legged robot morphologies, achieving high performance and generalization to unseen designs.
Contribution
It introduces a morphology-agnostic policy distillation method with Transformer architecture, enabling a single controller to handle diverse legged robot morphologies.
Findings
Transformer-based policy achieves 94.47% of specialized teacher performance on training morphologies.
The approach maintains 72.64% performance on unseen robot designs.
Transformer architecture outperforms MLP baselines in modeling joint relationships.
Abstract
Developing controllers that generalize across diverse robot morphologies remains a significant challenge in legged locomotion. Traditional approaches either create specialized controllers for each morphology or compromise performance for generality. This paper introduces a two-stage teacher-student framework that bridges this gap through policy distillation. First, we train specialized teacher policies optimized for individual morphologies, capturing the unique optimal control strategies for each robot design. Then, we distill this specialized expertise into a single Transformer-based student policy capable of controlling robots with varying leg configurations. Our experiments across five distinct legged morphologies demonstrate that our approach preserves morphology-specific optimal behaviors, with the Transformer architecture achieving 94.47% of teacher performance on training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Locomotion and Control · Modular Robots and Swarm Intelligence
