Parkour in the Wild: Learning a General and Extensible Agile Locomotion Policy Using Multi-expert Distillation and RL Fine-tuning
Nikita Rudin, Junzhe He, Joshua Aurand, Marco Hutter

TL;DR
This paper presents a new framework combining multi-expert distillation and reinforcement learning fine-tuning to develop a versatile, robust legged robot locomotion policy capable of navigating diverse unstructured terrains.
Contribution
It introduces a novel multi-stage training process that integrates terrain-specific policies into a unified, adaptable locomotion controller using distillation and RL fine-tuning.
Findings
Significant performance improvements over existing methods.
Successful deployment on real-world robots demonstrating agility.
Robust navigation across diverse unstructured terrains.
Abstract
Legged robots are well-suited for navigating terrains inaccessible to wheeled robots, making them ideal for applications in search and rescue or space exploration. However, current control methods often struggle to generalize across diverse, unstructured environments. This paper introduces a novel framework for agile locomotion of legged robots by combining multi-expert distillation with reinforcement learning (RL) fine-tuning to achieve robust generalization. Initially, terrain-specific expert policies are trained to develop specialized locomotion skills. These policies are then distilled into a unified foundation policy via the DAgger algorithm. The distilled policy is subsequently fine-tuned using RL on a broader terrain set, including real-world 3D scans. The framework allows further adaptation to new terrains through repeated fine-tuning. The proposed policy leverages depth images…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Locomotion and Control · Robotic Path Planning Algorithms · Reinforcement Learning in Robotics
