Parkour in the Wild: Learning a General and Extensible Agile Locomotion Policy Using Multi-expert Distillation and RL Fine-tuning

Nikita Rudin; Junzhe He; Joshua Aurand; Marco Hutter

arXiv:2505.11164·cs.RO·May 19, 2025

Parkour in the Wild: Learning a General and Extensible Agile Locomotion Policy Using Multi-expert Distillation and RL Fine-tuning

Nikita Rudin, Junzhe He, Joshua Aurand, Marco Hutter

PDF

Open Access

TL;DR

This paper presents a new framework combining multi-expert distillation and reinforcement learning fine-tuning to develop a versatile, robust legged robot locomotion policy capable of navigating diverse unstructured terrains.

Contribution

It introduces a novel multi-stage training process that integrates terrain-specific policies into a unified, adaptable locomotion controller using distillation and RL fine-tuning.

Findings

01

Significant performance improvements over existing methods.

02

Successful deployment on real-world robots demonstrating agility.

03

Robust navigation across diverse unstructured terrains.

Abstract

Legged robots are well-suited for navigating terrains inaccessible to wheeled robots, making them ideal for applications in search and rescue or space exploration. However, current control methods often struggle to generalize across diverse, unstructured environments. This paper introduces a novel framework for agile locomotion of legged robots by combining multi-expert distillation with reinforcement learning (RL) fine-tuning to achieve robust generalization. Initially, terrain-specific expert policies are trained to develop specialized locomotion skills. These policies are then distilled into a unified foundation policy via the DAgger algorithm. The distilled policy is subsequently fine-tuned using RL on a broader terrain set, including real-world 3D scans. The framework allows further adaptation to new terrains through repeated fine-tuning. The proposed policy leverages depth images…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Locomotion and Control · Robotic Path Planning Algorithms · Reinforcement Learning in Robotics