REAL: Robust Extreme Agility via Spatio-Temporal Policy Learning and Physics-Guided Filtering
Jialong Liu, Dehan Shen, Yanbo Wen, Zeyu Jiang, Changhao Chen

TL;DR
This paper introduces REAL, a robust learning framework for quadruped robots that combines multi-modal perception, physics-guided filtering, and memory to achieve extreme agility under sensory noise and perceptual degradation.
Contribution
REAL integrates vision, proprioception, and physics-based filtering into a unified system, improving robustness and reliability in dynamic, obstacle-rich environments.
Findings
Successfully traverses extreme obstacles with visual noise
Operates in real-time with 13.1 ms inference time
Maintains high agility despite sensory corruption
Abstract
Extreme legged parkour demands rapid terrain assessment and precise foot placement under highly dynamic conditions. While recent learning-based systems achieve impressive agility, they remain fundamentally fragile to perceptual degradation, where even brief visual noise or latency can cause catastrophic failure. To overcome this, we propose Robust Extreme Agility Learning (REAL), an end-to-end framework for reliable parkour under sensory corruption. Instead of relying on perfectly clean perception, REAL tightly couples vision, proprioceptive history, and temporal memory. We distill a cross-modal teacher policy into a deployable student equipped with a FiLM-modulated Mamba backbone to actively filter visual noise and build short-term terrain memory actively. Furthermore, a physics-guided Bayesian state estimator enforces rigid-body consistency during high-impact maneuvers. Validated on a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Locomotion and Control · Lower Extremity Biomechanics and Pathologies · Reinforcement Learning in Robotics
