PA-LOCO: Learning Perturbation-Adaptive Locomotion for Quadruped Robots
Zhiyuan Xiao, Xinyu Zhang, Xiang Zhou, and Qingrui Zhang

TL;DR
This paper introduces a multi-encoder privileged learning framework with a residual policy network to improve the robustness and reliability of quadruped robot locomotion under external disturbances, outperforming existing methods.
Contribution
It proposes a novel multi-encoder privileged learning approach combined with a residual policy network to enhance disturbance robustness in quadrupedal locomotion.
Findings
Enhanced robustness and stability in diverse terrains.
Superior performance over state-of-the-art algorithms.
Validated on a real quadruped robot with extensive experiments.
Abstract
Numerous locomotion controllers based on Reinforcement Learning (RL) have been designed to enable blind quadrupedal locomotion over challenging terrains. Nevertheless, locomotion control remains difficult for quadruped robots traversing diverse terrains amid unforeseen disturbances. Recently, privileged learning has been employed to learn reliable and robust quadrupedal locomotion over various terrains based on a teacher-student architecture. However, its single-encoder structure is inadequate for handling external force perturbations. The student policy inevitably suffers performance degradation because of the feature embedding discrepancy between the teacher policy's feature encoder and the student policy's feature encoder. Hence, this paper presents a privileged learning framework with multiple feature encoders and a residual policy network for robust and…
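To make the idea of multiple feature encoders plus a residual policy concrete, the following is a minimal sketch and not the authors' implementation. Everything in it is an assumption made for illustration: the class name `MultiEncoderResidualPolicy`, the split into a terrain encoder and a force encoder, the single-layer networks, all dimensions, and the `residual_scale` factor. It only shows the data flow in which a residual network adds a disturbance-aware correction to a base action; training and the teacher-student distillation step are not modeled.

```python
import math
import random


def make_linear(n_in, n_out, rng):
    """Return a (weights, bias) pair with small random weights."""
    w = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    b = [0.0] * n_out
    return w, b


def linear_tanh(layer, x):
    """Apply one dense layer followed by a tanh nonlinearity."""
    w, b = layer
    return [math.tanh(sum(wi * xi for wi, xi in zip(row, x)) + bi)
            for row, bi in zip(w, b)]


class MultiEncoderResidualPolicy:
    """Hypothetical two-encoder policy with an additive residual head."""

    # All dimensions are illustrative assumptions, not values from the paper.
    def __init__(self, obs_dim=48, terrain_dim=16, force_dim=6,
                 latent_dim=8, act_dim=12, seed=0):
        rng = random.Random(seed)
        # One encoder per kind of privileged information (assumed split).
        self.terrain_encoder = make_linear(terrain_dim, latent_dim, rng)
        self.force_encoder = make_linear(force_dim, latent_dim, rng)
        # Base policy sees the observation and the terrain latent only.
        self.base_policy = make_linear(obs_dim + latent_dim, act_dim, rng)
        # Residual policy additionally sees the disturbance latent.
        self.residual_policy = make_linear(obs_dim + 2 * latent_dim, act_dim, rng)
        self.residual_scale = 0.5  # assumed scaling of the correction

    def act(self, obs, terrain_info, external_force):
        z_terrain = linear_tanh(self.terrain_encoder, terrain_info)
        z_force = linear_tanh(self.force_encoder, external_force)
        base = linear_tanh(self.base_policy, obs + z_terrain)
        residual = linear_tanh(self.residual_policy, obs + z_terrain + z_force)
        # Final action = base action + scaled residual correction.
        action = [b + self.residual_scale * r for b, r in zip(base, residual)]
        return action, base, residual


# Usage: a zero observation, flat terrain features, and a push along x.
policy = MultiEncoderResidualPolicy()
action, base, residual = policy.act(
    [0.0] * 48, [0.1] * 16, [5.0, 0.0, 0.0, 0.0, 0.0, 0.0])
print(len(action))
```

Keeping the disturbance latent out of the base policy is a design choice of this sketch: it lets the residual head carry the entire response to external forces, which is one plausible reading of why a residual network is paired with separate encoders.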
Taxonomy
Topics: Robotic Locomotion and Control · Modular Robots and Swarm Intelligence · Robotic Path Planning Algorithms
