MS-PPO: Morphological-Symmetry-Equivariant Policy for Legged Robot Locomotion

Sizhe Wei; Xulin Chen; Fengze Xie; Garrett Ethan Katz; Zhenyu Gan; Lu Gan

arXiv:2512.00727·cs.RO·December 2, 2025

MS-PPO: Morphological-Symmetry-Equivariant Policy for Legged Robot Locomotion

Sizhe Wei, Xulin Chen, Fengze Xie, Garrett Ethan Katz, Zhenyu Gan, Lu Gan

PDF

Open Access

TL;DR

This paper introduces MS-PPO, a novel policy learning framework for legged robots that encodes morphological symmetry and kinematic structure, leading to improved training stability, generalization, and efficiency in diverse locomotion tasks.

Contribution

MS-PPO is the first to integrate morphological symmetry directly into a graph neural network policy for legged robots, eliminating the need for reward shaping or data augmentation.

Findings

01

MS-PPO outperforms state-of-the-art baselines in simulation and hardware.

02

It achieves higher sample efficiency and better symmetry generalization.

03

The approach enhances training stability across various locomotion tasks.

Abstract

Reinforcement learning has recently enabled impressive locomotion capabilities on legged robots; however, most policy architectures remain morphology- and symmetry-agnostic, leading to inefficient training and limited generalization. This work introduces MS-PPO, a morphological-symmetry-equivariant policy learning framework that encodes robot kinematic structure and morphological symmetries directly into the policy network. We construct a morphology-informed graph neural architecture that is provably equivariant with respect to the robot's morphological symmetry group actions, ensuring consistent policy responses under symmetric states while maintaining invariance in value estimation. This design eliminates the need for tedious reward shaping or costly data augmentation, which are typically required to enforce symmetry. We evaluate MS-PPO in simulation on Unitree Go2 and Xiaomi…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Locomotion and Control · Reinforcement Learning in Robotics · Prosthetics and Rehabilitation Robotics