Symmetry-Guided Memory Augmentation for Efficient Locomotion Learning
Kaixi Bao, Chenhao Li, Yarden As, Andreas Krause, Marco Hutter

TL;DR
This paper introduces SGMA, a novel framework that uses symmetry-based experience augmentation and memory extension to improve data efficiency and robustness in reinforcement learning for legged robot locomotion.
Contribution
The paper presents a new symmetry-guided memory augmentation method that enhances RL training efficiency by leveraging robot symmetries and extending transformations to memory states.
Findings
Achieves efficient policy training in simulation and real robots.
Maintains robust performance across diverse locomotion tasks.
Reduces the need for extensive environment interactions.
Abstract
Training reinforcement learning (RL) policies for legged locomotion often requires extensive environment interactions, which are costly and time-consuming. We propose Symmetry-Guided Memory Augmentation (SGMA), a framework that improves training efficiency by combining structured experience augmentation with memory-based context inference. Our method leverages robot and task symmetries to generate additional, physically consistent training experiences without requiring extra interactions. To avoid the pitfalls of naive augmentation, we extend these transformations to the policy's memory states, enabling the agent to retain task-relevant context and adapt its behavior accordingly. We evaluate the approach on quadruped and humanoid robots in simulation, as well as on a real quadruped platform. Across diverse locomotion tasks involving joint failures and payload variations, our method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · EEG and Brain-Computer Interfaces · Muscle activation and electromyography studies
MethodsSparse Evolutionary Training
