Simulator Adaptation for Sim-to-Real Learning of Legged Locomotion via Proprioceptive Distribution Matching
Jeremy Dao, Alan Fern

TL;DR
This paper proposes a practical proprioceptive distribution matching method for adapting simulators to improve the transfer of legged locomotion policies from simulation to real hardware, reducing performance gaps.
Contribution
It introduces a novel proprioceptive distribution matching approach that eliminates the need for external sensing and precise time alignment in simulator adaptation for legged robots.
Findings
Matching parameter recovery and policy performance on quadruped simulations.
Substantial drift reduction achieved with less than five minutes of hardware data.
Effective adaptation for challenging two-legged walking behaviors.
Abstract
Simulation trained legged locomotion policies often exhibit performance loss on hardware due to dynamics discrepancies between the simulator and the real world, highlighting the need for approaches that adapt the simulator itself to better match hardware behavior. Prior work typically quantify these discrepancies through precise, time-aligned matching of joint and base trajectories. This process requires motion capture, privileged sensing, and carefully controlled initial conditions. We introduce a practical alternative based on proprioceptive distribution matching, which compares hardware and simulation rollouts as distributions of joint observations and actions, eliminating the need for time alignment or external sensing. Using this metric as a black-box objective, we explore adapting simulator dynamics through parameter identification, action-delta models, and residual actuator…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
