Vision-Guided Quadrupedal Locomotion in the Wild with Multi-Modal Delay Randomization
Chieko Sarah Imai, Minghao Zhang, Yuchen Zhang, Marcin Kierebinski,, Ruihan Yang, Yuzhe Qin, Xiaolong Wang

TL;DR
This paper introduces Multi-Modal Delay Randomization (MMDR), a novel training method for reinforcement learning-based vision-guided quadrupedal locomotion that effectively handles real-world latency issues, enabling robust outdoor navigation.
Contribution
The paper proposes MMDR, a new approach to simulate hardware latency during RL training, improving real-world deployment of vision-guided quadrupedal robots in complex environments.
Findings
Robots can navigate complex outdoor terrains smoothly.
Significant performance improvements over baseline methods.
Effective handling of real-world latency in RL policies.
Abstract
Developing robust vision-guided controllers for quadrupedal robots in complex environments, with various obstacles, dynamical surroundings and uneven terrains, is very challenging. While Reinforcement Learning (RL) provides a promising paradigm for agile locomotion skills with vision inputs in simulation, it is still very challenging to deploy the RL policy in the real world. Our key insight is that aside from the discrepancy in the domain gap, in visual appearance between the simulation and the real world, the latency from the control pipeline is also a major cause of difficulty. In this paper, we propose Multi-Modal Delay Randomization (MMDR) to address this issue when training RL agents. Specifically, we simulate the latency of real hardware by using past observations, sampled with randomized periods, for both proprioception and vision. We train the RL policy for end-to-end control…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Locomotion and Control · Reinforcement Learning in Robotics · Robotic Path Planning Algorithms
