Vision-Guided Quadrupedal Locomotion in the Wild with Multi-Modal Delay   Randomization

Chieko Sarah Imai; Minghao Zhang; Yuchen Zhang; Marcin Kierebinski,; Ruihan Yang; Yuzhe Qin; Xiaolong Wang

arXiv:2109.14549·cs.RO·July 26, 2022

Vision-Guided Quadrupedal Locomotion in the Wild with Multi-Modal Delay Randomization

Chieko Sarah Imai, Minghao Zhang, Yuchen Zhang, Marcin Kierebinski,, Ruihan Yang, Yuzhe Qin, Xiaolong Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces Multi-Modal Delay Randomization (MMDR), a novel training method for reinforcement learning-based vision-guided quadrupedal locomotion that effectively handles real-world latency issues, enabling robust outdoor navigation.

Contribution

The paper proposes MMDR, a new approach to simulate hardware latency during RL training, improving real-world deployment of vision-guided quadrupedal robots in complex environments.

Findings

01

Robots can navigate complex outdoor terrains smoothly.

02

Significant performance improvements over baseline methods.

03

Effective handling of real-world latency in RL policies.

Abstract

Developing robust vision-guided controllers for quadrupedal robots in complex environments, with various obstacles, dynamical surroundings and uneven terrains, is very challenging. While Reinforcement Learning (RL) provides a promising paradigm for agile locomotion skills with vision inputs in simulation, it is still very challenging to deploy the RL policy in the real world. Our key insight is that aside from the discrepancy in the domain gap, in visual appearance between the simulation and the real world, the latency from the control pipeline is also a major cause of difficulty. In this paper, we propose Multi-Modal Delay Randomization (MMDR) to address this issue when training RL agents. Specifically, we simulate the latency of real hardware by using past observations, sampled with randomized periods, for both proprioception and vision. We train the RL policy for end-to-end control…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mehooz/vision4leg
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Locomotion and Control · Reinforcement Learning in Robotics · Robotic Path Planning Algorithms