Advanced Skills by Learning Locomotion and Local Navigation End-to-End
Nikita Rudin, David Hoeller, Marko Bjelonic, Marco Hutter

TL;DR
This paper introduces an end-to-end deep reinforcement learning approach for quadrupedal robot navigation that enables more flexible, efficient, and successful traversal of challenging terrains without relying on traditional path planning and velocity tracking.
Contribution
The work presents a novel end-to-end policy training method that allows robots to learn complex navigation behaviors directly from raw inputs, surpassing traditional modular approaches.
Findings
Robots successfully navigate challenging terrains previously inaccessible.
The learned policies use energy-efficient gaits and improve success rates.
Time-dependent reward shaping is crucial for learning complex behaviors.
Abstract
The common approach for local navigation on challenging environments with legged robots requires path planning, path following and locomotion, which usually requires a locomotion control policy that accurately tracks a commanded velocity. However, by breaking down the navigation problem into these sub-tasks, we limit the robot's capabilities since the individual tasks do not consider the full solution space. In this work, we propose to solve the complete problem by training an end-to-end policy with deep reinforcement learning. Instead of continuously tracking a precomputed path, the robot needs to reach a target position within a provided time. The task's success is only evaluated at the end of an episode, meaning that the policy does not need to reach the target as fast as possible. It is free to select its path and the locomotion gait. Training a policy in this way opens up a larger…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
