Advanced Skills by Learning Locomotion and Local Navigation End-to-End

Nikita Rudin; David Hoeller; Marko Bjelonic; Marco Hutter

arXiv:2209.12827·cs.RO·September 27, 2022

Advanced Skills by Learning Locomotion and Local Navigation End-to-End

Nikita Rudin, David Hoeller, Marko Bjelonic, Marco Hutter

PDF

TL;DR

This paper introduces an end-to-end deep reinforcement learning approach for quadrupedal robot navigation that enables more flexible, efficient, and successful traversal of challenging terrains without relying on traditional path planning and velocity tracking.

Contribution

The work presents a novel end-to-end policy training method that allows robots to learn complex navigation behaviors directly from raw inputs, surpassing traditional modular approaches.

Findings

01

Robots successfully navigate challenging terrains previously inaccessible.

02

The learned policies use energy-efficient gaits and improve success rates.

03

Time-dependent reward shaping is crucial for learning complex behaviors.

Abstract

The common approach for local navigation on challenging environments with legged robots requires path planning, path following and locomotion, which usually requires a locomotion control policy that accurately tracks a commanded velocity. However, by breaking down the navigation problem into these sub-tasks, we limit the robot's capabilities since the individual tasks do not consider the full solution space. In this work, we propose to solve the complete problem by training an end-to-end policy with deep reinforcement learning. Instead of continuously tracking a precomputed path, the robot needs to reach a target position within a provided time. The task's success is only evaluated at the end of an episode, meaning that the policy does not need to reach the target as fast as possible. It is free to select its path and the locomotion gait. Training a policy in this way opens up a larger…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.