Hierarchies of Planning and Reinforcement Learning for Robot Navigation

Jan W\"ohlke; Felix Schmitt; Herke van Hoof

arXiv:2109.11178·cs.RO·November 8, 2021

Hierarchies of Planning and Reinforcement Learning for Robot Navigation

Jan W\"ohlke, Felix Schmitt, Herke van Hoof

PDF

TL;DR

This paper introduces a hierarchical reinforcement learning framework with a trainable planning policy for robot navigation, improving adaptability and performance over traditional methods, especially in complex dynamic environments.

Contribution

It proposes a novel VI-RL planning policy that learns environment dynamics, enhancing hierarchical RL for navigation tasks with complex robot behaviors.

Findings

01

VI-RL outperforms vanilla RL in simulated navigation tasks.

02

VI-RL matches hierarchical RL on single layouts and generalizes better across multiple layouts.

03

Shows significant improvements in non-holonomic parking scenarios.

Abstract

Solving robotic navigation tasks via reinforcement learning (RL) is challenging due to their sparse reward and long decision horizon nature. However, in many navigation tasks, high-level (HL) task representations, like a rough floor plan, are available. Previous work has demonstrated efficient learning by hierarchal approaches consisting of path planning in the HL representation and using sub-goals derived from the plan to guide the RL policy in the source task. However, these approaches usually neglect the complex dynamics and sub-optimal sub-goal-reaching capabilities of the robot during planning. This work overcomes these limitations by proposing a novel hierarchical framework that utilizes a trainable planning policy for the HL representation. Thereby robot capabilities and environment conditions can be learned utilizing collected rollout data. We specifically introduce a planning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.