From proprioception to long-horizon planning in novel environments: A   hierarchical RL model

Nishad Gothoskar; Miguel L\'azaro-Gredilla; Dileep George

arXiv:2006.06620·cs.RO·June 12, 2020

From proprioception to long-horizon planning in novel environments: A hierarchical RL model

Nishad Gothoskar, Miguel L\'azaro-Gredilla, Dileep George

PDF

Open Access

TL;DR

This paper presents a three-level hierarchical reinforcement learning model that integrates proprioception, mid-level dynamics, and goal planning, significantly improving sample efficiency in navigation tasks.

Contribution

The work introduces a novel hierarchical RL architecture combining model-free, model-predictive, and graph-based planning for complex environments.

Findings

01

Significant sample-efficiency improvements in Mujoco Ant navigation tasks

02

Effective long-horizon planning in maze environments

03

Hierarchical structure enables flexible reasoning at multiple abstraction levels

Abstract

For an intelligent agent to flexibly and efficiently operate in complex environments, they must be able to reason at multiple levels of temporal, spatial, and conceptual abstraction. At the lower levels, the agent must interpret their proprioceptive inputs and control their muscles, and at the higher levels, the agent must select goals and plan how they will achieve those goals. It is clear that each of these types of reasoning is amenable to different types of representations, algorithms, and inputs. In this work, we introduce a simple, three-level hierarchical architecture that reflects these distinctions. The low-level controller operates on the continuous proprioceptive inputs, using model-free learning to acquire useful behaviors. These in turn induce a set of mid-level dynamics, which are learned by the mid-level controller and used for model-predictive control, to select a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robotic Path Planning Algorithms · Multimodal Machine Learning Applications