Hierarchical Neural Dynamic Policies

Shikhar Bahl; Abhinav Gupta; Deepak Pathak

arXiv:2107.05627 · cs.LG · July 13, 2021

TL;DR

This paper introduces Hierarchical Neural Dynamical Policies (H-NDPs), a hierarchical framework that learns local dynamical policies and distills them into a global policy from high-dimensional images, improving generalization and safety in dynamic tasks.

Contribution

The paper proposes a hierarchical deep policy learning framework that embeds dynamical system structure, enabling better generalization to unseen configurations from image inputs.
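
To make "embeds dynamical system structure" concrete, here is a minimal sketch of rolling out a trajectory from a second-order dynamical system. The summary above does not say which system the paper uses, so everything here is an illustrative assumption: the spring-damper form with a learned forcing term (in the style of dynamic movement primitives), the function name `rollout`, and the parameters `alpha`, `goal`, and `weights`.

```python
import numpy as np


def rollout(y0, goal, weights, alpha=25.0, steps=300, dt=0.01):
    """Integrate y'' = alpha * (beta * (goal - y) - y') + f(x) with Euler steps.

    Hypothetical illustration only: the equation form and all parameter
    names are assumptions, not taken from the paper.
    """
    weights = np.asarray(weights, dtype=float)
    beta = alpha / 4.0  # this ratio makes the spring-damper critically damped
    n_basis = len(weights)
    # Radial basis functions spaced along the decaying phase variable x.
    centers = np.exp(-np.linspace(0.0, 3.0, n_basis))
    widths = np.full(n_basis, float(n_basis) ** 1.5)

    y, yd, x = float(y0), 0.0, 1.0
    traj = [y]
    for _ in range(steps):
        psi = np.exp(-widths * (x - centers) ** 2)
        # The forcing term is scaled by x, so it fades as the phase decays.
        forcing = (psi @ weights) / (psi.sum() + 1e-8) * x * (goal - y0)
        ydd = alpha * (beta * (goal - y) - yd) + forcing
        yd += ydd * dt
        y += yd * dt
        x += -x * dt  # phase variable decays from 1 toward 0
        traj.append(y)
    return np.array(traj)


# With zero forcing weights the system is a plain spring-damper pulled to the goal.
traj = rollout(y0=0.0, goal=1.0, weights=np.zeros(5))
```

In a policy of this kind, a network would presumably predict quantities such as `goal` and `weights` from the observation, and the integrator would turn them into a trajectory. That mapping is not shown here, and whether H-NDPs use exactly this parameterization is not stated in the text above.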

Findings

1. H-NDPs achieve state-of-the-art results on dynamic tasks in both the real world and simulation.

2. H-NDPs produce smooth trajectories, which improves safety in real-world use.

3. The approach integrates with both imitation learning and reinforcement learning methods.

Abstract

We tackle the problem of generalization to unseen configurations for dynamic tasks in the real world while learning from high-dimensional image input. The family of nonlinear dynamical system-based methods has successfully demonstrated dynamic robot behaviors but has difficulty generalizing to unseen configurations and learning from image inputs. Recent works approach this issue by using deep network policies and reparameterizing actions to embed the structure of dynamical systems, but they still struggle in domains with diverse configurations of image goals and hence find it difficult to generalize. In this paper, we address this dichotomy by embedding the structure of dynamical systems in a hierarchical deep policy learning framework, called Hierarchical Neural Dynamical Policies (H-NDPs). Instead of fitting deep dynamical systems to diverse data directly, H-NDPs form…

Taxonomy

Topics: Robot Manipulation and Learning · Reinforcement Learning in Robotics · Robotic Mechanisms and Dynamics