Imitation with Neural Density Models

Kuno Kim; Akshat Jindal; Yang Song; Jiaming Song; Yanan Sui; Stefano; Ermon

arXiv:2010.09808·cs.LG·October 21, 2020

Imitation with Neural Density Models

Kuno Kim, Akshat Jindal, Yang Song, Jiaming Song, Yanan Sui, Stefano, Ermon

PDF

Open Access 1 Video

TL;DR

This paper introduces Neural Density Imitation, a novel imitation learning framework that estimates expert behavior density and uses it as a reward in reinforcement learning, achieving high efficiency on control benchmarks.

Contribution

It presents a new density-based imitation learning method that is non-adversarial, model-free, and provably bounds divergence between expert and imitator.

Findings

01

Achieves state-of-the-art demonstration efficiency

02

Provides a practical algorithm for density-based imitation learning

03

Proven theoretical bounds on divergence minimization

Abstract

We propose a new framework for Imitation Learning (IL) via density estimation of the expert's occupancy measure followed by Maximum Occupancy Entropy Reinforcement Learning (RL) using the density as a reward. Our approach maximizes a non-adversarial model-free RL objective that provably lower bounds reverse Kullback-Leibler divergence between occupancy measures of the expert and imitator. We present a practical IL algorithm, Neural Density Imitation (NDI), which obtains state-of-the-art demonstration efficiency on benchmark control tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Imitation with Neural Density Models· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Model Reduction and Neural Networks · Reinforcement Learning in Robotics