Inverse Reinforcement Learning without Reinforcement Learning

Gokul Swamy; Sanjiban Choudhury; J. Andrew Bagnell; Zhiwei Steven Wu

arXiv:2303.14623·cs.LG·January 31, 2024·1 cites

Inverse Reinforcement Learning without Reinforcement Learning

Gokul Swamy, Sanjiban Choudhury, J. Andrew Bagnell, Zhiwei Steven Wu

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a novel approach to inverse reinforcement learning that leverages expert state distributions to reduce computational complexity, achieving exponential speedups both theoretically and practically in continuous control tasks.

Contribution

It presents the first informed imitation learning reduction utilizing expert state distributions to significantly improve efficiency over traditional IRL methods.

Findings

01

Achieves exponential speedup in theory.

02

Significantly speeds up IRL in continuous control tasks.

03

Reduces reliance on repeated RL problem solving.

Abstract

Inverse Reinforcement Learning (IRL) is a powerful set of techniques for imitation learning that aims to learn a reward function that rationalizes expert demonstrations. Unfortunately, traditional IRL methods suffer from a computational weakness: they require repeatedly solving a hard reinforcement learning (RL) problem as a subroutine. This is counter-intuitive from the viewpoint of reductions: we have reduced the easier problem of imitation learning to repeatedly solving the harder problem of RL. Another thread of work has proved that access to the side-information of the distribution of states where a strong policy spends time can dramatically reduce the sample and computational complexities of solving an RL problem. In this work, we demonstrate for the first time a more informed imitation learning reduction where we utilize the state distribution of the expert to alleviate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gkswamy98/fast_irl
pytorchOfficial

Videos

Inverse Reinforcement Learning without Reinforcement Learning· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Neural Networks and Reservoir Computing · Evolutionary Algorithms and Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings