Inverse Reinforcement Learning in a Continuous State Space with Formal   Guarantees

Gregory Dexter; Kevin Bello; and Jean Honorio

arXiv:2102.07937·cs.LG·September 12, 2022

Inverse Reinforcement Learning in a Continuous State Space with Formal Guarantees

Gregory Dexter, Kevin Bello, and Jean Honorio

PDF

Open Access 1 Video

TL;DR

This paper introduces a new inverse reinforcement learning algorithm for continuous state spaces with unknown dynamics, providing theoretical guarantees and validating its effectiveness through synthetic experiments.

Contribution

The work presents a novel IRL algorithm for continuous states using orthonormal basis functions, with proven correctness and complexity guarantees.

Findings

01

Algorithm achieves formal correctness guarantees.

02

Sample and time complexity are rigorously analyzed.

03

Synthetic experiments support theoretical results.

Abstract

Inverse Reinforcement Learning (IRL) is the problem of finding a reward function which describes observed/known expert behavior. The IRL setting is remarkably useful for automated control, in situations where the reward function is difficult to specify manually or as a means to extract agent preference. In this work, we provide a new IRL algorithm for the continuous state space setting with unknown transition dynamics by modeling the system using a basis of orthonormal functions. Moreover, we provide a proof of correctness and formal guarantees on the sample and time complexity of our algorithm. Finally, we present synthetic experiments to corroborate our theoretical guarantees.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Inverse Reinforcement Learning in a Continuous State Space with Formal Guarantees· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Control Systems Optimization · Receptor Mechanisms and Signaling