Receding Horizon Inverse Reinforcement Learning

Yiqing Xu; Wei Gao; David Hsu

arXiv:2206.04477·cs.LG·October 18, 2022

Receding Horizon Inverse Reinforcement Learning

Yiqing Xu, Wei Gao, David Hsu

PDF

Open Access 1 Video

TL;DR

Receding Horizon IRL (RHIRL) is a scalable and robust algorithm for inferring cost functions in high-dimensional, noisy, continuous systems by locally matching trajectories and stitching solutions, outperforming previous methods.

Contribution

Introduces RHIRL, a novel IRL algorithm that addresses scalability and robustness by local trajectory matching and cost function disentanglement in high-dimensional systems.

Findings

01

RHIRL outperforms existing IRL algorithms on benchmark tasks.

02

The cumulative error of RHIRL grows linearly with task duration.

03

RHIRL effectively handles noisy and imperfect expert demonstrations.

Abstract

Inverse reinforcement learning (IRL) seeks to infer a cost function that explains the underlying goals and preferences of expert demonstrations. This paper presents receding horizon inverse reinforcement learning (RHIRL), a new IRL algorithm for high-dimensional, noisy, continuous systems with black-box dynamic models. RHIRL addresses two key challenges of IRL: scalability and robustness. To handle high-dimensional continuous systems, RHIRL matches the induced optimal trajectories with expert demonstrations locally in a receding horizon manner and 'stitches' together the local solutions to learn the cost; it thereby avoids the 'curse of dimensionality'. This contrasts sharply with earlier algorithms that match with expert demonstrations globally over the entire high-dimensional state space. To be robust against imperfect expert demonstrations and control noise, RHIRL learns a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Receding Horizon Inverse Reinforcement Learning· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Neural dynamics and brain function · Evolutionary Algorithms and Applications