On the Effective Horizon of Inverse Reinforcement Learning

Yiqing Xu; Finale Doshi-Velez; David Hsu

arXiv:2307.06541·cs.LG·February 21, 2025

On the Effective Horizon of Inverse Reinforcement Learning

Yiqing Xu, Finale Doshi-Velez, David Hsu

PDF

Open Access

TL;DR

This paper analyzes how the effective time horizon in inverse reinforcement learning influences reward estimation accuracy and overfitting, proposing a joint reward and horizon learning approach validated by experiments.

Contribution

It provides a formal analysis of the effective horizon phenomenon in IRL and introduces a joint reward and horizon learning framework.

Findings

01

Shorter effective horizons can improve IRL performance.

02

Joint reward and horizon learning outperforms traditional methods.

03

Experimental results support the theoretical analysis.

Abstract

Inverse reinforcement learning (IRL) algorithms often rely on (forward) reinforcement learning or planning, over a given time horizon, to compute an approximately optimal policy for a hypothesized reward function; they then match this policy with expert demonstrations. The time horizon plays a critical role in determining both the accuracy of reward estimates and the computational efficiency of IRL algorithms. Interestingly, an *effective time horizon* shorter than the ground-truth value often produces better results faster. This work formally analyzes this phenomenon and provides an explanation: the time horizon controls the complexity of an induced policy class and mitigates overfitting with limited data. This analysis provides a guide for the principled choice of the effective horizon for IRL. It also prompts us to re-examine the classic IRL formulation: it is more natural to learn…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Supply Chain and Inventory Management · Auction Theory and Applications