PAC Apprenticeship Learning with Bayesian Active Inverse Reinforcement Learning

Ondrej Bajgar; Dewi S.W. Gould; Jonathon Liu; Alessandro Abate; Konstantinos Gatsis; Michael A. Osborne

arXiv:2508.03693·cs.LG·September 22, 2025

PAC Apprenticeship Learning with Bayesian Active Inverse Reinforcement Learning

Ondrej Bajgar, Dewi S.W. Gould, Jonathon Liu, Alessandro Abate, Konstantinos Gatsis, Michael A. Osborne

PDF

TL;DR

This paper introduces PAC-EIG, a novel information-theoretic method for active inverse reinforcement learning that guarantees probably-approximately-correct policies with fewer demonstrations, especially in safety-critical domains.

Contribution

It provides the first PAC guarantee for active IRL with noisy demonstrations and introduces a new acquisition function that maximizes information gain about policy regret.

Findings

01

PAC-EIG achieves reliable policies with fewer demonstrations.

02

Theoretical convergence bounds are established for finite state-action spaces.

03

Experimental results demonstrate advantages over prior heuristic methods.

Abstract

As AI systems become increasingly autonomous, reliably aligning their decision-making with human preferences is essential. Inverse reinforcement learning (IRL) offers a promising approach to infer preferences from demonstrations. These preferences can then be used to produce an apprentice policy that performs well on the demonstrated task. However, in domains like autonomous driving or robotics, where errors can have serious consequences, we need not just good average performance but reliable policies with formal guarantees -- yet obtaining sufficient human demonstrations for reliability guarantees can be costly. Active IRL addresses this challenge by strategically selecting the most informative scenarios for human demonstration. We introduce PAC-EIG, an information-theoretic acquisition function that directly targets probably-approximately-correct (PAC) guarantees for the learned…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.