PAC Apprenticeship Learning with Bayesian Active Inverse Reinforcement Learning
Ondrej Bajgar, Dewi S.W. Gould, Jonathon Liu, Alessandro Abate, Konstantinos Gatsis, Michael A. Osborne

TL;DR
This paper introduces PAC-EIG, a novel information-theoretic method for active inverse reinforcement learning that guarantees probably-approximately-correct policies with fewer demonstrations, especially in safety-critical domains.
Contribution
It provides the first PAC guarantee for active IRL with noisy demonstrations and introduces a new acquisition function that maximizes information gain about policy regret.
Findings
PAC-EIG achieves reliable policies with fewer demonstrations.
Theoretical convergence bounds are established for finite state-action spaces.
Experimental results demonstrate advantages over prior heuristic methods.
Abstract
As AI systems become increasingly autonomous, reliably aligning their decision-making with human preferences is essential. Inverse reinforcement learning (IRL) offers a promising approach to infer preferences from demonstrations. These preferences can then be used to produce an apprentice policy that performs well on the demonstrated task. However, in domains like autonomous driving or robotics, where errors can have serious consequences, we need not just good average performance but reliable policies with formal guarantees -- yet obtaining sufficient human demonstrations for reliability guarantees can be costly. Active IRL addresses this challenge by strategically selecting the most informative scenarios for human demonstration. We introduce PAC-EIG, an information-theoretic acquisition function that directly targets probably-approximately-correct (PAC) guarantees for the learned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
