A Bayesian Approach to Robust Inverse Reinforcement Learning
Ran Wei, Siliang Zeng, Chenliang Li, Alfredo Garcia, Anthony McDonald,, Mingyi Hong

TL;DR
This paper introduces a Bayesian framework for offline inverse reinforcement learning that jointly estimates the expert's reward function and subjective environment model, demonstrating robustness and superior performance in high-dimensional settings.
Contribution
It presents a novel Bayesian method that simultaneously estimates reward and environment dynamics, improving robustness and efficiency over existing IRL approaches.
Findings
Estimated policy is robust when the expert's environment model is highly accurate.
Algorithms outperform state-of-the-art offline IRL methods in MuJoCo environments.
The approach is effective in high-dimensional settings.
Abstract
We consider a Bayesian approach to offline model-based inverse reinforcement learning (IRL). The proposed framework differs from existing offline model-based IRL approaches by performing simultaneous estimation of the expert's reward function and subjective model of environment dynamics. We make use of a class of prior distributions which parameterizes how accurate the expert's model of the environment is to develop efficient algorithms to estimate the expert's reward and subjective dynamics in high-dimensional settings. Our analysis reveals a novel insight that the estimated policy exhibits robust performance when the expert is believed (a priori) to have a highly accurate model of the environment. We verify this observation in the MuJoCo environments and show that our algorithms outperform state-of-the-art offline IRL algorithms.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques
