Towards Generalized Inverse Reinforcement Learning

Chaosheng Dong; Yijia Wang

arXiv:2402.07246·cs.LG·February 13, 2024·1 cites

Towards Generalized Inverse Reinforcement Learning

Chaosheng Dong, Yijia Wang

PDF

Open Access

TL;DR

This paper introduces a generalized inverse reinforcement learning framework that learns MDP components from observed, possibly suboptimal, policies with unknown or partially known elements, using a new formulation and heuristic algorithm.

Contribution

It formulates the GIRL problem considering uncertain MDP components and proposes a fast heuristic algorithm to solve it, addressing key challenges in the field.

Findings

01

The proposed formulation effectively captures the GIRL problem.

02

The heuristic algorithm demonstrates good performance on finite and infinite state problems.

03

Numerical results validate the approach's merit.

Abstract

This paper studies generalized inverse reinforcement learning (GIRL) in Markov decision processes (MDPs), that is, the problem of learning the basic components of an MDP given observed behavior (policy) that might not be optimal. These components include not only the reward function and transition probability matrices, but also the action space and state space that are not exactly known but are known to belong to given uncertainty sets. We address two key challenges in GIRL: first, the need to quantify the discrepancy between the observed policy and the underlying optimal policy; second, the difficulty of mathematically characterizing the underlying optimal policy when the basic components of an MDP are unobservable or partially observable. Then, we propose the mathematical formulation for GIRL and develop a fast heuristic algorithm. Numerical results on both finite and infinite state…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics