Provably Efficient Exploration in Inverse Constrained Reinforcement Learning
Bo Yue, Jian Li, Guiliang Liu

TL;DR
This paper introduces a theoretically grounded exploration framework for Inverse Constrained Reinforcement Learning, improving the efficiency of inferring constraints from expert behaviors with proven sample complexity.
Contribution
It proposes two novel exploration algorithms for efficient constraint inference in ICRL, with theoretical guarantees and empirical validation.
Findings
Algorithms achieve reduced sample complexity
Empirical results validate theoretical guarantees
Effective constraint inference in complex environments
Abstract
Optimizing objective functions subject to constraints is fundamental in many real-world applications. However, these constraints are often not readily defined and must be inferred from expert agent behaviors, a problem known as Inverse Constraint Inference. Inverse Constrained Reinforcement Learning (ICRL) is a common solver for recovering feasible constraints in complex environments, relying on training samples collected from interactive environments. However, the efficacy and efficiency of current sampling strategies remain unclear. We propose a strategic exploration framework for sampling with guaranteed efficiency to bridge this gap. By defining the feasible cost set for ICRL problems, we analyze how estimation errors in transition dynamics and the expert policy influence the feasibility of inferred constraints. Based on this analysis, we introduce two exploratory algorithms to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Robotic Path Planning Algorithms · Adaptive Dynamic Programming Control
MethodsSparse Evolutionary Training
