Context-Hierarchy Inverse Reinforcement Learning
Wei Gao, David Hsu, Wee Sun Lee

TL;DR
This paper introduces CHIRL, a hierarchical IRL method that leverages context hierarchies and modular neural networks to improve learning of complex reward functions, especially in large-scale tasks like autonomous driving.
Contribution
The paper proposes CHIRL, a novel IRL algorithm that models context hierarchically and uses modular neural networks to enhance reward learning and task decomposition.
Findings
Effective in scaling IRL to complex tasks
Improves data sharing and state abstraction
Shows promising results in autonomous driving simulations
Abstract
An inverse reinforcement learning (IRL) agent learns to act intelligently by observing expert demonstrations and learning the expert's underlying reward function. Although learning the reward functions from demonstrations has achieved great success in various tasks, several other challenges are mostly ignored. Firstly, existing IRL methods try to learn the reward function from scratch without relying on any prior knowledge. Secondly, traditional IRL methods assume the reward functions are homogeneous across all the demonstrations. Some existing IRL methods managed to extend to the heterogeneous demonstrations. However, they still assume one hidden variable that affects the behavior and learn the underlying hidden variable together with the reward from demonstrations. To solve these issues, we present Context Hierarchy IRL(CHIRL), a new IRL algorithm that exploits the context to scale up…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator
