Context-Hierarchy Inverse Reinforcement Learning

Wei Gao; David Hsu; Wee Sun Lee

arXiv:2202.12597·cs.AI·February 28, 2022

Context-Hierarchy Inverse Reinforcement Learning

Wei Gao, David Hsu, Wee Sun Lee

PDF

Open Access

TL;DR

This paper introduces CHIRL, a hierarchical IRL method that leverages context hierarchies and modular neural networks to improve learning of complex reward functions, especially in large-scale tasks like autonomous driving.

Contribution

The paper proposes CHIRL, a novel IRL algorithm that models context hierarchically and uses modular neural networks to enhance reward learning and task decomposition.

Findings

01

Effective in scaling IRL to complex tasks

02

Improves data sharing and state abstraction

03

Shows promising results in autonomous driving simulations

Abstract

An inverse reinforcement learning (IRL) agent learns to act intelligently by observing expert demonstrations and learning the expert's underlying reward function. Although learning the reward functions from demonstrations has achieved great success in various tasks, several other challenges are mostly ignored. Firstly, existing IRL methods try to learn the reward function from scratch without relying on any prior knowledge. Secondly, traditional IRL methods assume the reward functions are homogeneous across all the demonstrations. Some existing IRL methods managed to extend to the heterogeneous demonstrations. However, they still assume one hidden variable that affects the behavior and learn the underlying hidden variable together with the reward from demonstrations. To solve these issues, we present Context Hierarchy IRL(CHIRL), a new IRL algorithm that exploits the context to scale up…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator