Concept-modulated model-based offline reinforcement learning for rapid   generalization

Nicholas A. Ketz; Praveen K. Pilly

arXiv:2209.03207·cs.LG·September 8, 2022

Concept-modulated model-based offline reinforcement learning for rapid generalization

Nicholas A. Ketz, Praveen K. Pilly

PDF

Open Access

TL;DR

This paper introduces a concept-conditioned model-based offline reinforcement learning approach that enhances rapid generalization by self-generating constrained simulated scenarios, demonstrated in a driving simulator for improved failure case handling.

Contribution

It combines model-based RL with interpretability to generate environment scenarios conditioned on concepts, enabling better generalization beyond training data.

Findings

01

Significant improvement in one-shot generalization to failure cases.

02

Effective zero-shot generalization to similar environment variations.

03

Outperforms traditional model-based and model-free methods.

Abstract

The robustness of any machine learning solution is fundamentally bound by the data it was trained on. One way to generalize beyond the original training is through human-informed augmentation of the original dataset; however, it is impossible to specify all possible failure cases that can occur during deployment. To address this limitation we combine model-based reinforcement learning and model-interpretability methods to propose a solution that self-generates simulated scenarios constrained by environmental concepts and dynamics learned in an unsupervised manner. In particular, an internal model of the agent's environment is conditioned on low-dimensional concept representations of the input space that are sensitive to the agent's actions. We demonstrate this method within a standard realistic driving simulator in a simple point-to-point navigation task, where we show dramatic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics · Machine Learning and Data Classification