Concept-modulated model-based offline reinforcement learning for rapid generalization
Nicholas A. Ketz, Praveen K. Pilly

TL;DR
This paper introduces a concept-conditioned model-based offline reinforcement learning approach that enhances rapid generalization by self-generating constrained simulated scenarios, demonstrated in a driving simulator for improved failure case handling.
Contribution
It combines model-based RL with interpretability to generate environment scenarios conditioned on concepts, enabling better generalization beyond training data.
Findings
Significant improvement in one-shot generalization to failure cases.
Effective zero-shot generalization to similar environment variations.
Outperforms traditional model-based and model-free methods.
Abstract
The robustness of any machine learning solution is fundamentally bound by the data it was trained on. One way to generalize beyond the original training is through human-informed augmentation of the original dataset; however, it is impossible to specify all possible failure cases that can occur during deployment. To address this limitation we combine model-based reinforcement learning and model-interpretability methods to propose a solution that self-generates simulated scenarios constrained by environmental concepts and dynamics learned in an unsupervised manner. In particular, an internal model of the agent's environment is conditioned on low-dimensional concept representations of the input space that are sensitive to the agent's actions. We demonstrate this method within a standard realistic driving simulator in a simple point-to-point navigation task, where we show dramatic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics · Machine Learning and Data Classification
