Learning Generalizable Behavior via Visual Rewrite Rules
Yiheng Xie, Mingxuan Li, Shangqun Yu, Michael Littman

TL;DR
This paper introduces visual rewrite rules (VRRs), a neural network-free method for capturing environment dynamics, enabling more robust, sample-efficient, and generalizable reinforcement learning agents through explicit visual change modeling.
Contribution
The paper presents a novel approach to learn environment dynamics using visual rewrite rules, avoiding neural networks and improving generalization and efficiency.
Findings
VRR agents outperform deep agents in classical games.
VRR agents exhibit high sample efficiency.
VRR agents demonstrate robust generalization.
Abstract
Though deep reinforcement learning agents have achieved unprecedented success in recent years, their learned policies can be brittle, failing to generalize to even slight modifications of their environments or unfamiliar situations. The black-box nature of the neural network learning dynamics makes it impossible to audit trained deep agents and recover from such failures. In this paper, we propose a novel representation and learning approach to capture environment dynamics without using neural networks. It originates from the observation that, in games designed for people, the effect of an action can often be perceived in the form of local changes in consecutive visual observations. Our algorithm is designed to extract such vision-based changes and condense them into a set of action-dependent descriptive rules, which we call ''visual rewrite rules'' (VRRs). We also present preliminary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics
