Learning Generalizable Behavior via Visual Rewrite Rules

Yiheng Xie; Mingxuan Li; Shangqun Yu; Michael Littman

arXiv:2112.05218·cs.AI·December 13, 2021

Learning Generalizable Behavior via Visual Rewrite Rules

Yiheng Xie, Mingxuan Li, Shangqun Yu, Michael Littman

PDF

Open Access

TL;DR

This paper introduces visual rewrite rules (VRRs), a neural network-free method for capturing environment dynamics, enabling more robust, sample-efficient, and generalizable reinforcement learning agents through explicit visual change modeling.

Contribution

The paper presents a novel approach to learn environment dynamics using visual rewrite rules, avoiding neural networks and improving generalization and efficiency.

Findings

01

VRR agents outperform deep agents in classical games.

02

VRR agents exhibit high sample efficiency.

03

VRR agents demonstrate robust generalization.

Abstract

Though deep reinforcement learning agents have achieved unprecedented success in recent years, their learned policies can be brittle, failing to generalize to even slight modifications of their environments or unfamiliar situations. The black-box nature of the neural network learning dynamics makes it impossible to audit trained deep agents and recover from such failures. In this paper, we propose a novel representation and learning approach to capture environment dynamics without using neural networks. It originates from the observation that, in games designed for people, the effect of an action can often be perceived in the form of local changes in consecutive visual observations. Our algorithm is designed to extract such vision-based changes and condense them into a set of action-dependent descriptive rules, which we call ''visual rewrite rules'' (VRRs). We also present preliminary…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics