Smaller Abstract State Spaces Enable Cross-Scale Generalization in Reinforcement Learning
Nasehatul Mustakim, Lucas Lehnert

TL;DR
This paper introduces a theoretical model demonstrating that smaller abstract state spaces in RL agents facilitate better out-of-distribution generalization across tasks of varying complexity.
Contribution
It extends state abstraction frameworks to POMDPs and defines a successor-weighted model reduction to enable smaller abstract spaces for improved generalization.
Findings
Reducing abstract state space size improves OOD test performance.
A bound on OOD generalization performance is derived.
Confining agents to small, finite abstract states is necessary for generalization.
Abstract
While humans readily generalize abstract concepts to more complex or larger tasks, building Reinforcement Learning (RL) systems with this ability remains elusive. Here, we present the first theoretical model of how such Out-of-Distribution (OOD) generalization can be achieved in RL agents. Our approach considers Partially Observable Markov Decision Processes (POMDPs) and assumes that an intelligent agent uses an abstraction function to determine which experiences can be treated as equivalent and which must be distinguished. First, we extend the existing state abstraction framework and proof techniques to POMDPs. Then, we define a successor-weighted model reduction, a model reduction variant that enables compression into smaller abstract spaces than prior definitions allow. We derive a bound on the agent's OOD test performance, thereby defining the conditions under which OOD…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
