Smaller Abstract State Spaces Enable Cross-Scale Generalization in Reinforcement Learning

Nasehatul Mustakim; Lucas Lehnert

arXiv:2605.20272·cs.LG·May 21, 2026

Smaller Abstract State Spaces Enable Cross-Scale Generalization in Reinforcement Learning

Nasehatul Mustakim, Lucas Lehnert

PDF

TL;DR

This paper introduces a theoretical model demonstrating that smaller abstract state spaces in RL agents facilitate better out-of-distribution generalization across tasks of varying complexity.

Contribution

It extends state abstraction frameworks to POMDPs and defines a successor-weighted model reduction to enable smaller abstract spaces for improved generalization.

Findings

01

Reducing abstract state space size improves OOD test performance.

02

A bound on OOD generalization performance is derived.

03

Confining agents to small, finite abstract states is necessary for generalization.

Abstract

While humans readily generalize abstract concepts to more complex or larger tasks, building Reinforcement Learning (RL) systems with this ability remains elusive. Here, we present the first theoretical model of how such Out-of-Distribution (OOD) generalization can be achieved in RL agents. Our approach considers Partially Observable Markov Decision Processes (POMDPs) and assumes that an intelligent agent uses an abstraction function to determine which experiences can be treated as equivalent and which must be distinguished. First, we extend the existing state abstraction framework and proof techniques to POMDPs. Then, we define a successor-weighted model reduction, a model reduction variant that enables compression into smaller abstract spaces than prior definitions allow. We derive a bound on the agent's OOD test performance, thereby defining the conditions under which OOD…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.