Memory Allocation in Resource-Constrained Reinforcement Learning

Massimiliano Tamborski; David Abel

arXiv:2506.17263·cs.LG·June 24, 2025

Memory Allocation in Resource-Constrained Reinforcement Learning

Massimiliano Tamborski, David Abel

PDF

TL;DR

This paper investigates how memory limitations affect reinforcement learning agents' performance, analyzing the trade-offs in memory allocation between modeling and planning in resource-constrained environments.

Contribution

It introduces an analysis of memory allocation strategies in resource-constrained reinforcement learning, focusing on MCTS and DQN algorithms.

Findings

01

Memory allocation impacts agent performance significantly.

02

Different environments require different memory strategies.

03

Trade-offs between modeling and planning are crucial for efficiency.

Abstract

Resource constraints can fundamentally change both learning and decision-making. We explore how memory constraints influence an agent's performance when navigating unknown environments using standard reinforcement learning algorithms. Specifically, memory-constrained agents face a dilemma: how much of their limited memory should be allocated to each of the agent's internal processes, such as estimating a world model, as opposed to forming a plan using that model? We study this dilemma in MCTS- and DQN-based algorithms and examine how different allocations of memory impact performance in episodic and continual learning settings.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.