On Memory: A comparison of memory mechanisms in world models
Eli J. Laird, Corey Clark

TL;DR
This paper analyzes how different memory mechanisms in transformer-based world models affect their ability to remember past states and perform long-term planning, highlighting improvements in memory span and loop closure capabilities.
Contribution
It introduces a taxonomy of memory mechanisms, evaluates their effectiveness in extending memory span, and demonstrates their role in enabling loop closures in world models.
Findings
Memory mechanisms extend the effective memory span of vision transformers.
Memory augmentation improves loop closure capabilities.
Trade-offs exist between different memory mechanisms' effectiveness.
Abstract
World models enable agents to plan within imagined environments by predicting future states conditioned on past observations and actions. However, their ability to plan over long horizons is limited by the effective memory span of the backbone architecture. This limitation leads to perceptual drift in long rollouts, hindering the model's capacity to perform loop closures within imagined trajectories. In this work, we investigate the effective memory span of transformer-based world models through an analysis of several memory augmentation mechanisms. We introduce a taxonomy that distinguishes between memory encoding and memory injection mechanisms, motivating their roles in extending the world model's memory through the lens of residual stream dynamics. Using a state recall evaluation task, we measure the memory recall of each mechanism and analyze its respective trade-offs. Our findings…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · AI-based Problem Solving and Planning · Embodied and Extended Cognition
