Human-like Working Memory Interference in Large Language Models

Hua-Dong Xiong (1); Li Ji-An (2); Jiaqi Huang (3; 4); Robert C. Wilson (1; 5); Kwonjoon Lee (4); Xue-Xin Wei (6) ((1) School of Psychological; Brain Sciences; Georgia Tech; (2) Department of Psychology; New York University; (3) Department of Cognitive Science; Indiana University Bloomington; (4) Honda Research Institute; (5) Center of Excellence for Computational Cognition; Georgia Tech; (6) Departments of Neuroscience; Psychology; The University of Texas at Austin)

arXiv:2604.09670·cs.LG·April 14, 2026

Human-like Working Memory Interference in Large Language Models

Hua-Dong Xiong (1), Li Ji-An (2), Jiaqi Huang (3, 4), Robert C. Wilson (1, 5), Kwonjoon Lee (4), Xue-Xin Wei (6) ((1) School of Psychological, Brain Sciences, Georgia Tech, (2) Department of Psychology, New York University, (3) Department of Cognitive Science

PDF

TL;DR

This paper investigates the limitations of working memory in large language models, revealing that interference control is a key factor affecting their ability to maintain task-relevant information.

Contribution

It uncovers the shared computational mechanism of interference in LLMs' working memory and demonstrates how interference suppression can improve performance.

Findings

01

LLMs show performance degradation with increased memory load.

02

Models encode multiple items in entangled representations, causing interference.

03

Suppressing stimulus content improves LLM working memory performance.

Abstract

Intelligent systems must maintain and manipulate task-relevant information online to adapt to dynamic environments and changing goals. This capacity, known as working memory, is fundamental to human reasoning and intelligence. Despite having on the order of 100 billion neurons, both biological and artificial systems exhibit limitations in working memory. This raises a key question: why do large language models (LLMs) show such limitations, given that transformers have full access to prior context through attention? We find that although a two-layer transformer can be trained to solve working memory tasks perfectly, a diverse set of pretrained LLMs continues to show working memory limitations. Notably, LLMs reproduce interference signatures observed in humans: performance degrades with increasing memory load and is biased by recency and stimulus statistics. Across models, stronger…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.