Unable to Forget: Proactive Interference Reveals Working Memory Limits in LLMs Beyond Context Length

Chupei Wang (University of Virginia); Jiaqiu Vince Sun (New York University)

arXiv:2506.08184·cs.CL·August 1, 2025

Unable to Forget: Proactive Interference Reveals Working Memory Limits in LLMs Beyond Context Length

Chupei Wang (University of Virginia), Jiaqiu Vince Sun (New York University)

PDF

Open Access 4 Datasets

TL;DR

This paper demonstrates that LLMs experience a fundamental working memory bottleneck due to proactive interference, leading to retrieval errors with longer contexts, and highlights the need for methods to improve interference suppression.

Contribution

We adapt the proactive interference paradigm to evaluate LLMs, revealing their susceptibility to interference and identifying a working memory limit beyond context length.

Findings

01

LLMs' retrieval accuracy declines log-linearly with accumulated interference.

02

Prompt engineering has limited success in mitigating interference effects.

03

Interference causes retrieval errors from previously overwritten information.

Abstract

Information retrieval in Large Language Models (LLMs) is increasingly recognized as intertwined with generation capabilities rather than mere lookup. While longer contexts are often assumed to improve retrieval, the effects of intra-context interference remain understudied. To address this, we adapt the proactive interference (PI) paradigm from cognitive science, where earlier information disrupts recall of newer updates. In humans, susceptibility to such interference is inversely linked to working memory capacity. We introduce PI-LLM, an evaluation that sequentially streams semantically related key-value updates and queries only the final values. Although these final values are clearly positioned just before the query, LLM retrieval accuracy declines log-linearly toward zero as interference accumulates; errors arise from retrieving previously overwritten values. Attempts to mitigate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBusiness Process Modeling and Analysis · Big Data and Business Intelligence · AI and HR Technologies