The Trap of Trajectory: Towards Understanding and Mitigating Spurious Correlations in Agentic Memory

Luoxi Tang; Rupali Rajendra Vaje; Yuqiao Meng; Sakshi Sunil Narkar; Weicheng Ma; Zeyu Ding; Dazheng Zhang; Zhaohan Xi

arXiv:2605.09330·cs.LG·May 12, 2026

The Trap of Trajectory: Towards Understanding and Mitigating Spurious Correlations in Agentic Memory

Luoxi Tang, Rupali Rajendra Vaje, Yuqiao Meng, Sakshi Sunil Narkar, Weicheng Ma, Zeyu Ding, Dazheng Zhang, Zhaohan Xi

PDF

TL;DR

This paper investigates the vulnerability of agentic memory in LLMs to spurious correlations, benchmarks their impact, and introduces CAMEL, a calibration method that mitigates these issues while maintaining performance.

Contribution

It provides a benchmark for spurious patterns in agentic memory and proposes CAMEL, a calibration technique to reduce reliance on spurious correlations in memory systems.

Findings

01

Memory improves reasoning on clean inputs.

02

Memory reliance on spurious patterns increases with their presence.

03

CAMEL reduces reliance on spurious patterns across architectures.

Abstract

Agentic memory enables LLMs to persist information beyond a single context window and reuse it in later decisions, but it also introduces a new vulnerability: spurious correlations, where retrieved memory carries miscorrelated evidence and propagates erroneous reasoning into downstream decisions. Despite the widespread use of agentic memory, this risk remains largely underexplored. We address it from two aspects. First, we benchmark several canonical types of spurious patterns identified through causal structure and record them across trajectory-level memory. Diagnosing agentic memory systems on this benchmark reveals that memory improves reasoning on clean inputs but amplifies reliance on spurious patterns when they are present. Second, we propose CAMEL, a plug-and-play calibration method that operates across diverse memory architectures at both write and retrieval time. CAMEL…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.