
TL;DR
This paper introduces Memory Worth, a simple, theoretically grounded metric for assessing memory usefulness in agent systems, enabling better memory management based on outcome co-occurrence.
Contribution
It proposes Memory Worth, a lightweight two-counter signal that tracks memory success, with proven convergence properties and empirical validation in synthetic and real retrieval scenarios.
Findings
Memory Worth correlates strongly with true utility (rho = 0.89) after 10,000 episodes.
Stale memories tend to have low Memory Worth, guiding suppression.
The method requires only two scalar counters per memory and can be integrated into existing architectures.
Abstract
Agent memory systems accumulate experience but currently lack a principled operational metric for memory quality governance -- deciding which memories to trust, suppress, or deprecate as the agent's task distribution shifts. Write-time importance scores are static; dynamic management systems use LLM judgment or structural heuristics rather than outcome feedback. This paper proposes Memory Worth (MW): a two-counter per-memory signal that tracks how often a memory co-occurs with successful versus failed outcomes, providing a lightweight, theoretically grounded foundation for staleness detection, retrieval suppression, and deprecation decisions. We prove that MW converges almost surely to the conditional success probability p+(m) = Pr[y_t = +1 | m in M_t] -- the probability of task success given that memory m is retrieved -- under a stationary retrieval regime with a minimum exploration…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
