TL;DR
This study empirically examines how memory management operations influence LLM agent behavior, revealing the experience-following property and challenges like error propagation, with implications for designing more robust long-term agents.
Contribution
It systematically analyzes the impact of memory addition and deletion on LLM agents, highlighting the importance of experience quality regulation for improved long-term performance.
Findings
LLM agents exhibit an experience-following property.
Error propagation can degrade future performance.
Regulating experience quality enhances agent robustness.
Abstract
Memory is a critical component in large language model (LLM)-based agents, enabling them to store and retrieve past executions to improve task performance over time. In this paper, we conduct an empirical study on how memory management choices impact the LLM agents' behavior, especially their long-term performance. Specifically, we focus on two fundamental memory management operations that are widely used by many agent frameworks-memory addition and deletion-to systematically study their impact on the agent behavior. Through our quantitative analysis, we find that LLM agents display an experience-following property: high similarity between a task input and the input in a retrieved memory record often results in highly similar agent outputs. Our analysis further reveals two significant challenges associated with this property: error propagation, where inaccuracies in past experiences…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFocus
