TAME: A Trustworthy Test-Time Evolution of Agent Memory with Systematic Benchmarking
Yu Cheng, Jiuan Zhou, Yongkang Hu, Yihang Chen, Huichi Zhou, Mingang Chen, Zhizhong Zhang, Kun Shao, Yuan Xie, Zhaoxia Yin

TL;DR
This paper introduces TAME, a dual-memory framework for agent memory evolution that enhances trustworthiness and task performance during benign task evolution, addressing the challenge of Agent Memory Misevolution.
Contribution
The paper presents TAME, a novel dual-memory evolutionary approach with systematic benchmarking to improve agent safety and utility during test-time memory evolution.
Findings
TAME effectively mitigates misevolution, maintaining trustworthiness.
TAME improves both safety and task utility in experiments.
Benchmarking reveals trustworthiness declines in existing methods.
Abstract
Test-time evolution of agent memory serves as a pivotal paradigm for achieving AGI by bolstering complex reasoning through experience accumulation. However, even during benign task evolution, agent safety alignment remains vulnerable-a phenomenon known as Agent Memory Misevolution. To evaluate this phenomenon, we construct the Trust-Memevo benchmark to assess multi-dimensional trustworthiness during benign task evolution, revealing an overall decline in trustworthiness across various task domains and evaluation settings. To address this issue, we propose TAME, a dual-memory evolutionary framework that separately evolves executor memory to improve task performance by distilling generalizable methodologies, and evaluator memory to refine assessments of both safety and task utility based on historical feedback. Through a closed loop of memory filtering, draft generation, trustworthy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Ethics and Social Impacts of AI · AI-based Problem Solving and Planning
