Joint Optimization of Multi-agent Memory System
Wenyu Mao, Haoyang Liu, Haosong Tan, Yaorui Shi, Jiancan Wu, An Zhang, Xiang Wang

TL;DR
This paper introduces CoMAM, a joint optimization framework for multi-agent memory systems in LLMs, using end-to-end reinforcement learning to improve collaboration and performance.
Contribution
It proposes a novel joint optimization method with adaptive credit assignment for multi-agent memory systems, addressing limitations of independent agent training.
Findings
CoMAM outperforms existing memory systems in experiments.
Joint optimization improves agent collaboration and overall system performance.
Adaptive credit assignment enhances targeted feedback for agents.
Abstract
Memory systems are critical for LLMs, mitigating context window limitations and supporting long-horizon user-LLM interactions. Such systems typically comprise multiple agents responsible for memory construction and retrieval. Existing approaches often optimize each agent independently under a shared global objective (e.g., downstream QA accuracy), treating other agents as a static environment. However, this design has two key limitations: (1) independent optimization ignores inter-agent dependencies and lacks agents' co-adaptation, and (2) relying solely on sparse global rewards provides limited guidance for optimizing specialized agents and causes ambiguous credit assignment. These may ultimately limit agents' effective collaboration in the memory system. To address these limitations, we propose CoMAM, a joint optimization framework that promotes collaboration among agents via…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
