TALM: Dynamic Tree-Structured Multi-Agent Framework with Long-Term Memory for Scalable Code Generation
Ming-Tung Shen, Yuh-Jzer Joung

TL;DR
TALM introduces a dynamic, tree-structured multi-agent framework with long-term memory that improves complex code generation by enhancing reasoning flexibility, error correction, and knowledge reuse.
Contribution
It presents a novel framework combining structured task decomposition, localized re-reasoning, and long-term memory for scalable, efficient code generation.
Findings
Outperforms existing models on HumanEval, BigCodeBench, ClassEval benchmarks.
Achieves higher reasoning accuracy and token efficiency.
Demonstrates robustness in complex code generation tasks.
Abstract
Agentic code generation requires large language models (LLMs) capable of complex context management and multi-step reasoning. Prior multi-agent frameworks attempt to address these challenges through collaboration, yet they often suffer from rigid workflows and high reasoning recovery costs. To overcome these limitations, we propose TALM (Tree-Structured Multi-Agent Framework with Long-Term Memory), a dynamic framework that integrates structured task decomposition, localized re-reasoning, and long-term memory mechanisms. TALM employs an extensible tree-based collaboration structure. The parent-child relationships, when combined with a divide-and-conquer strategy, enhance reasoning flexibility and enable efficient error correction across diverse task scopes. Furthermore, a long-term memory module enables semantic querying and integration of prior knowledge, supporting implicit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
