Task Memory Engine: Spatial Memory for Robust Multi-Step LLM Agents
Ye Ye

TL;DR
The paper introduces the Task Memory Engine (TME), a modular spatial memory system that enhances LLM agents' robustness in multi-step tasks by replacing linear context with a graph-based memory structure, significantly reducing hallucinations and misinterpretations.
Contribution
TME is a novel spatial memory framework that transforms LLMs into revision-aware, multi-turn reasoning agents without fine-tuning, using dynamic task graphs to improve reliability.
Findings
TME eliminates 100% of hallucinations and misinterpretations in three tasks.
Reduces hallucinations by 66.7% and misinterpretations by 83.3% across 27 user turns.
Outperforms ReAct in multi-turn scenarios.
Abstract
Large Language Models (LLMs) falter in multi-step interactions -- often hallucinating, repeating actions, or misinterpreting user corrections -- due to reliance on linear, unstructured context. This fragility stems from the lack of persistent memory to track evolving goals and task dependencies, undermining trust in autonomous agents. We introduce the Task Memory Engine (TME), a modular memory controller that transforms existing LLMs into robust, revision-aware agents without fine-tuning. TME implements a spatial memory framework that replaces flat context with graph-based structures to support consistent, multi-turn reasoning. Departing from linear concatenation and ReAct-style prompting, TME builds a dynamic task graph -- either a tree or directed acyclic graph (DAG) -- to map user inputs to subtasks, align them with prior context, and enable dependency-tracked revisions. Its Task…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModular Robots and Swarm Intelligence · Distributed and Parallel Computing Systems · Multi-Agent Systems and Negotiation
MethodsALIGN
