Compiled Memory: Not More Information, but More Precise Instructions for Language Agents

James Rhodes; George Kang

arXiv:2603.15666·cs.AI·March 18, 2026

Compiled Memory: Not More Information, but More Precise Instructions for Language Agents

James Rhodes, George Kang

PDF

Open Access

TL;DR

This paper introduces Atlas, a memory system that compiles task experiences into instruction rewrites for language agents, improving performance without traditional storage or fine-tuning.

Contribution

Atlas offers a novel approach by distilling experience into instruction prompts, enhancing agent behavior without fine-tuning or retrieval-augmented generation.

Findings

01

Improves GPT-4o token-level F1 by +8.7pp on CUAD.

02

Enhances HotpotQA joint F1 by +3.16pp.

03

Task-specific compiled knowledge benefits different models.

Abstract

Existing memory systems for language agents address memory management: how to retrieve and page more information within a context budget. We address a complementary problem -- memory utility: what experience is worth keeping, and how it should change agent behavior. We present Atlas, a memory kernel that compiles accumulated task experience into an agent's instruction structure -- without fine-tuning, RAG, or human intervention. Memory is distillation, not storage; delivery is instruction rewriting, not context injection. Facts extracted from agent failures and successes are verified through a three-step promotion gate and delivered by rewriting the agent's system prompt with learned sub-bullets. On CUAD contract analysis, the evolved prompt improves GPT-4o token-level F1 by $+ 8.7$ pp and precision by $+ 12.5$ pp. On HotpotQA multi-hop QA, joint F1 improves $+ 3.16$ pp. An ablation isolates…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFerroelectric and Negative Capacitance Devices · Natural Language Processing Techniques · Topic Modeling