PAACE: A Plan-Aware Automated Agent Context Engineering Framework
Kamer Ali Yuksel

TL;DR
PAACE is a framework that enhances large language model agents by optimizing context management through plan-aware compression, leading to improved accuracy and efficiency in complex workflows.
Contribution
It introduces a unified plan-aware context engineering framework with synthetic data generation and plan-aware compression methods for LLM agents.
Findings
Improves agent accuracy on long-horizon benchmarks.
Reduces context load and attention dependency.
Maintains high performance with significantly lower inference costs.
Abstract
Large Language Model (LLM) agents are increasingly deployed in complex, multi-step workflows involving planning, tool use, reflection, and interaction with external knowledge systems. These workflows generate rapidly expanding contexts that must be curated, transformed, and compressed to maintain fidelity, avoid attention dilution, and reduce inference cost. Prior work on summarization and query-aware compression largely ignores the multi-step, plan-aware nature of agentic reasoning. In this work, we introduce PAACE (Plan-Aware Automated Context Engineering), a unified framework for optimizing the evolving state of LLM agents through next-k-task relevance modeling, plan-structure analysis, instruction co-refinement, and function-preserving compression. PAACE comprises (1) PAACE-Syn, a large-scale generator of synthetic agent workflows annotated with stepwise compression supervision, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation · Multimodal Machine Learning Applications · Topic Modeling
