Localizing and Correcting Errors for LLM-based Planners
Aditya Kumar, William W. Cohen

TL;DR
This paper introduces Localized In-Context Learning (L-ICL), a method to improve LLM-based planners by iteratively correcting specific planning errors, significantly increasing the validity of generated plans across multiple domains.
Contribution
The paper proposes L-ICL, a novel targeted correction technique that outperforms traditional instruction-based methods in improving LLM planning accuracy.
Findings
L-ICL achieves 89% valid plans on gridworld with 60 examples.
L-ICL outperforms baselines by up to 30% in plan validity.
Effective across multiple domains and LLM architectures.
Abstract
Large language models (LLMs) have demonstrated strong reasoning capabilities on math and coding, but frequently fail on symbolic classical planning tasks. Our studies, as well as prior work, show that LLM-generated plans routinely violate domain constraints given in their instructions (e.g., walking through walls). To address this failure, we propose iteratively augmenting instructions with Localized In-Context Learning (L-ICL) demonstrations: targeted corrections for specific failing steps. Specifically, L-ICL identifies the first constraint violation in a trace and injects a minimal input-output example giving the correct behavior for the failing step. Our proposed technique of L-ICL is much effective than explicit instructions or traditional ICL, which adds complete problem-solving trajectories, and many other baselines. For example, on an 8x8 gridworld, L-ICL produces valid plans…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI-based Problem Solving and Planning · Multimodal Machine Learning Applications · Constraint Satisfaction and Optimization
