Subgoal Graph-Augmented Planning for LLM-Guided Open-World Reinforcement Learning
Shanwei Fan, Bin Zhang, Zhiwei Xu, Yingxuan Teng, Siqi Dai, Lin Cheng, Guoliang Fan

TL;DR
This paper introduces SGA-ACR, a framework that enhances LLM-guided reinforcement learning by integrating environment-specific subgoal graphs and a multi-LLM pipeline to improve plan feasibility, verification, and execution in open-world tasks.
Contribution
It proposes a novel subgoal graph-augmented planning framework with explicit generation, critique, and refinement stages for better alignment between plans and environment actions.
Findings
Improved subgoal feasibility and reliability in open-world tasks.
Enhanced alignment between high-level plans and environment-specific actions.
Demonstrated effectiveness on 22 diverse tasks in the Crafter game.
Abstract
Large language models (LLMs) offer strong high-level planning capabilities for reinforcement learning (RL) by decomposing tasks into subgoals. However, their practical utility is limited by poor planning-execution alignment, which reflects a critical gap between abstract plans and actionable, environment-compatible behaviors. This misalignment arises from two interrelated limitations: (1) LLMs often produce subgoals that are semantically plausible but infeasible or irrelevant in the target environment due to insufficient grounding in environment-specific knowledge, and (2) single-LLM planning conflates generation with self-verification, resulting in overconfident yet unreliable subgoals that frequently fail during execution. To address these challenges, we propose Subgoal Graph-Augmented Actor-Critic-Refiner (SGA-ACR), a framework that integrates an environment-specific subgoal graph…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Artificial Intelligence in Games
