Subgoal Graph-Augmented Planning for LLM-Guided Open-World Reinforcement Learning

Shanwei Fan; Bin Zhang; Zhiwei Xu; Yingxuan Teng; Siqi Dai; Lin Cheng; Guoliang Fan

arXiv:2511.20993·cs.LG·December 9, 2025

Subgoal Graph-Augmented Planning for LLM-Guided Open-World Reinforcement Learning

Shanwei Fan, Bin Zhang, Zhiwei Xu, Yingxuan Teng, Siqi Dai, Lin Cheng, Guoliang Fan

PDF

Open Access

TL;DR

This paper introduces SGA-ACR, a framework that enhances LLM-guided reinforcement learning by integrating environment-specific subgoal graphs and a multi-LLM pipeline to improve plan feasibility, verification, and execution in open-world tasks.

Contribution

It proposes a novel subgoal graph-augmented planning framework with explicit generation, critique, and refinement stages for better alignment between plans and environment actions.

Findings

01

Improved subgoal feasibility and reliability in open-world tasks.

02

Enhanced alignment between high-level plans and environment-specific actions.

03

Demonstrated effectiveness on 22 diverse tasks in the Crafter game.

Abstract

Large language models (LLMs) offer strong high-level planning capabilities for reinforcement learning (RL) by decomposing tasks into subgoals. However, their practical utility is limited by poor planning-execution alignment, which reflects a critical gap between abstract plans and actionable, environment-compatible behaviors. This misalignment arises from two interrelated limitations: (1) LLMs often produce subgoals that are semantically plausible but infeasible or irrelevant in the target environment due to insufficient grounding in environment-specific knowledge, and (2) single-LLM planning conflates generation with self-verification, resulting in overconfident yet unreliable subgoals that frequently fail during execution. To address these challenges, we propose Subgoal Graph-Augmented Actor-Critic-Refiner (SGA-ACR), a framework that integrates an environment-specific subgoal graph…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Artificial Intelligence in Games