GNNVerifier: Graph-based Verifier for LLM Task Planning
Yu Hao, Qiuyu Wang, Cheng Yang, Yawen Li, Zhiqiang Zhang, Chuan Shi

TL;DR
GNNVerifier introduces a graph neural network-based approach to evaluate and improve LLM-generated task plans by modeling them as graphs, enabling structural analysis and local plan corrections to enhance reliability.
Contribution
The paper presents a novel graph-based verifier that uses GNNs to assess and refine LLM task plans, addressing limitations of existing LLM-based verification methods.
Findings
Significant improvements in plan quality across multiple datasets.
Effective detection of structural plan flaws using graph representations.
Successful automatic plan correction guided by GNN feedback.
Abstract
Large language models (LLMs) facilitate the development of autonomous agents. As a core component of such agents, task planning aims to decompose complex natural language requests into concrete, solvable sub-tasks. Since LLM-generated plans are frequently prone to hallucinations and sensitive to long-context prom-pts, recent research has introduced plan verifiers to identify and correct potential flaws. However, most existing approaches still rely on an LLM as the verifier via additional prompting for plan review or self-reflection. LLM-based verifiers can be misled by plausible narration and struggle to detect failures caused by structural relations across steps, such as type mismatches, missing intermediates, or broken dependencies. To address these limitations, we propose a graph-based verifier for LLM task planning. Specifically, the proposed method has four major components:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · AI-based Problem Solving and Planning · Explainable Artificial Intelligence (XAI)
