VeriGraph: Scene Graphs for Execution Verifiable Robot Planning

Daniel Ekpo; Mara Levy; Saksham Suri; Chuong Huynh; Archana Swaminathan; Abhinav Shrivastava

arXiv:2411.10446·cs.RO·April 20, 2026

VeriGraph: Scene Graphs for Execution Verifiable Robot Planning

Daniel Ekpo, Mara Levy, Saksham Suri, Chuong Huynh, Archana Swaminathan, Abhinav Shrivastava

PDF

1 Repo

TL;DR

VeriGraph introduces a framework that uses scene graphs to verify and refine robot action plans generated by vision-language models, significantly improving task success rates.

Contribution

It integrates scene graphs with large language models to verify and correct robot plans, enhancing reliability and execution success.

Findings

01

Outperforms baseline methods by 58% on language tasks.

02

Achieves 56% improvement on tangram puzzles.

03

Improves task completion by 30% on image-based tasks.

Abstract

Recent progress in vision-language models (VLMs) has opened new possibilities for robot task planning, but these models often produce incorrect action sequences. To address these limitations, we propose VeriGraph, a novel framework that integrates VLMs for robotic planning while verifying action feasibility. VeriGraph uses scene graphs as an intermediate representation to capture key objects and spatial relationships, enabling more reliable plan verification and refinement. The system generates a scene graph from input images and uses it to iteratively check and correct action sequences generated by an LLM-based task planner, ensuring constraints are respected and actions are executable. Our approach significantly enhances task completion rates across diverse manipulation scenarios, outperforming baseline methods by 58% on language-based tasks, 56% on tangram puzzle tasks, and 30% on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://verigraph-agent.github.io
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.