Open-Loop Planning, Closed-Loop Verification: Speculative Verification for VLA

Zihua Wang; Zhitao Lin; Ruibo Li; Yu Zhang; Xu Yang; Siya Mi; Xiu-Shen Wei

arXiv:2604.02965·cs.RO·April 6, 2026

Open-Loop Planning, Closed-Loop Verification: Speculative Verification for VLA

Zihua Wang, Zhitao Lin, Ruibo Li, Yu Zhang, Xu Yang, Siya Mi, Xiu-Shen Wei

PDF

1 Repo

TL;DR

SV-VLA enhances vision-language-action models by integrating open-loop planning with lightweight online verification, improving efficiency and robustness in dynamic environments.

Contribution

It introduces a novel framework combining macro-planning with online verification to improve VLA control performance.

Findings

01

SV-VLA achieves efficient long-horizon planning with online verification.

02

The framework improves robustness against environmental changes.

03

Code is publicly available at the provided GitHub URL.

Abstract

Vision-Language-Action (VLA) models, as large foundation models for embodied control, have shown strong performance in manipulation tasks. However, their performance comes at high inference cost. To improve efficiency, recent methods adopt action chunking, which predicts a sequence of future actions for open-loop execution. Although effective for reducing computation, open-loop execution is sensitive to environmental changes and prone to error accumulation due to the lack of close-loop feedback. To address this limitation, we propose Speculative Verification for VLA Control (SV-VLA), a framework that combines efficient open-loop long-horizon planning with lightweight closed-loop online verification. Specifically, SV-VLA uses a heavy VLA as a low-frequency macro-planner to generate an action chunk together with a planning context, while a lightweight verifier continuously monitors…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

edsad122/SV-VLA
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.