Shifting Uncertainty to Critical Moments: Towards Reliable Uncertainty Quantification for VLA Model

Yanchuan Tang; Taowen Wang; Yuefei Chen; Boxuan Zhang; Qiang Guan; Ruixiang Tang

arXiv:2603.18342·cs.RO·March 20, 2026

Shifting Uncertainty to Critical Moments: Towards Reliable Uncertainty Quantification for VLA Model

Yanchuan Tang, Taowen Wang, Yuefei Chen, Boxuan Zhang, Qiang Guan, Ruixiang Tang

PDF

Open Access

TL;DR

This paper introduces a novel uncertainty quantification method for VLA models that preserves transient risk signals and improves failure prediction accuracy, enhancing safety and reliability in robotic control.

Contribution

It proposes a unified approach combining max-based pooling, motion-aware weighting, and Bayesian calibration to better detect failures in VLA models.

Findings

01

Significantly improves failure prediction accuracy on LIBERO benchmark.

02

Provides more reliable uncertainty signals for human intervention.

03

Enhances safety in robotic policy execution.

Abstract

Vision-Language-Action (VLA) models enable general-purpose robotic policies by mapping visual observations and language instructions to low-level actions, but they often lack reliable introspection. A common practice is to compute a token-level uncertainty signal and take its mean over a rollout. However, mean aggregation can dilute short-lived but safety-critical uncertainty spikes in continuous control. In particular, successful rollouts may contain localized high-entropy segments due to benign noise or non-critical micro-adjustments, while failure rollouts can appear low-entropy for most timesteps and only exhibit brief spikes near the onset of failure. We propose a unified uncertainty quantification approach for predicting rollout success versus failure that (1) uses max-based sliding window pooling to preserve transient risk signals, (2) applies motion-aware stability weighting to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Reinforcement Learning in Robotics · Robot Manipulation and Learning