Reshaping Action Error Distributions for Reliable Vision-Language-Action Models

Shuanghao Bai; Dakai Wang; Cheng Chi; Wanqi Zhou; Jing Lyu; Xiaoguang Zhao; Pengwei Wang; Zhongyuan Wang; Lei Xing; Shanghang Zhang; Badong Chen

arXiv:2602.04228·cs.RO·February 5, 2026

Reshaping Action Error Distributions for Reliable Vision-Language-Action Models

Shuanghao Bai, Dakai Wang, Cheng Chi, Wanqi Zhou, Jing Lyu, Xiaoguang Zhao, Pengwei Wang, Zhongyuan Wang, Lei Xing, Shanghang Zhang, Badong Chen

PDF

Open Access

TL;DR

This paper introduces a novel training approach for vision-language-action models in robotics by reshaping action error distributions using Minimum Error Entropy, leading to improved success rates and robustness without extra inference costs.

Contribution

It proposes integrating Minimum Error Entropy into VLA training, moving beyond traditional MSE regression, and provides theoretical analysis of its effectiveness.

Findings

01

Consistent success rate improvements across benchmarks.

02

Enhanced robustness in noisy and few-shot settings.

03

Negligible additional training cost and no inference impact.

Abstract

In robotic manipulation, vision-language-action (VLA) models have emerged as a promising paradigm for learning generalizable and scalable robot policies. Most existing VLA frameworks rely on standard supervised objectives, typically cross-entropy for discrete actions and mean squared error (MSE) for continuous action regression, which impose strong pointwise constraints on individual predictions. In this work, we focus on continuous-action VLA models and move beyond conventional MSE-based regression by reshaping action error distributions during training. Drawing on information-theoretic principles, we introduce Minimum Error Entropy (MEE) into modern VLA architectures and propose a trajectory-level MEE objective, together with two weighted variants, combined with MSE for continuous-action VLA training. We evaluate our approaches across standard, few-shot, and noisy settings on multiple…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Robot Manipulation and Learning · Reinforcement Learning in Robotics