Towards Deploying VLA without Fine-Tuning: Plug-and-Play Inference-Time VLA Policy Steering via Embodied Evolutionary Diffusion

Zhuo Li; Junjia Liu; Zhipeng Dong; Tao Teng; Quentin Rouxel; Darwin Caldwell; Fei Chen

arXiv:2511.14178·cs.RO·April 17, 2026

Towards Deploying VLA without Fine-Tuning: Plug-and-Play Inference-Time VLA Policy Steering via Embodied Evolutionary Diffusion

Zhuo Li, Junjia Liu, Zhipeng Dong, Tao Teng, Quentin Rouxel, Darwin Caldwell, Fei Chen

PDF

1 Repo

TL;DR

VLA-Pilot enables zero-shot deployment of pre-trained VLA models in robotics by using inference-time policy steering, eliminating the need for fine-tuning or additional data collection.

Contribution

Introduces VLA-Pilot, a plug-and-play inference-time policy steering method that improves zero-shot generalization of VLA policies without fine-tuning.

Findings

01

VLA-Pilot significantly increases success rates across six manipulation tasks.

02

It enables robust zero-shot generalization to new tasks and robot embodiments.

03

Experimental videos and code are publicly available.

Abstract

Vision-Language-Action (VLA) models have demonstrated significant potential in real-world robotic manipulation. However, pre-trained VLA policies still suffer from substantial performance degradation during downstream deployment. Although fine-tuning can mitigate this issue, its reliance on costly demonstration collection and intensive computation makes it impractical in real-world settings. In this work, we introduce VLA-Pilot, a plug-and-play inference-time policy steering method for zero-shot deployment of pre-trained VLA without any additional fine-tuning or data collection. We evaluate VLA-Pilot on six real-world downstream manipulation tasks across two distinct robotic embodiments, encompassing both in-distribution and out-of-distribution scenarios. Experimental results demonstrate that VLA-Pilot substantially boosts the success rates of off-the-shelf pre-trained VLA policies,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://rip4kobe.github.io/vla-pilot
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.