Retrieve-then-Steer: Online Success Memory for Test-Time Adaptation of Generative VLAs

Jianchao Zhao; Huoren Yang; Yusong Hu; Yuyang Gao; Qiguan Ou; Cong Wan; SongLin Dong; Zhiheng Ma; Yihong Gong

arXiv:2605.10094·cs.RO·May 13, 2026

Retrieve-then-Steer: Online Success Memory for Test-Time Adaptation of Generative VLAs

Jianchao Zhao, Huoren Yang, Yusong Hu, Yuyang Gao, Qiguan Ou, Cong Wan, SongLin Dong, Zhiheng Ma, Yihong Gong

PDF

TL;DR

This paper presents a retrieve-then-steer framework for test-time adaptation of generative vision-language-action models, improving robotic task success by leveraging successful past experiences without retraining.

Contribution

It introduces an online success-memory mechanism that retrieves and incorporates environment-specific successful actions into the model during deployment.

Findings

01

Enhanced task success in simulation and real-world tests.

02

Improved stability in long-horizon, multi-stage tasks.

03

Lightweight, non-parametric adaptation without retraining.

Abstract

Vision-Language-Action (VLA) models show strong potential for general-purpose robotic manipulation, yet their closed-loop reliability often degrades under local deployment conditions. Existing evaluations typically treat test episodes as independent zero-shot trials. However, real robots often operate repeatedly in the same or slowly changing environments, where successful executions provide environment-verified evidence of reliable behavior patterns. We study this persistent-deployment setting, asking whether a partially competent frozen VLA can improve its reliability by reusing its successful test-time experience. We propose an online success-memory guided test-time adaptation framework for generative VLAs. During deployment, the robot stores progress-calibrated successful observation-action segments in a long-term memory. At inference, it retrieves state-relevant action chunks,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.