Contrastive Conceptor Activation Steering (COAST): Unlocking Vision-Language-Action Models through Hidden States

Miranda Muqing Miao; Subin Kim; Brandon Yang; Lyle Ungar

arXiv:2605.17144·cs.RO·May 19, 2026

Contrastive Conceptor Activation Steering (COAST): Unlocking Vision-Language-Action Models through Hidden States

Miranda Muqing Miao, Subin Kim, Brandon Yang, Lyle Ungar

PDF

TL;DR

COAST is a method that improves vision-language-action models by steering their latent states into success-critical subspaces identified through conceptors, significantly enhancing robotic task performance.

Contribution

The paper introduces COAST, a novel, training-free technique using conceptors to steer VLA model latents into success subspaces, boosting task success rates.

Findings

01

COAST improves simulation success rate by over 20%.

02

COAST enhances real-robot task success by over 40%.

03

Shared failure modes across tasks can be mitigated using previously fitted conceptors.

Abstract

Vision-Language-Action (VLA) models leverage powerful perceptual priors from web-scale Vision-Language Model (VLM) pre-training, yet they remain surprisingly brittle in practice, frequently failing at simple robotic tasks. To mitigate this, we propose Contrastive Conceptor Activation Steering (COAST). COAST builds on the notion of a "conceptor", a linear operator that soft-projects data into the principal components of a target distribution. COAST uses conceptors to identify success-critical subspaces for a target robotic task from a few examples of success and failure rollouts. At inference time, it steers VLA latents into these identified success subspaces to improve task outcomes. Across three architecturally distinct neural policies (flow-matching VLA, autoregressive VLA, and Diffusion Policy), COAST improves absolute mean simulation and real-robot task success rate by over 20 and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.