Drift is a Sampling Error: SNR-Aware Power Distributions for Long-Horizon Robotic Planning

Kewei Chen; Yayu Long; Mingsheng Shang

arXiv:2605.09537·cs.RO·May 12, 2026

Drift is a Sampling Error: SNR-Aware Power Distributions for Long-Horizon Robotic Planning

Kewei Chen, Yayu Long, Mingsheng Shang

PDF

TL;DR

This paper introduces CAPS, a novel inference-time framework that uses power sampling and SNR-based control to mitigate instruction drift in long-horizon robotic planning, improving robustness without retraining.

Contribution

It proposes Context-Aware Power Sampling (CAPS) and a SNR-based control mechanism to enhance long-horizon robotic control robustness during inference.

Findings

01

CAPS significantly outperforms strong baselines like OpenVLA and TACO.

02

The SNR-based control triggers adaptive search only when drift risk is detected.

03

Experiments on multiple benchmarks demonstrate improved long-horizon robustness.

Abstract

Despite rapid progress in Vision-Language-Action (VLA) models for robotic control, instruction drift remains a persistent failure mode in long-horizon tasks. This paper reconceptualizes this phenomenon, positing that instruction drift is fundamentally a systematic sampling error: local greedy sampling is prone to collapsing into "Negative Pivotal Windows"--irreversible local optima with high local probability that sever global success pathways. To address this, we propose Context-Aware Power Sampling (CAPS), a training-free inference-time computation framework. CAPS leverages power distributions to sharpen global trajectory probabilities, enabling lookahead search over the model's conditional generative trajectory distribution. Furthermore, we introduce a metacognitive control mechanism based on Signal-to-Noise Ratio (SNR). This mechanism triggers adaptive MCMC search solely when drift…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.