Drift is a Sampling Error: SNR-Aware Power Distributions for Long-Horizon Robotic Planning
Kewei Chen, Yayu Long, Mingsheng Shang

TL;DR
This paper introduces CAPS, a novel inference-time framework that uses power sampling and SNR-based control to mitigate instruction drift in long-horizon robotic planning, improving robustness without retraining.
Contribution
It proposes Context-Aware Power Sampling (CAPS) and a SNR-based control mechanism to enhance long-horizon robotic control robustness during inference.
Findings
CAPS significantly outperforms strong baselines like OpenVLA and TACO.
The SNR-based control triggers adaptive search only when drift risk is detected.
Experiments on multiple benchmarks demonstrate improved long-horizon robustness.
Abstract
Despite rapid progress in Vision-Language-Action (VLA) models for robotic control, instruction drift remains a persistent failure mode in long-horizon tasks. This paper reconceptualizes this phenomenon, positing that instruction drift is fundamentally a systematic sampling error: local greedy sampling is prone to collapsing into "Negative Pivotal Windows"--irreversible local optima with high local probability that sever global success pathways. To address this, we propose Context-Aware Power Sampling (CAPS), a training-free inference-time computation framework. CAPS leverages power distributions to sharpen global trajectory probabilities, enabling lookahead search over the model's conditional generative trajectory distribution. Furthermore, we introduce a metacognitive control mechanism based on Signal-to-Noise Ratio (SNR). This mechanism triggers adaptive MCMC search solely when drift…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
