Guided Streaming Stochastic Interpolant Policy
Puming Jiang, Meiyi Wang, Kelvin Lin, Ce Hao, Harold Soh

TL;DR
This paper introduces a real-time, guidance-based control framework for generative robot policies, improving reactivity and adaptability in dynamic environments without retraining.
Contribution
It derives an optimal guidance law for Stochastic Interpolants and develops the Streaming Stochastic Interpolant Policy (SSIP) with two guidance mechanisms, enhancing control speed and flexibility.
Findings
Outperforms chunk-based policies in reactivity and guidance quality.
Enables zero-shot adaptation with stochastic trajectory ensemble guidance.
Provides superior control in dynamic, unstructured environments.
Abstract
Inference-time guidance is essential for steering generative robot policies toward dynamic objectives without retraining, yet existing methods are largely confined to chunk-based architectures that exhibit high latency and lack the reactivity needed for test-time preference alignment or obstacle avoidance. In this work, we formally derive the optimal guidance term for Stochastic Interpolants (SI) by analyzing the value function's time evolution via the Backward Kolmogorov Equation, establishing a modified drift that theoretically guarantees sampling from a target distribution. We apply this framework to real-time control through the Streaming Stochastic Interpolant Policy (SSIP), which generalizes the deterministic Streaming Flow Policy (SFP). Unifying this guidance law with the streaming architecture enables fast and reactive control. To support diverse deployment needs, we propose two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
