Steering Robots with Inference-Time Interactions
Yanwei Wang

TL;DR
This paper introduces a method to make pretrained robot policies steerable at inference time, allowing user interactions to correct errors and guide behavior without additional training.
Contribution
It proposes inference-time steering techniques that enable user-guided correction and editing of robot behaviors in pretrained policies without finetuning.
Findings
Enables switching between discrete skills via user interactions.
Allows editing of continuous motions to satisfy task constraints.
Corrects policy errors without additional training.
Abstract
Imitation learning has driven the development of generalist policies capable of autonomously solving multiple tasks. However, when a pretrained policy makes errors during deployment, there are limited mechanisms for users to correct its behavior. While collecting additional data for finetuning can address such issues, doing so for each downstream use case is inefficient at deployment. My research proposes an alternative: keeping pretrained policies frozen as a fixed skill repertoire while allowing user interactions to guide behavior generation toward user preferences at inference time. By making pretrained policies steerable, users can help correct policy errors when the model struggles to generalize-without needing to finetune the policy. Specifically, I propose (1) inference-time steering, which leverages user interactions to switch between discrete skills, and (2) task and motion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsControl and Dynamics of Mobile Robots
