FlowCorrect: Efficient Interactive Correction of Generative Flow Policies for Robotic Manipulation
Edgar Welte, Yitian Shi, Rosa Wolf, Maximillian Gilles, Rania Rayyes

TL;DR
FlowCorrect is a modular approach that allows real-time, human-in-the-loop corrections to generative robotic manipulation policies, significantly improving success rates with minimal additional training.
Contribution
It introduces a novel interactive imitation learning method enabling deployment-time policy adaptation through sparse human corrections without retraining.
Findings
Achieves 80% success on previously failed cases with few corrections.
Preserves performance on previously learned tasks.
Enables fast, sample-efficient policy updates during deployment.
Abstract
Generative manipulation policies can fail catastrophically under deployment-time distribution shift, yet many failures are near-misses: the robot reaches almost-correct poses and would succeed with a small corrective motion. We propose FlowCorrect, a modular interactive imitation learning approach that enables deployment-time adaptation of flow-matching manipulation policies from sparse, relative human corrections without retraining. During execution, a human provides brief corrective pose nudges via a lightweight VR interface. FlowCorrect uses these sparse corrections to locally adapt the policy, improving actions without retraining the backbone while preserving the model performance on previously learned scenarios. We evaluate on a real-world robot across four tabletop tasks: pick-and-place, pouring, cup uprighting, and insertion. With a low correction budget, FlowCorrect achieves an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Social Robot Interaction and HRI
