TL;DR
Rewind-IL is a training-free online safeguard framework that detects failures in imitation learning policies and respawns the robot at safe states, improving reliability in long-horizon manipulation tasks.
Contribution
It introduces a novel zero-shot failure detector and a state-respawning mechanism using vision-language models and policy checkpoints, enhancing imitation learning robustness.
Findings
Rewind-IL improves failure detection accuracy in real-world tasks.
The framework enables recovery and continued operation after failures.
Experiments show increased reliability in long-horizon manipulation tasks.
Abstract
Imitation learning has enabled robots to acquire complex visuomotor manipulation skills from demonstrations, but deployment failures remain a major obstacle, especially for long-horizon action-chunked policies. Once execution drifts off the demonstration manifold, these policies often continue producing locally plausible actions without recovering from the failure. Existing runtime monitors either require failure data, over-trigger under benign feature drift, or stop at failure detection without providing a recovery mechanism. We present Rewind-IL, a training-free online safeguard framework for generative action-chunked imitation policies. Rewind-IL combines a zero-shot failure detector based on Temporal Inter-chunk Discrepancy Estimate (TIDE), calibrated with split conformal prediction, with a state-respawning mechanism that returns the robot to a semantically verified safe…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
