TL;DR
iTeach introduces an interactive, failure-driven framework for real-time robot perception adaptation using minimal human supervision, significantly improving segmentation and manipulation success in diverse real-world environments.
Contribution
The paper presents a novel interactive teaching approach that enables deployment-time perception adaptation with minimal annotation effort through failure-driven sampling and semi-supervised labeling.
Findings
Significant improvement in object segmentation performance after adaptation.
Enhanced grasping and pick-and-place success rates in real-world experiments.
Effective in-the-wild perception adaptation with limited human supervision.
Abstract
Robotic perception models often fail when deployed in real-world environments due to out-of-distribution conditions such as clutter, occlusion, and novel object instances. Existing approaches address this gap through offline data collection and retraining, which are slow and do not resolve deployment-time failures. We propose iTeach, a failure-driven interactive teaching framework for adapting robot perception in the wild. A co-located human observes model predictions during deployment, identifies failure cases, and performs short human-object interaction (HumanPlay) to expose informative object configurations while recording RGB-D sequences. To minimize annotation effort, iTeach employs a Few-Shot Semi- Supervised (FS3) labeling strategy, where only the final frame of a short interaction sequence is annotated using hands-free eye-gaze and voice commands, and labels are propagated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
