Data Efficient Behavior Cloning for Fine Manipulation via Continuity-based Corrective Labels
Abhay Deshpande, Liyiming Ke, Quinn Pfeifer, Abhishek Gupta,, Siddhartha S. Srinivasa

TL;DR
This paper introduces CCIL, a novel imitation learning framework that uses continuity-based corrective labels to improve fine manipulation tasks on robots, effectively reducing errors caused by covariate shift.
Contribution
The paper presents CCIL, a new method that leverages local smoothness in contact-rich manipulation to generate corrective labels, enhancing imitation learning performance in real-world robotic tasks.
Findings
CCIL significantly improves imitation learning in contact-rich manipulation.
Local smoothness in real-world tasks enables effective use of CCIL.
Corrective labels are most beneficial in low-data regimes.
Abstract
We consider imitation learning with access only to expert demonstrations, whose real-world application is often limited by covariate shift due to compounding errors during execution. We investigate the effectiveness of the Continuity-based Corrective Labels for Imitation Learning (CCIL) framework in mitigating this issue for real-world fine manipulation tasks. CCIL generates corrective labels by learning a locally continuous dynamics model from demonstrations to guide the agent back toward expert states. Through extensive experiments on peg insertion and fine grasping, we provide the first empirical validation that CCIL can significantly improve imitation learning performance despite discontinuities present in contact-rich manipulation. We find that: (1) real-world manipulation exhibits sufficient local smoothness to apply CCIL, (2) generated corrective labels are most beneficial in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications
