TL;DR
This paper introduces a pose-guided imitation learning approach using object poses in SE(3) for robotic precise insertion, combining diffusion policies with RGBD cues to enhance robustness and data efficiency in contact-rich tasks.
Contribution
The paper proposes a novel diffusion policy framework utilizing SE(3) object poses and RGBD data for precise robotic insertion, improving robustness and generalization over existing methods.
Findings
Achieved high success rates on six real-robot tasks with minimal demonstrations.
Succeeded in tasks with clearances as small as 0.01 mm.
Demonstrated improved data efficiency and robustness compared to baselines.
Abstract
Imitation learning is promising for robotic manipulation, but \emph{precise insertion} in the real world remains difficult due to contact-rich dynamics, tight clearances, and limited demonstrations. Many existing visuomotor policies depend on high-dimensional RGB/point-cloud observations, which can be data-inefficient and generalize poorly under pose variations. In this paper, we study pose-guided imitation learning by using object poses in as compact, object-centric observations for precise insertion tasks. First, we propose a diffusion policy for precise insertion that observes the \emph{relative} pose of the source object with respect to the target object and predicts a future relative pose trajectory as its action. Second, to improve robustness to pose estimation noise, we augment the pose-guided policy with RGBD cues. Specifically, we introduce a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
