EasyInsert: A Data-Efficient and Generalizable Insertion Policy

Guanghe Li; Junming Zhao; Shengjie Wang; Yang Gao

arXiv:2505.16187·cs.RO·March 10, 2026

EasyInsert: A Data-Efficient and Generalizable Insertion Policy

Guanghe Li, Junming Zhao, Shengjie Wang, Yang Gao

PDF

Open Access

TL;DR

EasyInsert is a data-efficient, generalizable visual policy for robotic insertion that achieves high success rates on unseen objects in cluttered environments with minimal human supervision.

Contribution

It introduces a delta-pose regression approach for insertion, enabling scalable data collection and zero-shot generalization without reliance on CAD models or digital twins.

Findings

01

Over 90% success rate on 13 out of 15 unseen objects

02

Robust zero-shot performance in cluttered environments

03

Fast adaptation with minimal manual resets

Abstract

Robotic insertion is a highly challenging task that requires exceptional precision in cluttered environments. Existing methods often have poor generalization capabilities. They typically function in restricted and structured environments, and frequently fail when the plug and socket are far apart, when the scene is densely cluttered, or when handling novel objects. They also rely on strong assumptions such as access to CAD models or a digital twin in simulation. To address these limitations, we propose EasyInsert. Inspired by human intuition, it formulates insertion as a delta-pose regression problem, which unlocks an efficient, highly scalable data collection pipeline with minimal human labor to train an end-to-end visual policy. During execution, the visual policy predicts the relative pose between plug and socket to drive a multi-phase, coarse-to-fine insertion process. EasyInsert…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Interactive and Immersive Displays · Hand Gesture Recognition Systems