InsertionNet 2.0: Minimal Contact Multi-Step Insertion Using Multimodal Multiview Sensory Input
Oren Spector, Vladimir Tchuiev, Dotan Di Castro

TL;DR
InsertionNet 2.0 enables robots to learn multi-step insertion tasks rapidly and safely using multimodal sensory input, contrastive learning, and one-shot relation networks, achieving high success rates in real-world experiments.
Contribution
The paper introduces InsertionNet 2.0, a novel method combining multimodal perception, contrastive learning, and one-shot relation networks for efficient, robust, and minimal-contact multi-step insertion learning.
Findings
Achieved over 97.5% success rate in 16 real-life insertion tasks.
Demonstrated zero-shot success on unseen insertion tasks.
Reduced contact and execution time during insertion.
Abstract
We address the problem of devising the means for a robot to rapidly and safely learn insertion skills with just a few human interventions and without hand-crafted rewards or demonstrations. Our InsertionNet version 2.0 provides an improved technique to robustly cope with a wide range of use-cases featuring different shapes, colors, initial poses, etc. In particular, we present a regression-based method based on multimodal input from stereo perception and force, augmented with contrastive learning for the efficient learning of valuable features. In addition, we introduce a one-shot learning technique for insertion, which relies on a relation network scheme to better exploit the collected data and to support multi-step insertion tasks. Our method improves on the results obtained with the original InsertionNet, achieving an almost perfect score (above 97.5 on 200 trials) in 16…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Hand Gesture Recognition Systems · Muscle activation and electromyography studies
MethodsContrastive Learning
