One-Shot Manipulation Strategy Learning by Making Contact Analogies

Yuyao Liu; Jiayuan Mao; Joshua Tenenbaum; Tomás Lozano-Pérez; Leslie Pack Kaelbling

arXiv:2411.09627·cs.RO·March 25, 2025

TL;DR

MAGIC is a one-shot manipulation learning method that transfers a demonstrated contact strategy to novel objects by combining global shape matching on pretrained neural features with local curvature analysis.

Contribution

Introduces MAGIC, a two-stage contact-point matching approach for one-shot manipulation learning: global shape matching with pretrained neural features, followed by local curvature analysis to keep contact points precise and physically plausible.

Findings

01 Outperforms existing methods in manipulation tasks

02 Achieves faster runtime and better generalization

03 Successfully applies to scooping, hanging, and hooking tasks

Abstract

We present a novel approach, MAGIC (manipulation analogies for generalizable intelligent contacts), for one-shot learning of manipulation strategies with fast and extensive generalization to novel objects. By leveraging a reference action trajectory, MAGIC effectively identifies similar contact points and sequences of actions on novel objects to replicate a demonstrated strategy, such as using different hooks to retrieve distant objects of different shapes and sizes. Our method is based on a two-stage contact-point matching process that combines global shape matching using pretrained neural features with local curvature analysis to ensure precise and physically plausible contact points. We experiment with three tasks including scooping, hanging, and hooking objects. MAGIC demonstrates superior performance over existing methods, achieving significant improvements in runtime speed and…
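The two-stage matching idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `match_contact_point`, the use of cosine similarity, the top-`k` candidate shortlist, and the scalar curvature comparison are all assumptions chosen for illustration. It only assumes that per-point features and per-point curvature values are already available for both objects.

```python
import numpy as np

def match_contact_point(ref_feat, tgt_feats, ref_curv, tgt_curvs, k=5):
    """Pick the target point most analogous to a demonstrated contact point.

    Hypothetical helper, not from the paper.
    ref_feat:  (D,)   feature vector at the demonstrated contact point
    tgt_feats: (N, D) per-point feature vectors on the novel object
    ref_curv:  float  curvature at the demonstrated contact point
    tgt_curvs: (N,)   per-point curvature on the novel object
    """
    ref_feat = np.asarray(ref_feat, dtype=float)
    tgt_feats = np.asarray(tgt_feats, dtype=float)
    tgt_curvs = np.asarray(tgt_curvs, dtype=float)

    # Stage 1 (global, assumed): shortlist the k target points whose
    # features have the highest cosine similarity to the reference feature.
    ref = ref_feat / np.linalg.norm(ref_feat)
    tgt = tgt_feats / np.linalg.norm(tgt_feats, axis=1, keepdims=True)
    sim = tgt @ ref
    k = min(k, len(sim))
    candidates = np.argsort(-sim)[:k]

    # Stage 2 (local, assumed): among the shortlist, keep the point whose
    # curvature is closest to the curvature at the reference contact.
    best = candidates[np.argmin(np.abs(tgt_curvs[candidates] - ref_curv))]
    return int(best)
```

In this sketch, the feature stage narrows the search to regions that look globally similar, and the curvature stage then breaks ties using local geometry, so a point with slightly lower feature similarity can still win if its local shape matches the demonstrated contact better.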

Taxonomy

Topics: Robot Manipulation and Learning

Methods: SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings