CORN: Contact-based Object Representation for Nonprehensile Manipulation of General Unseen Objects
Yoonyoung Cho, Junhyek Han, Yoontae Cho, Beomjoon Kim

TL;DR
This paper introduces CORN, a contact-based object representation and pretraining method using a patch-based transformer, enabling efficient zero-shot transfer of nonprehensile manipulation policies to unseen objects.
Contribution
It presents a novel contact-based object representation and a scalable pretraining pipeline that improves generalization and efficiency in nonprehensile manipulation tasks.
Findings
Effective zero-shot transfer to real-world objects
Significant reduction in training time and data requirements
Outperforms previous shape representation methods
Abstract
Nonprehensile manipulation is essential for manipulating objects that are too thin, large, or otherwise ungraspable in the wild. To sidestep the difficulty of contact modeling in conventional modeling-based approaches, reinforcement learning (RL) has recently emerged as a promising alternative. However, previous RL approaches either lack the ability to generalize over diverse object shapes, or use simple action primitives that limit the diversity of robot motions. Furthermore, using RL over diverse object geometry is challenging due to the high cost of training a policy that takes in high-dimensional sensory inputs. We propose a novel contact-based object representation and pretraining pipeline to tackle this. To enable massively parallel training, we leverage a lightweight patch-based transformer architecture for our encoder that processes point clouds, thus scaling our training across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction · Robot Manipulation and Learning · 3D Surveying and Cultural Heritage
