USEEK: Unsupervised SE(3)-Equivariant 3D Keypoints for Generalizable Manipulation
Zhengrong Xue, Zhecheng Yuan, Jiashun Wang, Xueqian Wang, Yang Gao,, Huazhe Xu

TL;DR
This paper introduces USEEK, an unsupervised SE(3)-equivariant keypoints method that enables robots to manipulate unseen intra-category objects in arbitrary poses by transferring functional knowledge from demonstrations.
Contribution
USEEK is the first unsupervised SE(3)-equivariant keypoints approach that achieves category-level generalization for manipulation tasks, decoupling keypoint discovery and detection.
Findings
USEEK produces semantically rich keypoints that transfer functional knowledge.
USEEK outperforms other representations in handling shape variance.
USEEK is robust with limited demonstrations and efficient during inference.
Abstract
Can a robot manipulate intra-category unseen objects in arbitrary poses with the help of a mere demonstration of grasping pose on a single object instance? In this paper, we try to address this intriguing challenge by using USEEK, an unsupervised SE(3)-equivariant keypoints method that enjoys alignment across instances in a category, to perform generalizable manipulation. USEEK follows a teacher-student structure to decouple the unsupervised keypoint discovery and SE(3)-equivariant keypoint detection. With USEEK in hand, the robot can infer the category-level task-relevant object frames in an efficient and explainable manner, enabling manipulation of any intra-category objects from and to any poses. Through extensive experiments, we demonstrate that the keypoints produced by USEEK possess rich semantics, thus successfully transferring the functional knowledge from the demonstration…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Robot Manipulation and Learning · Human Pose and Action Recognition
