Generalizable Articulated Object Perception with Superpoints
Qiaojun Yu, Ce Hao, Xibin Yuan, Li Zhang, Liu Liu, Yukang Huo, Rohit, Agarwal, Cewu Lu

TL;DR
This paper presents a superpoint-based perception method that enhances 3D part segmentation of articulated objects, leveraging learnable superpoints, SAM, and transformer decoders for improved accuracy across categories.
Contribution
Introduces a novel superpoint generation technique combined with SAM and transformer decoders for superior 3D part segmentation of articulated objects.
Findings
Outperforms state-of-the-art in cross-category segmentation
Achieves 77.9% AP50 on seen categories
Achieves 39.3% AP50 on unseen categories
Abstract
Manipulating articulated objects with robotic arms is challenging due to the complex kinematic structure, which requires precise part segmentation for efficient manipulation. In this work, we introduce a novel superpoint-based perception method designed to improve part segmentation in 3D point clouds of articulated objects. We propose a learnable, part-aware superpoint generation technique that efficiently groups points based on their geometric and semantic similarities, resulting in clearer part boundaries. Furthermore, by leveraging the segmentation capabilities of the 2D foundation model SAM, we identify the centers of pixel regions and select corresponding superpoints as candidate query points. Integrating a query-based transformer decoder further enhances our method's ability to achieve precise part segmentation. Experimental results on the GAPartNet dataset show that our method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Object Detection Techniques · Image Retrieval and Classification Techniques · Advanced Vision and Imaging
MethodsSegment Anything Model
