SAR-Net: Shape Alignment and Recovery Network for Category-level 6D Object Pose and Size Estimation
Haitao Lin, Zichang Liu, Chilam Cheang, Yanwei Fu, Guodong Guo,, Xiangyang Xue

TL;DR
This paper introduces SAR-Net, a shape alignment-based method for category-level 6D object pose and size estimation from a single scene image and point cloud, without needing real pose annotations, and demonstrates its effectiveness on benchmarks and robotic grasping tasks.
Contribution
SAR-Net is a novel framework that leverages shape alignment and symmetry modeling for accurate pose and size estimation without external real pose-annotated data.
Findings
Competitive performance on NOCS benchmark
Effective in robotic grasping tasks
Lightweight model with real-time potential
Abstract
Given a single scene image, this paper proposes a method of Category-level 6D Object Pose and Size Estimation (COPSE) from the point cloud of the target object, without external real pose-annotated training data. Specifically, beyond the visual cues in RGB images, we rely on the shape information predominately from the depth (D) channel. The key idea is to explore the shape alignment of each instance against its corresponding category-level template shape, and the symmetric correspondence of each object category for estimating a coarse 3D object shape. Our framework deforms the point cloud of the category-level template shape to align the observed instance point cloud for implicitly representing its 3D rotation. Then we model the symmetric correspondence by predicting symmetric point cloud from the partially observed point cloud. The concatenation of the observed point cloud and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Robot Manipulation and Learning · Human Pose and Action Recognition
MethodsALIGN
