SAR-Net: Shape Alignment and Recovery Network for Category-level 6D   Object Pose and Size Estimation

Haitao Lin; Zichang Liu; Chilam Cheang; Yanwei Fu; Guodong Guo,; Xiangyang Xue

arXiv:2106.14193·cs.CV·April 12, 2022·5 cites

SAR-Net: Shape Alignment and Recovery Network for Category-level 6D Object Pose and Size Estimation

Haitao Lin, Zichang Liu, Chilam Cheang, Yanwei Fu, Guodong Guo,, Xiangyang Xue

PDF

Open Access

TL;DR

This paper introduces SAR-Net, a shape alignment-based method for category-level 6D object pose and size estimation from a single scene image and point cloud, without needing real pose annotations, and demonstrates its effectiveness on benchmarks and robotic grasping tasks.

Contribution

SAR-Net is a novel framework that leverages shape alignment and symmetry modeling for accurate pose and size estimation without external real pose-annotated data.

Findings

01

Competitive performance on NOCS benchmark

02

Effective in robotic grasping tasks

03

Lightweight model with real-time potential

Abstract

Given a single scene image, this paper proposes a method of Category-level 6D Object Pose and Size Estimation (COPSE) from the point cloud of the target object, without external real pose-annotated training data. Specifically, beyond the visual cues in RGB images, we rely on the shape information predominately from the depth (D) channel. The key idea is to explore the shape alignment of each instance against its corresponding category-level template shape, and the symmetric correspondence of each object category for estimating a coarse 3D object shape. Our framework deforms the point cloud of the category-level template shape to align the observed instance point cloud for implicitly representing its 3D rotation. Then we model the symmetric correspondence by predicting symmetric point cloud from the partially observed point cloud. The concatenation of the observed point cloud and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Robot Manipulation and Learning · Human Pose and Action Recognition

MethodsALIGN