DualPoseNet: Category-level 6D Object Pose and Size Estimation Using   Dual Pose Network with Refined Learning of Pose Consistency

Jiehong Lin; Zewei Wei; Zhihao Li; Songcen Xu; Kui Jia; Yuanqing Li

arXiv:2103.06526·cs.CV·August 17, 2021·6 cites

DualPoseNet: Category-level 6D Object Pose and Size Estimation Using Dual Pose Network with Refined Learning of Pose Consistency

Jiehong Lin, Zewei Wei, Zhihao Li, Songcen Xu, Kui Jia, Yuanqing Li

PDF

Open Access 1 Repo

TL;DR

DualPoseNet is a novel approach for category-level 6D object pose and size estimation that employs dual pose decoders with refined learning of pose consistency, achieving superior accuracy in cluttered scenes.

Contribution

The paper introduces DualPoseNet, which uses two parallel pose decoders with a shared encoder and a novel implicit decoder for refined pose prediction without test-time CAD models.

Findings

01

Outperforms existing methods with high precision

02

Effective use of dual decoders and pose consistency enforcement

03

Demonstrates robustness in cluttered scenes

Abstract

Category-level 6D object pose and size estimation is to predict full pose configurations of rotation, translation, and size for object instances observed in single, arbitrary views of cluttered scenes. In this paper, we propose a new method of Dual Pose Network with refined learning of pose consistency for this task, shortened as DualPoseNet. DualPoseNet stacks two parallel pose decoders on top of a shared pose encoder, where the implicit decoder predicts object poses with a working mechanism different from that of the explicit one; they thus impose complementary supervision on the training of pose encoder. We construct the encoder based on spherical convolutions, and design a module of Spherical Fusion wherein for a better embedding of pose-sensitive features from the appearance and shape observations. Given no testing CAD models, it is the novel introduction of the implicit decoder…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Gorilla-Lab-SCUT/DualPoseNet
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Human Pose and Action Recognition · 3D Shape Modeling and Analysis