Category-Level 6D Object Pose Estimation with Flexible Vector-Based   Rotation Representation

Wei Chen; Xi Jia; Zhongqun Zhang; Hyung Jin Chang; Linlin Shen,; Jinming Duan; Ales Leonardis

arXiv:2212.04632·cs.CV·January 31, 2023·1 cites

Category-Level 6D Object Pose Estimation with Flexible Vector-Based Rotation Representation

Wei Chen, Xi Jia, Zhongqun Zhang, Hyung Jin Chang, Linlin Shen,, Jinming Duan, Ales Leonardis

PDF

Open Access

TL;DR

This paper introduces a novel 3D graph convolution pipeline with a flexible vector-based rotation representation for improved category-level 6D object pose and size estimation from RGB-D images, achieving state-of-the-art results.

Contribution

It proposes a new vector-based rotation representation and a 3D graph convolution autoencoder for robust, category-level 6D pose estimation with enhanced generalization.

Findings

01

Achieves state-of-the-art performance on category-level pose estimation tasks.

02

The proposed rotation representation outperforms existing methods.

03

The pipeline demonstrates strong generalization capabilities.

Abstract

In this paper, we propose a novel 3D graph convolution based pipeline for category-level 6D pose and size estimation from monocular RGB-D images. The proposed method leverages an efficient 3D data augmentation and a novel vector-based decoupled rotation representation. Specifically, we first design an orientation-aware autoencoder with 3D graph convolution for latent feature learning. The learned latent feature is insensitive to point shift and size thanks to the shift and scale-invariance properties of the 3D graph convolution. Then, to efficiently decode the rotation information from the latent feature, we design a novel flexible vector-based decomposable rotation representation that employs two decoders to complementarily access the rotation information. The proposed rotation representation has two major advantages: 1) decoupled characteristic that makes the rotation estimation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Robot Manipulation and Learning · Advanced Neural Network Applications

MethodsConvolution