PR-GCN: A Deep Graph Convolutional Network with Point Refinement for 6D   Pose Estimation

Guangyuan Zhou; Huiqun Wang; Jiaxin Chen; Di Huang

arXiv:2108.09916·cs.CV·August 24, 2021

PR-GCN: A Deep Graph Convolutional Network with Point Refinement for 6D Pose Estimation

Guangyuan Zhou, Huiqun Wang, Jiaxin Chen, Di Huang

PDF

Open Access

TL;DR

This paper introduces PR-GCN, a deep learning framework that enhances 6D pose estimation from RGB-D data by refining point clouds and effectively integrating multi-modal information, achieving state-of-the-art results.

Contribution

The paper presents a novel unified approach combining point cloud refinement and multi-modal graph convolutional networks for improved 6D pose estimation.

Findings

01

Achieves state-of-the-art performance on benchmark datasets.

02

Demonstrates effective refinement of 3D point clouds.

03

Shows generalization of modules to other frameworks.

Abstract

RGB-D based 6D pose estimation has recently achieved remarkable progress, but still suffers from two major limitations: (1) ineffective representation of depth data and (2) insufficient integration of different modalities. This paper proposes a novel deep learning approach, namely Graph Convolutional Network with Point Refinement (PR-GCN), to simultaneously address the issues above in a unified way. It first introduces the Point Refinement Network (PRN) to polish 3D point clouds, recovering missing parts with noise removed. Subsequently, the Multi-Modal Fusion Graph Convolutional Network (MMF-GCN) is presented to strengthen RGB-D combination, which captures geometry-aware inter-modality correlation through local information propagation in the graph convolutional network. Extensive experiments are conducted on three widely used benchmarks, and state-of-the-art performance is reached.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · 3D Surveying and Cultural Heritage · Robotics and Sensor-Based Localization