Learning Better Keypoints for Multi-Object 6DoF Pose Estimation

Yangzheng Wu; Michael Greenspan

arXiv:2308.07827·cs.CV·November 13, 2023

Learning Better Keypoints for Multi-Object 6DoF Pose Estimation

Yangzheng Wu, Michael Greenspan

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces KeyGNet, a graph network that learns optimal keypoint locations for multi-object 6DoF pose estimation, significantly improving accuracy across multiple datasets by replacing heuristic keypoints.

Contribution

The paper proposes a novel learned keypoint selection method using KeyGNet, enhancing pose estimation accuracy and efficiency over traditional heuristic approaches.

Findings

01

Improved accuracy on all tested datasets.

02

Significant AR gains on BOP datasets.

03

Elimination of SISO-MIMO gap in multi-object training.

Abstract

We address the problem of keypoint selection, and find that the performance of 6DoF pose estimation methods can be improved when pre-defined keypoint locations are learned, rather than being heuristically selected as has been the standard approach. We found that accuracy and efficiency can be improved by training a graph network to select a set of disperse keypoints with similarly distributed votes. These votes, learned by a regression network to accumulate evidence for the keypoint locations, can be regressed more accurately compared to previous heuristic keypoint algorithms. The proposed KeyGNet, supervised by a combined loss measuring both Wasserstein distance and dispersion, learns the color and geometry features of the target objects to estimate optimal keypoint locations. Experiments demonstrate the keypoints selected by KeyGNet improved the accuracy for all evaluation metrics of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aaronwool/keygnet
pytorchOfficial

Videos

Learning Better Keypoints for Multi-Object 6DoF Pose Estimation· youtube

Taxonomy

TopicsRobot Manipulation and Learning · Human Pose and Action Recognition · Robotics and Sensor-Based Localization