Learning Better Keypoints for Multi-Object 6DoF Pose Estimation
Yangzheng Wu, Michael Greenspan

TL;DR
This paper introduces KeyGNet, a graph network that learns optimal keypoint locations for multi-object 6DoF pose estimation, significantly improving accuracy across multiple datasets by replacing heuristic keypoints.
Contribution
The paper proposes a novel learned keypoint selection method using KeyGNet, enhancing pose estimation accuracy and efficiency over traditional heuristic approaches.
Findings
Improved accuracy on all tested datasets.
Significant AR gains on BOP datasets.
Elimination of SISO-MIMO gap in multi-object training.
Abstract
We address the problem of keypoint selection, and find that the performance of 6DoF pose estimation methods can be improved when pre-defined keypoint locations are learned, rather than being heuristically selected as has been the standard approach. We found that accuracy and efficiency can be improved by training a graph network to select a set of disperse keypoints with similarly distributed votes. These votes, learned by a regression network to accumulate evidence for the keypoint locations, can be regressed more accurately compared to previous heuristic keypoint algorithms. The proposed KeyGNet, supervised by a combined loss measuring both Wasserstein distance and dispersion, learns the color and geometry features of the target objects to estimate optimal keypoint locations. Experiments demonstrate the keypoints selected by KeyGNet improved the accuracy for all evaluation metrics of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Learning Better Keypoints for Multi-Object 6DoF Pose Estimation· youtube
Taxonomy
TopicsRobot Manipulation and Learning · Human Pose and Action Recognition · Robotics and Sensor-Based Localization
