GMatch: A Lightweight, Geometry-Constrained Keypoint Matcher for Zero-Shot 6DoF Pose Estimation in Robotic Grasp Tasks
Ming Yang, Haoran Li

TL;DR
GMatch is a lightweight, geometry-constrained keypoint matcher designed for efficient 6DoF pose estimation on resource-limited robotic platforms, achieving high accuracy and real-world applicability.
Contribution
The paper introduces GMatch, a novel, efficient keypoint matching framework that incorporates geometric constraints for robust pose estimation on embedded devices.
Findings
GMatch outperforms existing keypoint matchers on benchmark datasets.
GMatch approaches state-of-the-art zero-shot methods on texture-rich objects.
GMatch enables high success rates in real-world robotic grasp experiments.
Abstract
6DoF object pose estimation is fundamental to robotic grasp tasks. While recent learning-based methods achieve high accuracy, their computational demands hinder deployment on resource-constrained mobile platforms. In this work, we revisit the classical keypoint matching paradigm and propose GMatch, a lightweight, geometry-constrained keypoint matcher that can run efficiently on embedded CPU-only platforms. GMatch works with keypoint descriptors and it uses a set of geometric constraints to establishes inherent ambiguities between features extracted by descriptors, thus giving a globally consistent correspondences from which 6DoF pose can be easily solved. We benchmark GMatch on the HOPE and YCB-Video datasets, where our method beats existing keypoint matchers (both feature-based and geometry-based) among three commonly used descriptors and approaches the SOTA zero-shot method on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Robotics and Sensor-Based Localization · Human Pose and Action Recognition
MethodsHigh-Order Proximity preserved Embedding · Sparse Evolutionary Training
