GMatch: A Lightweight, Geometry-Constrained Keypoint Matcher for Zero-Shot 6DoF Pose Estimation in Robotic Grasp Tasks

Ming Yang; Haoran Li

arXiv:2505.16144·cs.CV·October 21, 2025

GMatch: A Lightweight, Geometry-Constrained Keypoint Matcher for Zero-Shot 6DoF Pose Estimation in Robotic Grasp Tasks

Ming Yang, Haoran Li

PDF

Open Access

TL;DR

GMatch is a lightweight, geometry-constrained keypoint matcher designed for efficient 6DoF pose estimation on resource-limited robotic platforms, achieving high accuracy and real-world applicability.

Contribution

The paper introduces GMatch, a novel, efficient keypoint matching framework that incorporates geometric constraints for robust pose estimation on embedded devices.

Findings

01

GMatch outperforms existing keypoint matchers on benchmark datasets.

02

GMatch approaches state-of-the-art zero-shot methods on texture-rich objects.

03

GMatch enables high success rates in real-world robotic grasp experiments.

Abstract

6DoF object pose estimation is fundamental to robotic grasp tasks. While recent learning-based methods achieve high accuracy, their computational demands hinder deployment on resource-constrained mobile platforms. In this work, we revisit the classical keypoint matching paradigm and propose GMatch, a lightweight, geometry-constrained keypoint matcher that can run efficiently on embedded CPU-only platforms. GMatch works with keypoint descriptors and it uses a set of geometric constraints to establishes inherent ambiguities between features extracted by descriptors, thus giving a globally consistent correspondences from which 6DoF pose can be easily solved. We benchmark GMatch on the HOPE and YCB-Video datasets, where our method beats existing keypoint matchers (both feature-based and geometry-based) among three commonly used descriptors and approaches the SOTA zero-shot method on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Robotics and Sensor-Based Localization · Human Pose and Action Recognition

MethodsHigh-Order Proximity preserved Embedding · Sparse Evolutionary Training