A Multi-Level Similarity Approach for Single-View Object Grasping: Matching, Planning, and Fine-Tuning

Hao Chen; Takuya Kiyokawa; Zhengtao Hu; Weiwei Wan; and Kensuke Harada

arXiv:2507.11938·cs.RO·July 17, 2025

A Multi-Level Similarity Approach for Single-View Object Grasping: Matching, Planning, and Fine-Tuning

Hao Chen, Takuya Kiyokawa, Zhengtao Hu, Weiwei Wan, and Kensuke Harada

PDF

TL;DR

This paper presents a multi-level similarity matching framework for robust single-view object grasping in robotics, utilizing semantic, geometric, and dimensional features, along with novel descriptors and large language models to improve accuracy and generalization.

Contribution

It introduces a comprehensive similarity matching approach combining multiple features and novel descriptors, moving away from traditional learning-based methods for better robustness in unknown-object grasping.

Findings

01

Achieved robust grasping of unknown objects from a single view.

02

Enhanced similarity matching accuracy with the C-FPFH descriptor.

03

Demonstrated improved generalization over existing methods.

Abstract

Grasping unknown objects from a single view has remained a challenging topic in robotics due to the uncertainty of partial observation. Recent advances in large-scale models have led to benchmark solutions such as GraspNet-1Billion. However, such learning-based approaches still face a critical limitation in performance robustness for their sensitivity to sensing noise and environmental changes. To address this bottleneck in achieving highly generalized grasping, we abandon the traditional learning framework and introduce a new perspective: similarity matching, where similar known objects are utilized to guide the grasping of unknown target objects. We newly propose a method that robustly achieves unknown-object grasping from a single viewpoint through three key steps: 1) Leverage the visual features of the observed object to perform similarity matching with an existing database…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.