A Multi-Level Similarity Approach for Single-View Object Grasping: Matching, Planning, and Fine-Tuning
Hao Chen, Takuya Kiyokawa, Zhengtao Hu, Weiwei Wan, and Kensuke Harada

TL;DR
This paper presents a multi-level similarity matching framework for robust single-view object grasping in robotics, utilizing semantic, geometric, and dimensional features, along with novel descriptors and large language models to improve accuracy and generalization.
Contribution
It introduces a comprehensive similarity matching approach combining multiple features and novel descriptors, moving away from traditional learning-based methods for better robustness in unknown-object grasping.
Findings
Achieved robust grasping of unknown objects from a single view.
Enhanced similarity matching accuracy with the C-FPFH descriptor.
Demonstrated improved generalization over existing methods.
Abstract
Grasping unknown objects from a single view has remained a challenging topic in robotics due to the uncertainty of partial observation. Recent advances in large-scale models have led to benchmark solutions such as GraspNet-1Billion. However, such learning-based approaches still face a critical limitation in performance robustness for their sensitivity to sensing noise and environmental changes. To address this bottleneck in achieving highly generalized grasping, we abandon the traditional learning framework and introduce a new perspective: similarity matching, where similar known objects are utilized to guide the grasping of unknown target objects. We newly propose a method that robustly achieves unknown-object grasping from a single viewpoint through three key steps: 1) Leverage the visual features of the observed object to perform similarity matching with an existing database…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
