Universal 3D Shape Matching via Coarse-to-Fine Language Guidance

Qinfeng Xiao; Guofeng Mei; Bo Yang; Liying Zhang; Jian Zhang; Kit-lun Yick

arXiv:2602.19112·cs.CV·March 10, 2026

Universal 3D Shape Matching via Coarse-to-Fine Language Guidance

Qinfeng Xiao, Guofeng Mei, Bo Yang, Liying Zhang, Jian Zhang, Kit-lun Yick

PDF

Open Access

TL;DR

UniMatch is a versatile 3D shape matching framework that uses language-guided semantic cues in a coarse-to-fine manner, enabling dense correspondences across diverse object categories without relying on isometry.

Contribution

It introduces a novel coarse-to-fine approach leveraging large language and vision models for universal, non-isometric 3D shape correspondence without predefined part proposals.

Findings

01

Outperforms existing methods in diverse scenarios

02

Effective across multiple object categories

03

No need for predefined part proposals

Abstract

Establishing dense correspondences between shapes is a crucial task in computer vision and graphics, while prior approaches depend on near-isometric assumptions and homogeneous subject types (i.e., only operate for human shapes). However, building semantic correspondences for cross-category objects remains challenging and has received relatively little attention. To achieve this, we propose UniMatch, a semantic-aware, coarse-to-fine framework for constructing dense semantic correspondences between strongly non-isometric shapes without restricting object categories. The key insight is to lift "coarse" semantic cues into "fine" correspondence, which is achieved through two stages. In the "coarse" stage, we perform class-agnostic 3D segmentation to obtain non-overlapping semantic parts and prompt multimodal large language models (MLLMs) to identify part names. Then, we employ pretrained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Multimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis