MATCHA:Towards Matching Anything

Fei Xue; Sven Elflein; Laura Leal-Taix\'e; Qunjie Zhou

arXiv:2501.14945·cs.CV·January 28, 2025

MATCHA:Towards Matching Anything

Fei Xue, Sven Elflein, Laura Leal-Taix\'e, Qunjie Zhou

PDF

Open Access

TL;DR

MATCHA introduces a unified feature model that leverages diffusion features, attention mechanisms, and object-level information to establish robust correspondences across diverse computer vision tasks, surpassing prior specialized methods.

Contribution

The paper presents the first unified feature model capable of handling geometric, semantic, and temporal matching tasks with a single approach, integrating multiple feature types for versatility.

Findings

01

MATCHA outperforms state-of-the-art methods across various matching tasks.

02

The model effectively fuses semantic and geometric features for robust correspondence.

03

Extensive experiments validate MATCHA's versatility and superior performance.

Abstract

Establishing correspondences across images is a fundamental challenge in computer vision, underpinning tasks like Structure-from-Motion, image editing, and point tracking. Traditional methods are often specialized for specific correspondence types, geometric, semantic, or temporal, whereas humans naturally identify alignments across these domains. Inspired by this flexibility, we propose MATCHA, a unified feature model designed to ``rule them all'', establishing robust correspondences across diverse matching tasks. Building on insights that diffusion model features can encode multiple correspondence types, MATCHA augments this capacity by dynamically fusing high-level semantic and low-level geometric features through an attention-based module, creating expressive, versatile, and robust features. Additionally, MATCHA integrates object-level features from DINOv2 to further boost…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsUser Authentication and Security Systems

MethodsDiffusion