ASpanFormer: Detector-Free Image Matching with Adaptive Span Transformer

Hongkai Chen; Zixin Luo; Lei Zhou; Yurun Tian; Mingmin Zhen; Tian; Fang; David Mckinnon; Yanghai Tsin; Long Quan

arXiv:2208.14201·cs.CV·August 31, 2022·6 cites

ASpanFormer: Detector-Free Image Matching with Adaptive Span Transformer

Hongkai Chen, Zixin Luo, Lei Zhou, Yurun Tian, Mingmin Zhen, Tian, Fang, David Mckinnon, Yanghai Tsin, Long Quan

PDF

Open Access 1 Repo

TL;DR

ASpanFormer is a novel detector-free image matching method using a hierarchical, adaptive span transformer that dynamically adjusts attention regions for improved robustness and accuracy across various benchmarks.

Contribution

The paper introduces a transformer-based image matcher with a novel adaptive attention span mechanism that dynamically determines search regions based on pixel uncertainty.

Findings

01

Achieves state-of-the-art accuracy on multiple benchmarks.

02

Effectively balances long-range dependencies and local detail.

03

Demonstrates robustness across diverse matching scenarios.

Abstract

Generating robust and reliable correspondences across images is a fundamental task for a diversity of applications. To capture context at both global and local granularity, we propose ASpanFormer, a Transformer-based detector-free matcher that is built on hierarchical attention structure, adopting a novel attention operation which is capable of adjusting attention span in a self-adaptive manner. To achieve this goal, first, flow maps are regressed in each cross attention phase to locate the center of search region. Next, a sampling grid is generated around the center, whose size, instead of being empirically configured as fixed, is adaptively computed from a pixel uncertainty estimated along with the flow map. Finally, attention is computed across two images within derived regions, referred to as attention span. By these means, we are able to not only maintain long-range dependencies,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

apple/ml-aspanformer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning