XFeat: Accelerated Features for Lightweight Image Matching

Guilherme Potje; Felipe Cadar; Andre Araujo; Renato Martins; Erickson; R. Nascimento

arXiv:2404.19174·cs.CV·May 1, 2024

XFeat: Accelerated Features for Lightweight Image Matching

Guilherme Potje, Felipe Cadar, Andre Araujo, Renato Martins, Erickson, R. Nascimento

PDF

1 Repo

TL;DR

XFeat is a lightweight, fast, and versatile neural network architecture for local image feature detection and matching, suitable for resource-limited devices and supporting semi-dense matching with high accuracy.

Contribution

It introduces a novel, efficient architecture for local feature matching that supports semi-dense matching and runs in real-time on CPU without hardware optimization.

Findings

01

XFeat is up to 5x faster than existing methods.

02

It achieves comparable or better accuracy in pose estimation and localization.

03

It operates in real-time on inexpensive CPU hardware.

Abstract

We introduce a lightweight and accurate architecture for resource-efficient visual correspondence. Our method, dubbed XFeat (Accelerated Features), revisits fundamental design choices in convolutional neural networks for detecting, extracting, and matching local features. Our new model satisfies a critical need for fast and robust algorithms suitable to resource-limited devices. In particular, accurate image matching requires sufficiently large image resolutions - for this reason, we keep the resolution as large as possible while limiting the number of channels in the network. Besides, our model is designed to offer the choice of matching at the sparse or semi-dense levels, each of which may be more suitable for different downstream applications, such as visual navigation and augmented reality. Our model is the first to offer semi-dense matching efficiently, leveraging a novel match…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

verlab/accelerated_features
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings