RDD: Robust Feature Detector and Descriptor using Deformable Transformer

Gonglin Chen; Tianwen Fu; Haiwei Chen; Wenbin Teng; Hanyuan Xiao; Yajie Zhao

arXiv:2505.08013·cs.CV·December 9, 2025

RDD: Robust Feature Detector and Descriptor using Deformable Transformer

Gonglin Chen, Tianwen Fu, Haiwei Chen, Wenbin Teng, Hanyuan Xiao, Yajie Zhao

PDF

TL;DR

The paper introduces RDD, a deformable transformer-based feature detector and descriptor that captures global context and geometric invariance, improving robustness in challenging scenarios like large viewpoint changes.

Contribution

It proposes a novel deformable transformer-based method for robust feature detection and description, and introduces new datasets and benchmarks for evaluation.

Findings

01

Outperforms state-of-the-art methods in sparse matching tasks

02

Capable of semi-dense matching under challenging conditions

03

Effective in large viewpoint and scale variation scenarios

Abstract

As a core step in structure-from-motion and SLAM, robust feature detection and description under challenging scenarios such as significant viewpoint changes remain unresolved despite their ubiquity. While recent works have identified the importance of local features in modeling geometric transformations, these methods fail to learn the visual cues present in long-range relationships. We present Robust Deformable Detector (RDD), a novel and robust keypoint detector/descriptor leveraging the deformable transformer, which captures global context and geometric invariance through deformable self-attention mechanisms. Specifically, we observed that deformable attention focuses on key locations, effectively reducing the search space complexity and modeling the geometric invariance. Furthermore, we collected an Air-to-Ground dataset for training in addition to the standard MegaDepth dataset.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSoftmax · Attention Is All You Need