Deformable Attention Mechanisms Applied to Object Detection, case of Remote Sensing
Anasse Boutayeb, Iyad Lahsen-cherif, Ahmed El Khadimi

TL;DR
This paper applies deformable attention-based Transformer models to remote sensing object detection, demonstrating high accuracy on optical and SAR datasets, advancing deep learning methods in geospatial image analysis.
Contribution
It introduces the use of Deformable-DETR architecture for remote sensing object detection across optical and SAR images, showing improved performance over existing models.
Findings
F1 score of 95.12% on optical dataset
F1 score of 94.54% on SAR dataset
Outperforms CNN and standard Transformer models
Abstract
Object detection has recently seen an interesting trend in terms of the most innovative research work, this task being of particular importance in the field of remote sensing, given the consistency of these images in terms of geographical coverage and the objects present. Furthermore, Deep Learning (DL) models, in particular those based on Transformers, are especially relevant for visual computing tasks in general, and target detection in particular. Thus, the present work proposes an application of Deformable-DETR model, a specific architecture using deformable attention mechanisms, on remote sensing images in two different modes, especially optical and Synthetic Aperture Radar (SAR). To achieve this objective, two datasets are used, one optical, which is Pleiades Aircraft dataset, and the other SAR, in particular SAR Ship Detection Dataset (SSDD). The results of a 10-fold stratified…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrared Target Detection Methodologies · CCD and CMOS Imaging Sensors
MethodsSoftmax · Attention Is All You Need
