DFAM-DETR: Deformable feature based attention mechanism DETR on slender object detection
Wen Feng, Wang Mei, Hu Xiaojie

TL;DR
This paper introduces DFAM-DETR, an improved deformable transformer-based object detector that enhances slender object detection accuracy and efficiency by integrating deformable convolution and attention mechanisms.
Contribution
It proposes the DFAM module to augment Deformable DETR, significantly improving slender object detection performance over existing methods.
Findings
Achieves superior detection accuracy on slender objects.
Outperforms existing CNN-based and transformer-based detectors.
Enhances detection efficiency with adaptive sampling points.
Abstract
Object detection is one of the most significant aspects of computer vision, and it has achieved substantial results in a variety of domains. It is worth noting that there are few studies focusing on slender object detection. CNNs are widely employed in object detection, however it performs poorly on slender object detection due to the fixed geometric structure and sampling points. In comparison, Deformable DETR has the ability to obtain global to specific features. Even though it outperforms the CNNs in slender objects detection accuracy and efficiency, the results are still not satisfactory. Therefore, we propose Deformable Feature based Attention Mechanism (DFAM) to increase the slender object detection accuracy and efficiency of Deformable DETR. The DFAM has adaptive sampling points of deformable convolution and attention mechanism that aggregate information from the entire input…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Industrial Vision Systems and Defect Detection · Visual Attention and Saliency Detection
MethodsLinear Layer · Byte Pair Encoding · Attention Is All You Need · Position-Wise Feed-Forward Layer · Dense Connections · Softmax · Label Smoothing · Feedforward Network · Dropout · Deformable Attention Module
