DFAM-DETR: Deformable feature based attention mechanism DETR on slender   object detection

Wen Feng; Wang Mei; Hu Xiaojie

arXiv:2204.10667·cs.CV·April 25, 2022

DFAM-DETR: Deformable feature based attention mechanism DETR on slender object detection

Wen Feng, Wang Mei, Hu Xiaojie

PDF

Open Access

TL;DR

This paper introduces DFAM-DETR, an improved deformable transformer-based object detector that enhances slender object detection accuracy and efficiency by integrating deformable convolution and attention mechanisms.

Contribution

It proposes the DFAM module to augment Deformable DETR, significantly improving slender object detection performance over existing methods.

Findings

01

Achieves superior detection accuracy on slender objects.

02

Outperforms existing CNN-based and transformer-based detectors.

03

Enhances detection efficiency with adaptive sampling points.

Abstract

Object detection is one of the most significant aspects of computer vision, and it has achieved substantial results in a variety of domains. It is worth noting that there are few studies focusing on slender object detection. CNNs are widely employed in object detection, however it performs poorly on slender object detection due to the fixed geometric structure and sampling points. In comparison, Deformable DETR has the ability to obtain global to specific features. Even though it outperforms the CNNs in slender objects detection accuracy and efficiency, the results are still not satisfactory. Therefore, we propose Deformable Feature based Attention Mechanism (DFAM) to increase the slender object detection accuracy and efficiency of Deformable DETR. The DFAM has adaptive sampling points of deformable convolution and attention mechanism that aggregate information from the entire input…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Industrial Vision Systems and Defect Detection · Visual Attention and Saliency Detection

MethodsLinear Layer · Byte Pair Encoding · Attention Is All You Need · Position-Wise Feed-Forward Layer · Dense Connections · Softmax · Label Smoothing · Feedforward Network · Dropout · Deformable Attention Module