DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR
Shilong Liu, Feng Li, Hao Zhang, Xiao Yang, Xianbiao Qi, Hang Su, Jun, Zhu, Lei Zhang

TL;DR
This paper introduces a novel query formulation for DETR using dynamic anchor boxes, improving training convergence and detection accuracy by explicitly incorporating box coordinates as queries.
Contribution
It proposes a new query design with dynamic anchor boxes that enhances DETR's performance and training efficiency, providing deeper understanding of query roles in DETR.
Findings
Achieves 45.7% AP on MS-COCO with ResNet50-DC5 in 50 epochs
Improves training convergence and detection accuracy
Validates the effectiveness through extensive experiments
Abstract
We present in this paper a novel query formulation using dynamic anchor boxes for DETR (DEtection TRansformer) and offer a deeper understanding of the role of queries in DETR. This new formulation directly uses box coordinates as queries in Transformer decoders and dynamically updates them layer-by-layer. Using box coordinates not only helps using explicit positional priors to improve the query-to-feature similarity and eliminate the slow training convergence issue in DETR, but also allows us to modulate the positional attention map using the box width and height information. Such a design makes it clear that queries in DETR can be implemented as performing soft ROI pooling layer-by-layer in a cascade manner. As a result, it leads to the best performance on MS-COCO benchmark among the DETR-like detection models under the same setting, e.g., AP 45.7\% using ResNet50-DC5 as backbone…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗IDEA-Research/dab-detr-resnet-50model· 3.1k dl· ♡ 23.1k dl♡ 2
- 🤗IDEA-Research/dab-detr-resnet-50-dc5model· 32 dl· ♡ 132 dl♡ 1
- 🤗IDEA-Research/dab-detr-resnet-50-dc5-fixxymodel· 10 dl10 dl
- 🤗IDEA-Research/dab-detr-resnet-50-pat3model· 10 dl10 dl
- 🤗IDEA-Research/dab-detr-resnet-50-dc5-pat3model· 13 dl· ♡ 113 dl♡ 1
Videos
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Residual Connection · Dense Connections · Absolute Position Encodings · Byte Pair Encoding · Dropout · Position-Wise Feed-Forward Layer
