DSLA: Dynamic smooth label assignment for efficient anchor-free object detection
Hu Su, Yonghao He, Rui Jiang, Jiabin Zhang, Wei Zou, Bin Fan

TL;DR
This paper introduces DSLA, a novel label assignment method for anchor-free object detection that uses dynamic, smooth labels and IoU prediction to improve accuracy and simplify architecture.
Contribution
The paper proposes a dynamic smooth label assignment strategy that unifies quality estimation with classification, addressing inconsistencies in anchor-free detectors.
Findings
Significant accuracy improvements on MS COCO benchmark.
Reduction of inconsistencies between classification scores and localization quality.
Simplification of detector architecture by merging quality estimation into classification.
Abstract
Anchor-free detectors basically formulate object detection as dense classification and regression. For popular anchor-free detectors, it is common to introduce an individual prediction branch to estimate the quality of localization. The following inconsistencies are observed when we delve into the practices of classification and quality estimation. Firstly, for some adjacent samples which are assigned completely different labels, the trained model would produce similar classification scores. This violates the training objective and leads to performance degradation. Secondly, it is found that detected bounding boxes with higher confidences contrarily have smaller overlaps with the corresponding ground-truth. Accurately localized bounding boxes would be suppressed by less accurate ones in the Non-Maximum Suppression (NMS) procedure. To address the inconsistency problems, the Dynamic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
MethodsConvolution · Non Maximum Suppression · 1x1 Convolution · Feature Pyramid Network · FCOS
