Dynamic R-CNN: Towards High Quality Object Detection via Dynamic Training
Hongkai Zhang, Hong Chang, Bingpeng Ma, Naiyan Wang, Xilin Chen

TL;DR
Dynamic R-CNN introduces a training method that adaptively adjusts label assignment and regression loss parameters during training, significantly improving high-quality object detection performance without extra computational cost.
Contribution
It proposes a novel dynamic training approach that automatically adjusts training criteria based on proposal statistics, addressing the inconsistency problem in fixed settings.
Findings
Achieves 1.9% AP improvement over baseline
Improves AP90 by 5.5% on MS COCO
No extra computational overhead
Abstract
Although two-stage object detectors have continuously advanced the state-of-the-art performance in recent years, the training process itself is far from crystal. In this work, we first point out the inconsistency problem between the fixed network settings and the dynamic training procedure, which greatly affects the performance. For example, the fixed label assignment strategy and regression loss function cannot fit the distribution change of proposals and thus are harmful to training high quality detectors. Consequently, we propose Dynamic R-CNN to adjust the label assignment criteria (IoU threshold) and the shape of regression loss function (parameters of SmoothL1 Loss) automatically based on the statistics of proposals during training. This dynamic design makes better use of the training samples and pushes the detector to fit more high quality samples. Specifically, our method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
MethodsAverage Pooling · Non Maximum Suppression · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Soft-NMS · 1x1 Convolution · Feature Pyramid Network · Region Proposal Network · Softmax
