DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution
Siyuan Qiao, Liang-Chieh Chen, Alan Yuille

TL;DR
DetectoRS introduces recursive feature pyramids and switchable atrous convolutions to enhance object detection, achieving state-of-the-art results on COCO with improved accuracy for detection, segmentation, and panoptic tasks.
Contribution
It proposes novel recursive and switchable convolutional modules that significantly boost object detection performance.
Findings
Achieves 55.7% box AP on COCO test-dev
Improves mask AP to 48.5% for instance segmentation
Reaches 50.0% PQ for panoptic segmentation
Abstract
Many modern object detectors demonstrate outstanding performances by using the mechanism of looking and thinking twice. In this paper, we explore this mechanism in the backbone design for object detection. At the macro level, we propose Recursive Feature Pyramid, which incorporates extra feedback connections from Feature Pyramid Networks into the bottom-up backbone layers. At the micro level, we propose Switchable Atrous Convolution, which convolves the features with different atrous rates and gathers the results using switch functions. Combining them results in DetectoRS, which significantly improves the performances of object detection. On COCO test-dev, DetectoRS achieves state-of-the-art 55.7% box AP for object detection, 48.5% mask AP for instance segmentation, and 50.0% PQ for panoptic segmentation. The code is made publicly available.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Visual Attention and Saliency Detection
MethodsSigmoid Activation · Dilated Convolution · Global Average Pooling · Average Pooling · 1x1 Convolution · Switchable Atrous Convolution · Recursive Feature Pyramid · Convolution
