AugFPN: Improving Multi-scale Feature Learning for Object Detection
Chaoxu Guo, Bin Fan, Qian Zhang, Shiming Xiang, Chunhong Pan

TL;DR
AugFPN introduces a novel feature pyramid architecture that enhances multi-scale feature learning for object detection, addressing FPN's limitations and improving detection accuracy across various models.
Contribution
The paper proposes AugFPN, a new feature pyramid architecture with three components, to better exploit multi-scale features and improve object detection performance.
Findings
Achieves 2.3 and 1.6 AP improvements in Faster R-CNN with ResNet50 and MobileNet-v2.
Enhances RetinaNet by 1.6 AP and FCOS by 0.9 AP with ResNet50.
Addresses FPN's design defects to fully exploit multi-scale features.
Abstract
Current state-of-the-art detectors typically exploit feature pyramid to detect objects at different scales. Among them, FPN is one of the representative works that build a feature pyramid by multi-scale features summation. However, the design defects behind prevent the multi-scale features from being fully exploited. In this paper, we begin by first analyzing the design defects of feature pyramid in FPN, and then introduce a new feature pyramid architecture named AugFPN to address these problems. Specifically, AugFPN consists of three components: Consistent Supervision, Residual Feature Augmentation, and Soft RoI Selection. AugFPN narrows the semantic gaps between features of different scales before feature fusion through Consistent Supervision. In feature fusion, ratio-invariant context information is extracted by Residual Feature Augmentation to reduce the information loss of feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
AugFPN: Improving Multi-Scale Feature Learning for Object Detection· youtube
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Video Surveillance and Tracking Methods
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Pointwise Convolution · Depthwise Convolution · Depthwise Separable Convolution · Residual Connection · Average Pooling · Inverted Residual Block · Max Pooling · Global Average Pooling
