EfficientDet: Scalable and Efficient Object Detection
Mingxing Tan, Ruoming Pang, Quoc V. Le

TL;DR
EfficientDet introduces a scalable and efficient object detection architecture that combines a novel BiFPN for multiscale feature fusion and a compound scaling method, achieving state-of-the-art accuracy with significantly reduced computational resources.
Contribution
The paper presents a new family of object detectors, EfficientDet, with a novel BiFPN architecture and a compound scaling method for improved efficiency and accuracy.
Findings
EfficientDet-D7 achieves 55.1 AP on COCO with 77M parameters.
EfficientDet is 4x-9x smaller and uses 13x-42x fewer FLOPs than previous detectors.
EfficientDet outperforms prior art across various resource constraints.
Abstract
Model efficiency has become increasingly important in computer vision. In this paper, we systematically study neural network architecture design choices for object detection and propose several key optimizations to improve efficiency. First, we propose a weighted bi-directional feature pyramid network (BiFPN), which allows easy and fast multiscale feature fusion; Second, we propose a compound scaling method that uniformly scales the resolution, depth, and width for all backbone, feature network, and box/class prediction networks at the same time. Based on these optimizations and better backbones, we have developed a new family of object detectors, called EfficientDet, which consistently achieve much better efficiency than prior art across a wide spectrum of resource constraints. In particular, with single model and single-scale, our EfficientDet-D7 achieves state-of-the-art 55.1 AP on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
EfficientDet: Scalable and Efficient Object Detection· youtube
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
MethodsCosine Annealing · RMSProp · Depthwise Convolution · Pointwise Convolution · Depthwise Separable Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Sigmoid Activation · Batch Normalization · Random Horizontal Flip · Image Scale Augmentation
