EfficientNetV2: Smaller Models and Faster Training
Mingxing Tan, Quoc V. Le

TL;DR
EfficientNetV2 introduces a family of convolutional networks that train faster, are more parameter-efficient, and outperform previous models on image classification benchmarks, achieved through neural architecture search and progressive learning techniques.
Contribution
The paper presents EfficientNetV2, a new neural network family optimized for faster training and better efficiency, using training-aware architecture search and adaptive regularization.
Findings
EfficientNetV2 trains 5-11x faster than previous models.
Models are up to 6.8x smaller with improved accuracy.
Achieves 87.3% top-1 accuracy on ImageNet, surpassing ViT.
Abstract
This paper introduces EfficientNetV2, a new family of convolutional networks that have faster training speed and better parameter efficiency than previous models. To develop this family of models, we use a combination of training-aware neural architecture search and scaling, to jointly optimize training speed and parameter efficiency. The models were searched from the search space enriched with new ops such as Fused-MBConv. Our experiments show that EfficientNetV2 models train much faster than state-of-the-art models while being up to 6.8x smaller. Our training can be further sped up by progressively increasing the image size during training, but it often causes a drop in accuracy. To compensate for this accuracy drop, we propose to adaptively adjust regularization (e.g., dropout and data augmentation) as well, such that we can achieve both fast training and good accuracy. With…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗Camais03/camie-taggermodel· 36 dl· ♡ 6236 dl♡ 62
- 🤗timm/efficientnetv2_rw_m.agc_in1kmodel· 56k dl· ♡ 156k dl♡ 1
- 🤗timm/efficientnetv2_rw_s.ra2_in1kmodel· 10k dl· ♡ 110k dl♡ 1
- 🤗timm/efficientnetv2_rw_t.ra2_in1kmodel· 2.1k dl2.1k dl
- 🤗timm/gc_efficientnetv2_rw_t.agc_in1kmodel· 768 dl768 dl
- 🤗timm/tf_efficientnetv2_b0.in1kmodel· 42k dl· ♡ 242k dl♡ 2
- 🤗timm/tf_efficientnetv2_b1.in1kmodel· 3.5k dl3.5k dl
- 🤗timm/tf_efficientnetv2_b2.in1kmodel· 4.7k dl4.7k dl
- 🤗timm/tf_efficientnetv2_b3.in1kmodel· 2.1k dl2.1k dl
- 🤗timm/tf_efficientnetv2_b3.in21kmodel· 773 dl· ♡ 2773 dl♡ 2
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
MethodsPointwise Convolution · Depthwise Convolution · Depthwise Separable Convolution · Batch Normalization · 1x1 Convolution · Inverted Residual Block · EfficientNetV2 · Dropout
