Solving ImageNet: a Unified Scheme for Training any Backbone to Top   Results

Tal Ridnik; Hussam Lawen; Emanuel Ben-Baruch; Asaf Noy

arXiv:2204.03475·cs.CV·May 13, 2022·5 cites

Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results

Tal Ridnik, Hussam Lawen, Emanuel Ben-Baruch, Asaf Noy

PDF

Open Access 2 Repos 2 Models

TL;DR

This paper introduces USI, a unified, hyper-parameter-free training scheme for ImageNet that consistently improves performance across diverse architectures, transforming model training into an automatic, efficient process.

Contribution

The paper presents USI, a universal training scheme that eliminates the need for architecture-specific tuning, enabling consistent top results on ImageNet for various models.

Findings

01

USI outperforms previous state-of-the-art methods on all tested architectures.

02

USI reduces training complexity by removing hyper-parameter tuning.

03

USI enables fair comparison of different backbones on the speed-accuracy trade-off.

Abstract

ImageNet serves as the primary dataset for evaluating the quality of computer-vision models. The common practice today is training each architecture with a tailor-made scheme, designed and tuned by an expert. In this paper, we present a unified scheme for training any backbone on ImageNet. The scheme, named USI (Unified Scheme for ImageNet), is based on knowledge distillation and modern tricks. It requires no adjustments or hyper-parameters tuning between different models, and is efficient in terms of training times. We test USI on a wide variety of architectures, including CNNs, Transformers, Mobile-oriented and MLP-only. On all models tested, USI outperforms previous state-of-the-art results. Hence, we are able to transform training on ImageNet from an expert-oriented task to an automatic seamless routine. Since USI accepts any backbone and trains it to top results, it also enables to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques

MethodsKnowledge Distillation