ResMLP: Feedforward networks for image classification with data-efficient training
Hugo Touvron, Piotr Bojanowski, Mathilde Caron, Matthieu Cord,, Alaaeldin El-Nouby, Edouard Grave, Gautier Izacard, Armand Joulin, Gabriel, Synnaeve, Jakob Verbeek, Herv\'e J\'egou

TL;DR
ResMLP is a simple, fully feedforward residual architecture for image classification that achieves competitive accuracy with efficient training, and extends to self-supervised learning and machine translation.
Contribution
It introduces ResMLP, a novel architecture based solely on multi-layer perceptrons for image classification, demonstrating strong performance and versatility.
Findings
Achieves good accuracy/complexity trade-offs on ImageNet
Effective in self-supervised training setups
Adapts well to machine translation tasks
Abstract
We present ResMLP, an architecture built entirely upon multi-layer perceptrons for image classification. It is a simple residual network that alternates (i) a linear layer in which image patches interact, independently and identically across channels, and (ii) a two-layer feed-forward network in which channels interact independently per patch. When trained with a modern training strategy using heavy data-augmentation and optionally distillation, it attains surprisingly good accuracy/complexity trade-offs on ImageNet. We also train ResMLP models in a self-supervised setup, to further remove priors from employing a labelled dataset. Finally, by adapting our model to machine translation we achieve surprisingly good results. We share pre-trained models and our code based on the Timm library.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗kadirnar/timm_model_listmodel· ♡ 1♡ 1
- 🤗timm/resmlp_12_224.fb_dinomodel· 58 dl58 dl
- 🤗timm/resmlp_12_224.fb_distilled_in1kmodel· 55 dl55 dl
- 🤗timm/resmlp_12_224.fb_in1kmodel· 2.6k dl2.6k dl
- 🤗timm/resmlp_24_224.fb_dinomodel· 41 dl41 dl
- 🤗timm/resmlp_24_224.fb_distilled_in1kmodel· 37 dl37 dl
- 🤗timm/resmlp_24_224.fb_in1kmodel· 105 dl105 dl
- 🤗timm/resmlp_36_224.fb_distilled_in1kmodel· 40 dl40 dl
- 🤗timm/resmlp_36_224.fb_in1kmodel· 67 dl· ♡ 167 dl♡ 1
- 🤗timm/resmlp_big_24_224.fb_distilled_in1kmodel· 37 dl37 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Imaging for Blood Diseases · Brain Tumor Detection and Classification · COVID-19 diagnosis using AI
MethodsDense Connections · Residual Connection · Feedforward Network · Refunds@Expedia|||How do I get a full refund from Expedia? · Class-MLP · Affine Operator · Residual Multi-Layer Perceptrons · LayerScale · Linear Layer
