RepMLP: Re-parameterizing Convolutions into Fully-connected Layers for Image Recognition
Xiaohan Ding, Chunlong Xia, Xiangyu Zhang, Xiaojie Chu, Jungong Han,, Guiguang Ding

TL;DR
RepMLP introduces a re-parameterization technique that combines the benefits of fully-connected layers and convolutional layers, enhancing image recognition performance while maintaining efficiency.
Contribution
The paper presents RepMLP, a novel neural network block that merges convolutional local priors with fully-connected layers through structural re-parameterization, improving accuracy and speed.
Findings
Pure-MLP model on CIFAR rivals CNN performance.
Inserting RepMLP improves ResNet accuracy on ImageNet.
Enhances face recognition and semantic segmentation with lower FLOPs.
Abstract
We propose RepMLP, a multi-layer-perceptron-style neural network building block for image recognition, which is composed of a series of fully-connected (FC) layers. Compared to convolutional layers, FC layers are more efficient, better at modeling the long-range dependencies and positional patterns, but worse at capturing the local structures, hence usually less favored for image recognition. We propose a structural re-parameterization technique that adds local prior into an FC to make it powerful for image recognition. Specifically, we construct convolutional layers inside a RepMLP during training and merge them into the FC for inference. On CIFAR, a simple pure-MLP model shows performance very close to CNN. By inserting RepMLP in traditional CNN, we improve ResNets by 1.8% accuracy on ImageNet, 2.9% for face recognition, and 2.3% mIoU on Cityscapes with lower FLOPs. Our intriguing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Face recognition and analysis · Advanced Image and Video Retrieval Techniques
MethodsConvolution
