RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality
Xiaohan Ding, Honghao Chen, Xiangyu Zhang, Jungong Han, Guiguang Ding

TL;DR
This paper introduces RepMLPNet, a hierarchical vision MLP architecture that incorporates local priors via Locality Injection, achieving a good accuracy-efficiency balance and effective transfer to semantic segmentation tasks.
Contribution
It proposes Locality Injection as a novel structural re-parameterization method and introduces RepMLPNet, a hierarchical MLP backbone for vision tasks.
Findings
Locality Injection effectively incorporates local priors into MLPs.
RepMLPNet outperforms other MLPs in accuracy-efficiency trade-offs.
RepMLPNet successfully transfers to Cityscapes semantic segmentation.
Abstract
Compared to convolutional layers, fully-connected (FC) layers are better at modeling the long-range dependencies but worse at capturing the local patterns, hence usually less favored for image recognition. In this paper, we propose a methodology, Locality Injection, to incorporate local priors into an FC layer via merging the trained parameters of a parallel conv kernel into the FC kernel. Locality Injection can be viewed as a novel Structural Re-parameterization method since it equivalently converts the structures via transforming the parameters. Based on that, we propose a multi-layer-perceptron (MLP) block named RepMLP Block, which uses three FC layers to extract features, and a novel architecture named RepMLPNet. The hierarchical design distinguishes RepMLPNet from the other concurrently proposed vision MLPs. As it produces feature maps of different levels, it qualifies as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Remote-Sensing Image Classification
