Diversity-Guided MLP Reduction for Efficient Large Vision Transformers
Chengchao Shen, Hourun Zhu, Gongfan Fang, Jianxin Wang, Xinchao Wang

TL;DR
This paper introduces a method to significantly reduce parameters and computational costs of large vision transformers by pruning redundant neurons in MLP modules while maintaining performance, using a diversity-guided approach.
Contribution
We propose a novel Diversity-Guided MLP Reduction (DGMR) method that effectively prunes MLP neurons in vision transformers with minimal performance loss, leveraging weight diversity preservation.
Findings
Achieves over 57% reduction in parameters and FLOPs with negligible performance loss.
Maintains performance with only 0.06% of LAION-2B data for training.
Reduces parameters by 71.5% on EVA-CLIP-E without accuracy degradation.
Abstract
Transformer models achieve excellent scaling property, where the performance is improved with the increment of model capacity. However, large-scale model parameters lead to an unaffordable cost of computing and memory. We analyze popular transformer architectures and find that multilayer perceptron (MLP) modules take up the majority of model parameters. To this end, we focus on the recoverability of the compressed models and propose a Diversity-Guided MLP Reduction (DGMR) method to significantly reduce the parameters of large vision transformers with only negligible performance degradation. Specifically, we conduct a Gram-Schmidt weight pruning strategy to eliminate redundant neurons of MLP hidden layer, while preserving weight diversity for better performance recover during distillation. Compared to the model trained from scratch, our pruned model only requires 0.06\% data of LAION-2B…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Advanced Neural Network Applications · Image Enhancement Techniques
MethodsFocus · Pruning
