SPIN: An Empirical Evaluation on Sharing Parameters of Isotropic Networks
Chien-Yu Lin, Anish Prabhu, Thomas Merth, Sachin Mehta, Anurag Ranjan,, Maxwell Horton, and Mohammad Rastegari

TL;DR
This paper empirically evaluates parameter sharing strategies in isotropic neural networks like ConvMixer and vision transformers, proposing a new method that improves efficiency and accuracy over traditional scaling.
Contribution
It introduces a formal framework for weight sharing in isotropic networks and proposes a strategy that enhances model efficiency and performance.
Findings
Achieved 1.9x compression of ConvMixer with improved accuracy on ImageNet.
Provided a comprehensive empirical evaluation of weight sharing design choices.
Developed a qualitative understanding of weight sharing behavior in isotropic architectures.
Abstract
Recent isotropic networks, such as ConvMixer and vision transformers, have found significant success across visual recognition tasks, matching or outperforming non-isotropic convolutional neural networks (CNNs). Isotropic architectures are particularly well-suited to cross-layer weight sharing, an effective neural network compression technique. In this paper, we perform an empirical evaluation on methods for sharing parameters in isotropic networks (SPIN). We present a framework to formalize major weight sharing design decisions and perform a comprehensive empirical evaluation of this design space. Guided by our experimental results, we propose a weight sharing strategy to generate a family of models with better overall efficiency, in terms of FLOPs and parameters versus accuracy, compared to traditional scaling methods alone, for example compressing ConvMixer by 1.9x while improving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification · Advanced Neural Network Applications · CCD and CMOS Imaging Sensors
