A Novel Structure-Agnostic Multi-Objective Approach for Weight-Sharing Compression in Deep Neural Networks
Rasa Khosrowshahli, Shahryar Rahnamayan, Beatrice Ombuki-Berman

TL;DR
This paper introduces a model- and layer-independent multi-objective evolutionary algorithm for weight-sharing compression in deep neural networks, achieving significant memory reduction across multiple datasets without retraining shared weights.
Contribution
It proposes a novel, architecture-agnostic compression framework using uniform quantization and Pareto optimization to enhance neural network compression efficiency.
Findings
Achieves up to 14.98x memory reduction on CIFAR-10
Reduces network size by up to 12.99x on CIFAR-100
Attains 8.58x compression on ImageNet
Abstract
Deep neural networks suffer from storing millions and billions of weights in memory post-training, making challenging memory-intensive models to deploy on embedded devices. The weight-sharing technique is one of the popular compression approaches that use fewer weight values and share across specific connections in the network. In this paper, we propose a multi-objective evolutionary algorithm (MOEA) based compression framework independent of neural network architecture, dimension, task, and dataset. We use uniformly sized bins to quantize network weights into a single codebook (lookup table) for efficient weight representation. Using MOEA, we search for Pareto optimal bins by optimizing two objectives. Then, we apply the iterative merge technique to non-dominated Pareto frontier solutions by combining neighboring bins without degrading performance to decrease the number of bins and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Computing and Algorithms
