A Novel Structure-Agnostic Multi-Objective Approach for Weight-Sharing   Compression in Deep Neural Networks

Rasa Khosrowshahli; Shahryar Rahnamayan; Beatrice Ombuki-Berman

arXiv:2501.03095·cs.CV·January 7, 2025

A Novel Structure-Agnostic Multi-Objective Approach for Weight-Sharing Compression in Deep Neural Networks

Rasa Khosrowshahli, Shahryar Rahnamayan, Beatrice Ombuki-Berman

PDF

Open Access

TL;DR

This paper introduces a model- and layer-independent multi-objective evolutionary algorithm for weight-sharing compression in deep neural networks, achieving significant memory reduction across multiple datasets without retraining shared weights.

Contribution

It proposes a novel, architecture-agnostic compression framework using uniform quantization and Pareto optimization to enhance neural network compression efficiency.

Findings

01

Achieves up to 14.98x memory reduction on CIFAR-10

02

Reduces network size by up to 12.99x on CIFAR-100

03

Attains 8.58x compression on ImageNet

Abstract

Deep neural networks suffer from storing millions and billions of weights in memory post-training, making challenging memory-intensive models to deploy on embedded devices. The weight-sharing technique is one of the popular compression approaches that use fewer weight values and share across specific connections in the network. In this paper, we propose a multi-objective evolutionary algorithm (MOEA) based compression framework independent of neural network architecture, dimension, task, and dataset. We use uniformly sized bins to quantize network weights into a single codebook (lookup table) for efficient weight representation. Using MOEA, we search for Pareto optimal $k$ bins by optimizing two objectives. Then, we apply the iterative merge technique to non-dominated Pareto frontier solutions by combining neighboring bins without degrading performance to decrease the number of bins and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Computing and Algorithms