Theoretical Compression Bounds for Wide Multilayer Perceptrons

Houssam El Cheairi; David Gamarnik; Rahul Mazumder

arXiv:2512.06288·cs.LG·December 9, 2025

Theoretical Compression Bounds for Wide Multilayer Perceptrons

Houssam El Cheairi, David Gamarnik, Rahul Mazumder

PDF

Open Access

TL;DR

This paper provides a theoretical analysis of pruning and quantization in wide neural networks, demonstrating the existence of highly compressed subnetworks with competitive performance through a randomized greedy algorithm.

Contribution

It introduces a rigorous theoretical framework for pruning and quantization in wide MLPs and CNNs, explaining their empirical success without data assumptions.

Findings

01

Existence of pruned/quantized subnetworks with competitive performance

02

Extension of results to structured pruning of CNNs

03

Tradeoff between compressibility and network width

Abstract

Pruning and quantization techniques have been broadly successful in reducing the number of parameters needed for large neural networks, yet theoretical justification for their empirical success falls short. We consider a randomized greedy compression algorithm for pruning and quantization post-training and use it to rigorously show the existence of pruned/quantized subnetworks of multilayer perceptrons (MLPs) with competitive performance. We further extend our results to structured pruning of MLPs and convolutional neural networks (CNNs), thus providing a unified analysis of pruning in wide networks. Our results are free of data assumptions, and showcase a tradeoff between compressibility and network width. The algorithm we consider bears some similarities with Optimal Brain Damage (OBD) and can be viewed as a post-training randomized version of it. The theoretical results we derive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Stochastic Gradient Optimization Techniques · Adversarial Robustness in Machine Learning