Big2Small: A Unifying Neural Network Framework for Model Compression
Jing-Xiao Liao, Haoran Wang, Tao Li, Daoming Lyu, Yi Zhang, Chengjun Cai, Feng-Lei Fan

TL;DR
Big2Small introduces a unifying mathematical framework for model compression and proposes a data-free method that encodes large model weights into compact neural representations, achieving competitive results.
Contribution
The paper develops a measure-theoretic unifying framework for various model compression techniques and introduces Big2Small, a data-free compression method using implicit neural representations.
Findings
Big2Small achieves competitive accuracy and compression ratios.
The framework demonstrates mathematical equivalence among different compression techniques.
Experimental results validate the effectiveness of the proposed method.
Abstract
With the development of foundational models, model compression has become a critical requirement. Various model compression approaches have been proposed such as low-rank decomposition, pruning, quantization, ergodic dynamic systems, and knowledge distillation, which are based on different heuristics. To elevate the field from fragmentation to a principled discipline, we construct a unifying mathematical framework for model compression grounded in measure theory. We further demonstrate that each model compression technique is mathematically equivalent to a neural network subject to a regularization. Building upon this mathematical and structural equivalence, we propose an experimentally-verified data-free model compression framework, termed \textit{Big2Small}, which translates Implicit Neural Representations (INRs) from data domain to the domain of network parameters. \textit{Big2Small}…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
