Algorithmic Simplification of Neural Networks with Mosaic-of-Motifs
Pedram Bakhtiarifard, Tong Chen, Jonathan Wensh{\o}j, Erik B Dam, Raghavendra Selvan

TL;DR
This paper introduces Mosaic-of-Motifs (MoMos), a new parameterization method that biases neural network training toward simpler, more compressible solutions by constraining weights to reusable motifs, supported by empirical evidence.
Contribution
The paper formalizes the Kolmogorov complexity of neural network weights and proposes MoMos, a constrained parameterization that reduces complexity and enhances compressibility during training.
Findings
MoMos consistently lowers the algorithmic complexity of neural networks.
MoMos preserves model performance despite complexity reduction.
Parameter compressibility can be induced during training, not just post hoc.
Abstract
Large-scale deep learning models are well-suited for compression. Across a variety of tasks, methods like pruning, quantization, and knowledge distillation have been used to achieve massive reductions in model parameters with only marginal performance drops. This raises the central question: *Why are deep neural networks suited for compression?* In this work, we take up the perspective of algorithmic complexity to explain this behavior. We hypothesize that the parameters of trained models have more structure and, hence, exhibit lower algorithmic complexity compared to the weights at (random) initialization. Furthermore, model compression methods harness this reduced algorithmic complexity to compress models. Although an unconstrained parameterization of model weights, , can represent arbitrary weight assignments, the solutions found during training exhibit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Computability, Logic, AI Algorithms · Machine Learning and Data Classification
