Algorithmic Simplification of Neural Networks with Mosaic-of-Motifs

Pedram Bakhtiarifard; Tong Chen; Jonathan Wensh{\o}j; Erik B Dam; Raghavendra Selvan

arXiv:2602.14896·cs.LG·May 18, 2026

Algorithmic Simplification of Neural Networks with Mosaic-of-Motifs

Pedram Bakhtiarifard, Tong Chen, Jonathan Wensh{\o}j, Erik B Dam, Raghavendra Selvan

PDF

TL;DR

This paper introduces Mosaic-of-Motifs (MoMos), a new parameterization method that biases neural network training toward simpler, more compressible solutions by constraining weights to reusable motifs, supported by empirical evidence.

Contribution

The paper formalizes the Kolmogorov complexity of neural network weights and proposes MoMos, a constrained parameterization that reduces complexity and enhances compressibility during training.

Findings

01

MoMos consistently lowers the algorithmic complexity of neural networks.

02

MoMos preserves model performance despite complexity reduction.

03

Parameter compressibility can be induced during training, not just post hoc.

Abstract

Large-scale deep learning models are well-suited for compression. Across a variety of tasks, methods like pruning, quantization, and knowledge distillation have been used to achieve massive reductions in model parameters with only marginal performance drops. This raises the central question: *Why are deep neural networks suited for compression?* In this work, we take up the perspective of algorithmic complexity to explain this behavior. We hypothesize that the parameters of trained models have more structure and, hence, exhibit lower algorithmic complexity compared to the weights at (random) initialization. Furthermore, model compression methods harness this reduced algorithmic complexity to compress models. Although an unconstrained parameterization of model weights, $w \in R^{n}$ , can represent arbitrary weight assignments, the solutions found during training exhibit…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Computability, Logic, AI Algorithms · Machine Learning and Data Classification