Characterizing Learning in Deep Neural Networks using Tractable Algorithmic Complexity Analysis

Pedram Bakhtiarifard; Sophia N. Wilson; Mahmoud Afifi; Jonathan Wensh{\o}j; Raghavendra Selvan

arXiv:2605.15551·cs.LG·May 18, 2026

Characterizing Learning in Deep Neural Networks using Tractable Algorithmic Complexity Analysis

Pedram Bakhtiarifard, Sophia N. Wilson, Mahmoud Afifi, Jonathan Wensh{\o}j, Raghavendra Selvan

PDF

TL;DR

This paper introduces QuBD, a scalable method for estimating the algorithmic complexity of deep neural network weights, revealing how complexity evolves during training and correlates with generalization.

Contribution

The authors develop QuBD, extending complexity estimation to multi-ary objects, enabling analysis of large DNNs and providing new insights into learning mechanisms.

Findings

01

Complexity decreases as models learn

02

Complexity scales with data budget and increases during overfitting

03

Most information resides in significant bit-planes, aiding quantization decisions

Abstract

Training large-scale deep neural networks (DNNs) is resource-intensive, making model compression a practical necessity. The widely accepted ''learning as compression'' hypothesis posits that training induces structure in network weights, which enables compression. Measuring this structure through Kolmogorov-Chaitin-Solomonoff (KCS) complexity is appealing, but existing estimators based on the Coding Theorem Method (CTM) and the Block Decomposition Method (BDM) are limited to small binary objects and do not scale to modern DNNs. We introduce the Quantized Block Decomposition method (QuBD), which extends algorithmic complexity estimation to any $k$ -ary object. QuBD first quantizes the network weights to a finite alphabet, then estimates the KCS complexity by aggregating per bit-plane CTM estimates. We show theoretically that QuBD yields a strictly tighter estimation gap with respect to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.