Characterizing Learning in Deep Neural Networks using Tractable Algorithmic Complexity Analysis
Pedram Bakhtiarifard, Sophia N. Wilson, Mahmoud Afifi, Jonathan Wensh{\o}j, Raghavendra Selvan

TL;DR
This paper introduces QuBD, a scalable method for estimating the algorithmic complexity of deep neural network weights, revealing how complexity evolves during training and correlates with generalization.
Contribution
The authors develop QuBD, extending complexity estimation to multi-ary objects, enabling analysis of large DNNs and providing new insights into learning mechanisms.
Findings
Complexity decreases as models learn
Complexity scales with data budget and increases during overfitting
Most information resides in significant bit-planes, aiding quantization decisions
Abstract
Training large-scale deep neural networks (DNNs) is resource-intensive, making model compression a practical necessity. The widely accepted ''learning as compression'' hypothesis posits that training induces structure in network weights, which enables compression. Measuring this structure through Kolmogorov-Chaitin-Solomonoff (KCS) complexity is appealing, but existing estimators based on the Coding Theorem Method (CTM) and the Block Decomposition Method (BDM) are limited to small binary objects and do not scale to modern DNNs. We introduce the Quantized Block Decomposition method (QuBD), which extends algorithmic complexity estimation to any -ary object. QuBD first quantizes the network weights to a finite alphabet, then estimates the KCS complexity by aggregating per bit-plane CTM estimates. We show theoretically that QuBD yields a strictly tighter estimation gap with respect to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
