Revisiting minimum description length complexity in overparameterized models
Raaz Dwivedi, Chandan Singh, Bin Yu, Martin J. Wainwright

TL;DR
This paper introduces MDL-COMP, a new complexity measure based on minimum description length, which effectively characterizes overparameterized models and guides hyper-parameter tuning to improve generalization and reduce computational costs.
Contribution
We propose MDL-COMP, a novel MDL-based complexity measure valid for overparameterized models, with theoretical analysis and practical applications demonstrating its effectiveness.
Findings
MDL-COMP scales linearly with parameters when d<n, and logarithmically when d>n.
MDL-COMP informs minimax in-sample error in kernel methods.
Data-driven Prac-MDL-COMP improves hyper-parameter tuning and can outperform cross-validation.
Abstract
Complexity is a fundamental concept underlying statistical learning theory that aims to inform generalization performance. Parameter count, while successful in low-dimensional settings, is not well-justified for overparameterized settings when the number of parameters is more than the number of training samples. We revisit complexity measures based on Rissanen's principle of minimum description length (MDL) and define a novel MDL-based complexity (MDL-COMP) that remains valid for overparameterized models. MDL-COMP is defined via an optimality criterion over the encodings induced by a good Ridge estimator class. We provide an extensive theoretical characterization of MDL-COMP for linear models and kernel methods and show that it is not just a function of parameter count, but rather a function of the singular values of the design or the kernel matrix and the signal-to-noise ratio. For a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Face and Expression Recognition
