Extending the Use of MDL for High-Dimensional Problems: Variable Selection, Robust Fitting, and Additive Modeling
Zhenyu Wei, Raymond K. W. Wong, Thomas C. M. Lee

TL;DR
This paper extends the minimum description length (MDL) principle to high-dimensional problems, including variable selection, robust fitting, and additive modeling, demonstrating its effectiveness through numerical experiments.
Contribution
It introduces a natural extension of MDL for high-dimensional settings, covering linear regression, outlier robustness, and nonparametric additive models, with empirical validation.
Findings
MDL effectively handles high-dimensional variable selection.
The approach is robust to outliers in data.
Numerical experiments show efficiency and effectiveness.
Abstract
In the signal processing and statistics literature, the minimum description length (MDL) principle is a popular tool for choosing model complexity. Successful examples include signal denoising and variable selection in linear regression, for which the corresponding MDL solutions often enjoy consistent properties and produce very promising empirical results. This paper demonstrates that MDL can be extended naturally to the high-dimensional setting, where the number of predictors is larger than the number of observations . It first considers the case of linear regression, then allows for outliers in the data, and lastly extends to the robust fitting of nonparametric additive models. Results from numerical experiments are presented to demonstrate the efficiency and effectiveness of the MDL approach.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Neural Networks and Applications · Control Systems and Identification
MethodsMinimum Description Length
