Extending the Use of MDL for High-Dimensional Problems: Variable Selection, Robust Fitting, and Additive Modeling

Zhenyu Wei; Raymond K. W. Wong; Thomas C. M. Lee

arXiv:2201.11171·eess.SP·January 28, 2022·ICASSP

Open Access

TL;DR

This paper extends the minimum description length (MDL) principle to high-dimensional problems, including variable selection, robust fitting, and additive modeling, demonstrating its effectiveness through numerical experiments.

Contribution

It introduces a natural extension of MDL for high-dimensional settings, covering linear regression, outlier robustness, and nonparametric additive models, with empirical validation.

Findings

01. MDL effectively handles variable selection in the high-dimensional setting.

02. The approach extends to robust fitting when the data contain outliers.

03. Numerical experiments demonstrate the efficiency and effectiveness of the MDL approach.

Abstract

In the signal processing and statistics literature, the minimum description length (MDL) principle is a popular tool for choosing model complexity. Successful examples include signal denoising and variable selection in linear regression, for which the corresponding MDL solutions often enjoy consistency properties and produce very promising empirical results. This paper demonstrates that MDL can be extended naturally to the high-dimensional setting, where the number of predictors $p$ is larger than the number of observations $n$. It first considers the case of linear regression, then allows for outliers in the data, and lastly extends to the robust fitting of nonparametric additive models. Results from numerical experiments are presented to demonstrate the efficiency and effectiveness of the MDL approach.
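
To make the idea of using MDL to choose model complexity concrete, the sketch below scores a few candidate predictor subsets in an ordinary low-dimensional linear regression with the classical two-part code length, $(n/2)\log(\mathrm{RSS}/n) + (k/2)\log n$. This is the textbook criterion, not the high-dimensional extension developed in the paper, whose exact form is not given on this page. The function name `two_part_mdl`, the simulated data, and the candidate subsets are all assumptions made for illustration.

```python
import numpy as np

def two_part_mdl(X, y, subset):
    """Classical two-part code length (in nats, up to constants) for a
    Gaussian linear model using only the columns listed in `subset`.
    Assumes len(subset) < n so the residual sum of squares is positive."""
    n = len(y)
    k = len(subset)
    if k == 0:
        rss = float(y @ y)  # empty model: nothing is fitted
    else:
        Xs = X[:, list(subset)]
        beta, *_ = np.linalg.lstsq(Xs, y, rcond=None)
        resid = y - Xs @ beta
        rss = float(resid @ resid)
    # Data-fit term plus the cost of encoding k real-valued coefficients.
    return 0.5 * n * np.log(rss / n) + 0.5 * k * np.log(n)

# Simulated data (hypothetical): only predictors 0 and 2 matter.
rng = np.random.default_rng(0)
n, p = 100, 5
X = rng.standard_normal((n, p))
y = 2.0 * X[:, 0] - 1.5 * X[:, 2] + 0.5 * rng.standard_normal(n)

# Score a handful of candidate subsets and keep the shortest description.
candidates = [(), (0,), (0, 2), (0, 1, 2, 3, 4)]
scores = {s: two_part_mdl(X, y, s) for s in candidates}
best = min(scores, key=scores.get)
print(best, scores)
```

The second term penalizes each additional coefficient, so a subset is preferred only when the reduction in residual error outweighs the cost of describing the extra parameters. When $p > n$, an exhaustive search over subsets is infeasible and this simple penalty is no longer adequate on its own, which is the setting the paper addresses.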

Taxonomy

Topics: Advanced Statistical Methods and Models · Neural Networks and Applications · Control Systems and Identification

Methods: Minimum Description Length