Muddling Labels for Regularization, a novel approach to generalization

Karim Lounici; Katia Meziani; Benjamin Riu

arXiv:2102.08769·stat.ML·February 18, 2021·1 cites

Muddling Labels for Regularization, a novel approach to generalization

Karim Lounici, Katia Meziani, Benjamin Riu

PDF

Open Access

TL;DR

This paper introduces a new regularization approach that directly measures and minimizes overfitting risk, enabling hyperparameter calibration during training without data splitting, and shows improved generalization in linear regression tasks.

Contribution

A novel risk measure for regularization that allows hyperparameter tuning during training without validation data, applicable to various structures and compatible with gradient descent.

Findings

01

Procedures outperform traditional cross-validation methods in generalization.

02

Methods are computationally feasible and easy to implement.

03

Approach improves estimation and support recovery of model parameters.

Abstract

Generalization is a central problem in Machine Learning. Indeed most prediction methods require careful calibration of hyperparameters usually carried out on a hold-out \textit{validation} dataset to achieve generalization. The main goal of this paper is to introduce a novel approach to achieve generalization without any data splitting, which is based on a new risk measure which directly quantifies a model's tendency to overfit. To fully understand the intuition and advantages of this new approach, we illustrate it in the simple linear regression model ( $Y = X β + ξ$ ) where we develop a new criterion. We highlight how this criterion is a good proxy for the true generalization risk. Next, we derive different procedures which tackle several structures simultaneously (correlation, sparsity,...). Noticeably, these procedures \textbf{concomitantly} train the model and calibrate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Gaussian Processes and Bayesian Inference · Machine Learning and Data Classification

MethodsLinear Regression