Implicit vs. explicit regularization for high-dimensional gradient descent
Thomas Stark, Lukas Steinberger

TL;DR
This paper compares implicit and explicit regularization in high-dimensional gradient descent, showing explicit regularization's near-optimal statistical efficiency and proposing a data-driven method for selecting regularization parameters.
Contribution
It introduces a new methodology for high-dimensional linear prediction that demonstrates the advantages of explicit regularization over implicit early stopping, with a computationally efficient parameter choice.
Findings
Explicit regularization approaches the benchmark error with optimal tuning.
Implicit regularization via early stopping is less statistically efficient.
Proposes a data-driven method for regularization parameter selection.
Abstract
In this paper we investigate the generalization error of gradient descent (GD) applied to an -regularized OLS objective function in the linear model. Based on our analysis we develop new methodology for computationally tractable and statistically efficient linear prediction in a high-dimensional and massive data scenario (large-, large-). Our results are based on the surprising observation that the generalization error of optimally tuned regularized gradient descent approaches that of an optimal benchmark procedure in the iteration number . On the other hand standard GD for OLS (without explicit regularization) can achieve the benchmark only in degenerate cases. This shows that (optimal) explicit regularization can be nearly statistically efficient (for large ) whereas implicit regularization by (optimal) early stopping can not. To complete our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Hydrocarbon exploration and reservoir analysis · Medical Imaging Techniques and Applications
