Regularization Trade-offs with Fake Features
Martin Hellkvist, Ay\c{c}a \"Oz\c{c}elikkale, Anders Ahl\'en

TL;DR
This paper analyzes how fake features in overparameterized models affect generalization, providing bounds and insights into the trade-offs between implicit and explicit regularization, with numerical validation.
Contribution
It introduces a non-asymptotic high-probability bound on generalization error for ridge regression with fake features, revealing the interplay between different regularization effects.
Findings
Generalization error bound depends on fake features and ridge parameter
Optimal ridge regularization heavily influenced by the number of fake features
Numerical results demonstrate trade-offs in model regularization strategies
Abstract
Recent successes of massively overparameterized models have inspired a new line of work investigating the underlying conditions that enable overparameterized models to generalize well. This paper considers a framework where the possibly overparametrized model includes fake features, i.e., features that are present in the model but not in the data. We present a non-asymptotic high-probability bound on the generalization error of the ridge regression problem under the model misspecification of having fake features. Our highprobability results provide insights into the interplay between the implicit regularization provided by the fake features and the explicit regularization provided by the ridge parameter. Numerical results illustrate the trade-off between the number of fake features and how the optimal ridge parameter may heavily depend on the number of fake features.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Sparse and Compressive Sensing Techniques · Advanced Multi-Objective Optimization Algorithms
