Estimating Implicit Regularization in Deep Learning

Joseph H. Rudoler; Kevin Tan; Giles Hooker; Konrad P. Kording

arXiv:2605.05436·stat.ML·May 8, 2026

Estimating Implicit Regularization in Deep Learning

Joseph H. Rudoler, Kevin Tan, Giles Hooker, Konrad P. Kording

PDF

TL;DR

This paper introduces gradient matching methods to empirically estimate implicit regularization in deep learning, enabling analysis of complex networks and effects like dropout.

Contribution

It provides a practical approach to measure implicit regularization effects in neural networks, including those not analytically derivable.

Findings

01

Recovers explicit penalties like L1 and L2 through gradient matching.

02

Replicates implicit effects such as early stopping-induced quadratic penalties.

03

Characterizes dropout's implicit L2 effects in deep networks.

Abstract

Deep learning systems are known to exhibit implicit regularization (alt. implicit bias), favoring simple solutions instead of merely minimizing the loss function. In some cases, we can analytically derive the implicit regularization -- connecting it to an equivalent penalty that augments the learning objective. However, modern deep learning systems are complex, carrying modifications to the training procedure and architecture (e.g. early stopping, minibatching, dropout) whose effects are not always directly interpretable. Although estimating the resulting implicit regularization could aid theorists in algorithm design and practitioners in interpreting their hyperparameter choices, this problem has received little direct attention. It is also tractable: regularization makes weight updates deviate from loss gradients, promising a signal for identifying implicit bias. Here we provide…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.