Estimating Implicit Regularization in Deep Learning
Joseph H. Rudoler, Kevin Tan, Giles Hooker, Konrad P. Kording

TL;DR
This paper introduces gradient matching methods to empirically estimate implicit regularization in deep learning, enabling analysis of complex networks and effects like dropout.
Contribution
It provides a practical approach to measure implicit regularization effects in neural networks, including those not analytically derivable.
Findings
Recovers explicit penalties like L1 and L2 through gradient matching.
Replicates implicit effects such as early stopping-induced quadratic penalties.
Characterizes dropout's implicit L2 effects in deep networks.
Abstract
Deep learning systems are known to exhibit implicit regularization (alt. implicit bias), favoring simple solutions instead of merely minimizing the loss function. In some cases, we can analytically derive the implicit regularization -- connecting it to an equivalent penalty that augments the learning objective. However, modern deep learning systems are complex, carrying modifications to the training procedure and architecture (e.g. early stopping, minibatching, dropout) whose effects are not always directly interpretable. Although estimating the resulting implicit regularization could aid theorists in algorithm design and practitioners in interpreting their hyperparameter choices, this problem has received little direct attention. It is also tractable: regularization makes weight updates deviate from loss gradients, promising a signal for identifying implicit bias. Here we provide…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
