A Unified Approach to Controlling Implicit Regularization via Mirror Descent

Haoyuan Sun; Khashayar Gatmiry; Kwangjun Ahn; Navid Azizan

arXiv:2306.13853 · cs.LG · January 12, 2024

TL;DR

This paper introduces a unified mirror descent framework to control implicit regularization in over-parameterized models, demonstrating its effectiveness in both regression and classification tasks with theoretical convergence guarantees.

Contribution

The paper presents a general mirror descent approach that unifies and extends control of implicit regularization across learning problems, where earlier results were confined to a specific geometry or a particular class of problems.

Findings

01. Mirror descent (MD) converges to generalized maximum-margin solutions in classification.

02. MD can be implemented efficiently and converges quickly.

03. Different regularizers induced via MD lead to different generalization performance.
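
To make the mirror descent update concrete, here is a minimal sketch in Python. It is an illustration under stated assumptions, not the authors' implementation: the potential ψ(w) = (1/p) Σ|w_i|^p, the squared loss, the step size, and the function name `mirror_descent_pnorm` are all choices made here for illustration and are not given in this summary. With p = 2 the mirror map is the identity and the update reduces to plain gradient descent.

```python
import numpy as np


def mirror_descent_pnorm(X, y, p=3.0, lr=0.1, steps=1000):
    """Mirror descent on mean squared loss (hypothetical helper).

    Assumes the potential psi(w) = (1/p) * sum_i |w_i|^p with p > 1,
    whose gradient (the mirror map) acts coordinate-wise.
    """
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(steps):
        # Gradient of the loss (1 / 2n) * ||Xw - y||^2 in the primal space.
        grad = X.T @ (X @ w - y) / n
        # Mirror map: z = grad psi(w) = sign(w) * |w|^(p-1).
        z = np.sign(w) * np.abs(w) ** (p - 1)
        # Gradient step taken in the dual (mirror) space.
        z = z - lr * grad
        # Inverse mirror map back to the primal space.
        w = np.sign(z) * np.abs(z) ** (1.0 / (p - 1))
    return w


# Usage: an over-parameterized regression problem (more features than samples).
rng = np.random.default_rng(0)
X = rng.standard_normal((5, 20))
y = rng.standard_normal(5)
w_p2 = mirror_descent_pnorm(X, y, p=2.0, lr=0.05, steps=200)
```

Because both the mirror map and its inverse are coordinate-wise in this sketch, each iteration costs about the same as a gradient descent step; changing `p` changes the geometry of the update and therefore which interpolating solution the iterates favor.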

Abstract

Inspired by the remarkable success of large neural networks, there has been significant interest in understanding the generalization performance of over-parameterized models. Substantial efforts have been invested in characterizing how optimization algorithms impact generalization through their "preferred" solutions, a phenomenon commonly referred to as implicit regularization. In particular, it has been argued that gradient descent (GD) induces an implicit $\ell_2$-norm regularization in regression and classification problems. However, the implicit regularization of different algorithms is confined to either a specific geometry or a particular class of learning problems, indicating a gap in a general approach for controlling the implicit regularization. To address this, we present a unified approach using mirror descent (MD), a notable generalization of GD, to control implicit…

Taxonomy

Topics: Numerical methods in inverse problems