On the Implicit Bias in Deep-Learning Algorithms

Gal Vardi

arXiv:2208.12591 · cs.LG · November 8, 2022

TL;DR

This short survey explains the notion of implicit bias in deep-learning algorithms, describes the role it is believed to play in their ability to generalize despite overparameterization, and reviews the main theoretical results and their implications.

Contribution

The paper provides a concise survey of implicit bias in deep learning, summarizing the main theoretical findings and their significance for understanding generalization.

Findings

01. Implicit bias influences generalization in deep learning.
02. Gradient-based algorithms tend to converge to solutions with specific properties.
03. Understanding implicit bias can inform the design of better algorithms.

Abstract

Gradient-based deep-learning algorithms exhibit remarkable performance in practice, but it is not well understood why they are able to generalize despite having more parameters than training examples. It is believed that implicit bias is a key factor in their ability to generalize, and hence it has been widely studied in recent years. In this short survey, we explain the notion of implicit bias, review the main results, and discuss their implications.
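As an illustration of what "implicit bias" means in the simplest setting, the sketch below runs plain gradient descent on an underdetermined least-squares problem (more parameters than training examples). It is not taken from the survey or from the linked repository. The problem sizes, the random data, and the helper name `gradient_descent` are assumptions chosen for the example. Many weight vectors fit the data exactly, yet gradient descent started from zero converges to the one with the smallest Euclidean norm, even though the loss contains no norm penalty.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 5, 20  # assumed sizes: fewer training examples than parameters
X = rng.standard_normal((n, d))
y = rng.standard_normal(n)


def gradient_descent(X, y, steps=5000):
    """Gradient descent on 0.5 * ||X w - y||^2, started from w = 0."""
    # Step size 1/L, where L is the largest eigenvalue of X^T X.
    lr = 1.0 / np.linalg.norm(X, 2) ** 2
    w = np.zeros(X.shape[1])
    for _ in range(steps):
        w -= lr * (X.T @ (X @ w - y))
    return w


w_gd = gradient_descent(X, y)

# Minimum-norm interpolating solution, computed via the pseudoinverse.
w_min_norm = np.linalg.pinv(X) @ y

# A different interpolating solution: add a component from the null space of X.
v = rng.standard_normal(d)
v_null = v - np.linalg.pinv(X) @ (X @ v)
w_other = w_min_norm + v_null

print("GD fits the data:       ", np.allclose(X @ w_gd, y, atol=1e-6))
print("other solution fits too:", np.allclose(X @ w_other, y, atol=1e-6))
print("GD equals min-norm:     ", np.allclose(w_gd, w_min_norm, atol=1e-6))
print("norms (GD, other):      ", np.linalg.norm(w_gd), np.linalg.norm(w_other))
```

The zero initialization matters here: every gradient step stays in the row space of `X`, so the iterates never pick up a null-space component. This is a linear toy case only; it is meant to make the notion concrete, not to represent the results for deep networks.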

Code & Models

Repositories

wmz9/ire-algorithm-framework
pytorch

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification