Understanding the dynamics of the frequency bias in neural networks

Juan Molina; Mircea Petrache; Francisco Sahli Costabal; Mat\'ias; Courdurier

arXiv:2405.14957·cs.LG·May 27, 2024

Understanding the dynamics of the frequency bias in neural networks

Juan Molina, Mircea Petrache, Francisco Sahli Costabal, Mat\'ias, Courdurier

PDF

Open Access

TL;DR

This paper develops a PDE model to analyze the frequency bias in neural networks, showing how initialization distributions can control this bias, validated through experiments on Fourier Features and multi-layer NNs.

Contribution

It introduces a PDE framework for understanding frequency dynamics in neural networks and demonstrates how initialization can mitigate frequency bias.

Findings

01

PDE accurately models frequency learning dynamics

02

Initialization distributions influence frequency bias control

03

Method extends from 2-layer to multi-layer NNs

Abstract

Recent works have shown that traditional Neural Network (NN) architectures display a marked frequency bias in the learning process. Namely, the NN first learns the low-frequency features before learning the high-frequency ones. In this study, we rigorously develop a partial differential equation (PDE) that unravels the frequency dynamics of the error for a 2-layer NN in the Neural Tangent Kernel regime. Furthermore, using this insight, we explicitly demonstrate how an appropriate choice of distributions for the initialization weights can eliminate or control the frequency bias. We focus our study on the Fourier Features model, an NN where the first layer has sine and cosine activation functions, with frequencies sampled from a prescribed distribution. In this setup, we experimentally validate our theoretical results and compare the NN dynamics to the solution of the PDE using the finite…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsFocus