Frequency Bias in Neural Networks for Input of Non-Uniform Density

Ronen Basri; Meirav Galun; Amnon Geifman; David Jacobs; Yoni Kasten; Shira Kritchman

arXiv:2003.04560 · cs.LG · March 11, 2020 · 29 cites

TL;DR

This paper studies frequency bias in neural networks trained on non-uniformly distributed data. Using NTK analysis for both shallow and deep networks, it shows that the convergence rate at a point depends on the local data density as well as on the frequency of the target function.

Contribution

The paper analytically derives how convergence time depends on local data density and target frequency, extending the NTK analysis of frequency bias from uniform to non-uniform data distributions and from shallow to deep networks.

Findings

01

When learning a pure harmonic function of frequency κ, convergence at a point x on the sphere S^(d-1) occurs in time O(κ^d/p(x)), where p(x) is the local data density at x.

02

Eigenfunctions of NTK are derived for two-layer networks on the circle.

03

Deep networks show similar but distinct convergence behaviors compared to shallow ones.

Abstract

Recent works have partly attributed the generalization ability of over-parameterized neural networks to frequency bias -- networks trained with gradient descent on data drawn from a uniform distribution find a low frequency fit before high frequency ones. As realistic training sets are not drawn from a uniform distribution, we here use the Neural Tangent Kernel (NTK) model to explore the effect of variable density on training dynamics. Our results, which combine analytic and empirical observations, show that when learning a pure harmonic function of frequency $\kappa$, convergence at a point $\mathbf{x} \in \mathbb{S}^{d-1}$ occurs in time $O(\kappa^{d} / p(\mathbf{x}))$, where $p(\mathbf{x})$ denotes the local density at $\mathbf{x}$. Specifically, for data in $\mathbb{S}^{1}$ we analytically derive the eigenfunctions of the kernel associated with the NTK for two-layer networks. We further prove convergence results for deep,…
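The density dependence described in the abstract can be probed numerically. The following is a minimal sketch, not the authors' code: it runs kernel gradient descent on points of the circle sampled with two different densities and reports the remaining error in the dense and sparse halves separately. The closed-form kernel (built from arc-cosine kernels for a bias-free two-layer ReLU network), the function names, and all parameter values are assumptions made for illustration and do not come from the source.

```python
import numpy as np


def ntk_two_layer(theta_a, theta_b):
    """Assumed closed form of a bias-free two-layer ReLU NTK on the unit circle.

    Points are given by their angles; the kernel depends only on the angle
    between them. This form is an illustrative assumption, not taken from
    the paper.
    """
    u = np.clip(np.cos(theta_a[:, None] - theta_b[None, :]), -1.0, 1.0)
    ang = np.arccos(u)
    k0 = (np.pi - ang) / np.pi  # arc-cosine kernel of degree 0
    k1 = (u * (np.pi - ang) + np.sqrt(np.maximum(1.0 - u**2, 0.0))) / np.pi
    return u * k0 + k1


def kernel_gd_residual(theta, y, steps):
    """Run kernel gradient descent from a zero predictor; return the residual.

    The residual on the training points evolves as r <- (I - eta * K / n) r.
    """
    gram = ntk_two_layer(theta, theta) / len(theta)
    lam_max = np.linalg.eigvalsh(gram)[-1]  # eigenvalues are in ascending order
    eta = 1.0 / lam_max  # keeps every step non-expansive
    r = y.copy()
    for _ in range(steps):
        r = r - eta * (gram @ r)
    return r


def run_demo(kappa=4, n_dense=240, n_sparse=60, steps=500, seed=0):
    """Fit cos(kappa * theta) on a circle whose two halves differ in density."""
    rng = np.random.default_rng(seed)
    dense = rng.uniform(0.0, np.pi, n_dense)          # high-density half
    sparse = rng.uniform(np.pi, 2.0 * np.pi, n_sparse)  # low-density half
    theta = np.concatenate([dense, sparse])
    y = np.cos(kappa * theta)
    r = kernel_gd_residual(theta, y, steps)
    return {
        "norm_initial": float(np.linalg.norm(y)),
        "norm_final": float(np.linalg.norm(r)),
        "dense": float(np.mean(np.abs(r[:n_dense]))),
        "sparse": float(np.mean(np.abs(r[n_dense:]))),
    }


if __name__ == "__main__":
    for steps in (50, 500, 5000):
        out = run_demo(steps=steps)
        print(steps, "dense:", out["dense"], "sparse:", out["sparse"])
```

If the stated O(κ^d/p(x)) behavior carries over to this toy setting, the mean residual in the dense half should shrink sooner than in the sparse half as `steps` grows. Varying `kappa` and the ratio `n_dense / n_sparse` gives a rough way to check how both factors enter.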

Videos

Frequency Bias in Neural Networks for Input of Non-Uniform Density · SlidesLive

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Neural Networks and Applications · Model Reduction and Neural Networks

Methods: Neural Tangent Kernel