Understanding the Spectral Bias of Coordinate Based MLPs Via Training   Dynamics

John Lazzari; Xiuwen Liu

arXiv:2301.05816·cs.LG·May 5, 2023·1 cites

Understanding the Spectral Bias of Coordinate Based MLPs Via Training Dynamics

John Lazzari, Xiuwen Liu

PDF

Open Access

TL;DR

This paper investigates the spectral bias in coordinate-based MLPs by analyzing training dynamics, revealing how activation regions and gradient convergence influence frequency learning, especially in low-dimensional settings, and how positional encoding mitigates this bias.

Contribution

It introduces a novel approach to understanding spectral bias through direct analysis of ReLU MLP training dynamics and their relation to input encoding.

Findings

01

Spectral bias causes networks to learn low frequencies first.

02

Positional encoding helps overcome spectral bias in low dimensions.

03

Training dynamics reveal how activation regions affect convergence to high frequencies.

Abstract

Spectral bias is an important observation of neural network training, stating that the network will learn a low frequency representation of the target function before converging to higher frequency components. This property is interesting due to its link to good generalization in over-parameterized networks. However, in low dimensional settings, a severe spectral bias occurs that obstructs convergence to high frequency components entirely. In order to overcome this limitation, one can encode the inputs using a high frequency sinusoidal encoding. Previous works attempted to explain this phenomenon using Neural Tangent Kernel (NTK) and Fourier analysis. However, NTK does not capture real network dynamics, and Fourier analysis only offers a global perspective on the network properties that induce this bias. In this paper, we provide a novel approach towards understanding spectral bias by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Structural Health Monitoring Techniques

MethodsNeural Tangent Kernel · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings