On Spectral Bias Reduction of Multi-scale Neural Networks for Regression Problems
Bo Wang, Heng Yuan, Lizuo Liu, Wenzhong Zhang, Wei Cai

TL;DR
This paper develops diffusion equation models in the spectral domain to explain how multi-scale neural networks reduce spectral bias, demonstrating their effectiveness in approximating oscillatory functions across frequencies.
Contribution
It introduces spectral diffusion models derived from neural tangent kernel analysis to explain spectral bias reduction in multi-scale neural networks for regression.
Findings
Diffusion coefficients increase with more scales, enhancing spectral bias reduction.
Numerical results validate the diffusion models' accuracy in predicting error evolution.
MscaleDNNs effectively approximate functions with a wide frequency range.
Abstract
In this paper, we derive diffusion equation models in the spectral domain for the evolution of training errors of two-layer multi-scale deep neural networks (MscaleDNN) \cite{caixu2019,liu2020multi}, designed to reduce the spectral bias of fully connected deep neural networks in approximating oscillatory functions. The diffusion models are obtained from the spectral form of the error equation of the MscaleDNN, derived with a neural tangent kernel approach and gradient descent training and a sine activation function, assuming a vanishing learning rate and infinite network width and domain size. The involved diffusion coefficients are shown to have larger supports if more scales are used in the MscaleDNN, and thus, the proposed diffusion equation models in the frequency domain explain the MscaleDNN's spectral bias reduction capability. Numerical results of the diffusion models for a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Numerical methods in inverse problems · Neural Networks and Applications
