How Does Fourier Analysis Network Work? A Mechanism Analysis and a New Dual-Activation Layer Proposal
Sam Jeong, Hae Yong Kim

TL;DR
This paper analyzes the Fourier Analysis Network's mechanism, revealing that sine activation improves training by mitigating vanishing gradients and proposing a Dual-Activation Layer that accelerates convergence across various tasks.
Contribution
It uncovers the specific role of sine activation in FAN, shifts understanding from spectral to training dynamics, and introduces DAL for faster, more effective neural network training.
Findings
Sine activation improves gradient flow near zero, aiding training.
FAN primarily alleviates the dying-ReLU problem.
DAL accelerates convergence and maintains or improves accuracy.
Abstract
Fourier Analysis Network (FAN) was recently proposed as a simple way to improve neural network performance by replacing part of Rectified Linear Unit (ReLU) activations with sine and cosine functions. Although several studies have reported small but consistent gains across tasks, the underlying mechanism behind these improvements has remained unclear. In this work, we show that only the sine activation contributes positively to performance, whereas the cosine activation tends to be detrimental. Our analysis reveals that the improvement is not a consequence of the sine function's periodic nature; instead, it stems from the function's local behavior near x = 0, where its non-zero derivative mitigates the vanishing-gradient problem. We further show that FAN primarily alleviates the dying-ReLU problem, in which a neuron consistently receives negative inputs, produces zero gradients, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Ferroelectric and Negative Capacitance Devices · Advanced Memory and Neural Computing
