Fast Training of Sinusoidal Neural Fields via Scaling Initialization
Taesun Yeom, Sangyoon Lee, Jaeho Lee

TL;DR
This paper introduces a simple weight scaling technique that significantly accelerates the training of sinusoidal neural fields, reducing training time by a factor of ten and improving convergence and spectral bias handling.
Contribution
The paper proposes a novel weight scaling initialization method for sinusoidal neural fields that enhances training speed and stability, outperforming existing architectures.
Findings
Weight scaling accelerates SNF training by 10x.
The method improves spectral bias handling.
It results in better-conditioned optimization trajectories.
Abstract
Neural fields are an emerging paradigm that represent data as continuous functions parameterized by neural networks. Despite many advantages, neural fields often have a high training cost, which prevents a broader adoption. In this paper, we focus on a popular family of neural fields, called sinusoidal neural fields (SNFs), and study how it should be initialized to maximize the training speed. We find that the standard initialization scheme for SNFs -- designed based on the signal propagation principle -- is suboptimal. In particular, we show that by simply multiplying each weight (except for the last layer) by a constant, we can accelerate SNF training by 10. This method, coined , consistently provides a significant speedup over various data domains, allowing the SNFs to train faster than more recently proposed architectures. To understand why the…
Peer Reviews
Decision·ICLR 2025 Poster
- The motivation is clear, and the exposition is easy to follow. - Although not very surprising, the study of the theoretical behavior of weight scaling is fairly comprehensive and I appreciate the authors efforts to draw upon different theoretical tools in existing literature to provide a broad picture.
- The current choice of experiments is rather limited. Since the contribution is largely methodological, experiments on important practical applications of SNFs, specifically NeRF and solving differential equations (for instance, see Saratchandran et al., 2024), as well as larger datasets, are needed to justify the claims. - Experimental methodology: A few aspects of the experimental methodology are unclear and/or need revision. - Baseline: The included baseline with Xavier initialization i
- Overall the paper is well written, with sufficient number of plots provided to visualize the findings. - The proposed weight scaling scheme is simple yet seems to be quite effective in speeding up training.
- The theoretical results are quite limited. For instance, to show that the proposed weight scaling increases the relative power of the higher-frequency bases, the paper considers the effects of such scaling on a 3-layer SNF only for the case when the width is one. - While the optimization trajectories are studied via the lens of the empirical neural tangent kernel, it would be more convincing if there are theoretical/empirical results that quantify the impact of such scaling on the gradient lo
* $\textbf{Effective Speed Improvement}$: WS offers a substantial acceleration in training time without degrading model performance or generalization capabilities, which is valuable for SNF applications requiring rapid processing. * $\textbf{Simplicity and Practicality}$: The method requires minimal changes to existing architectures, making it straightforward to apply WS to SNFs without complex modifications. * $\textbf{Well-Written}$: The paper has been written in detail. * $\textbf{Theoretic
The authors acknowledge the main limitations of their work, which cannot be ignored. To interpret the results accurately, these limitations must be carefully considered.
Code & Models
Videos
Taxonomy
TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Speech and Audio Processing
MethodsFocus
