A new initialisation to Control Gradients in Sinusoidal Neural network

Andrea Combette; Antoine Venaille; Nelly Pustelnik

arXiv:2512.06427·cs.LG·May 12, 2026

A new initialisation to Control Gradients in Sinusoidal Neural network

Andrea Combette, Antoine Venaille, Nelly Pustelnik

PDF

TL;DR

This paper introduces a novel initialization method for sinusoidal neural networks like SIREN, improving gradient control, training stability, and generalization by deriving a closed-form parameter initialization based on fixed points and NTK analysis.

Contribution

The authors propose a new initialization scheme for sinusoidal networks that enhances gradient management and generalization, supported by theoretical analysis and empirical benchmarks.

Findings

01

The new initialization outperforms original SIREN and baselines in function fitting and image reconstruction.

02

Controlling gradients via the proposed scheme improves training stability and generalization.

03

The initialization influences training dynamics significantly through the Neural Tangent Kernel framework.

Abstract

Proper initialisation strategy is of primary importance to mitigate gradient explosion or vanishing when training neural networks. Yet, the impact of initialisation parameters still lacks a precise theoretical understanding for several well-established architectures. Here, we propose a new initialisation for networks with sinusoidal activation functions such as \texttt{SIREN}, focusing on gradients control, their scaling with network depth, their impact on training and on generalization. To achieve this, we identify a closed-form expression for the initialisation of the parameters, differing from the original \texttt{SIREN} scheme. This expression is derived from fixed points obtained through the convergence of pre-activation distribution and the variance of Jacobian sequences. Controlling both gradients and targeting vanishing pre-activation helps preventing the emergence of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.