Equidistribution-based training of Free Knot Splines and ReLU Neural Networks
Simone Appella, Simon Arridge, Chris Budd, Teo Deveney, Lisa Maria, Kreusser

TL;DR
This paper introduces an equidistribution-based training method for shallow ReLU neural networks that improves conditioning and approximation accuracy by leveraging insights from Free Knot Splines and optimal piecewise linear interpolation.
Contribution
The paper proposes a novel training approach for ReLU networks inspired by Free Knot Splines, enhancing conditioning and approximation quality through equidistribution principles.
Findings
Improved training stability and accuracy for shallow ReLU networks.
Effective approximation of various target functions, including singular and rapidly varying ones.
Extension of the method to deeper networks with maintained performance.
Abstract
We consider the problem of univariate nonlinear function approximation using shallow neural networks (NN) with a rectified linear unit (ReLU) activation function. We show that the based approximation problem is ill-conditioned and the behaviour of optimisation algorithms used in training these networks degrades rapidly as the width of the network increases. This can lead to significantly poorer approximation in practice than expected from the theoretical expressivity of the ReLU architecture and traditional methods such as univariate Free Knot Splines (FKS). Univariate shallow ReLU NNs and FKS span the same function space, and thus have the same theoretical expressivity. However, the FKS representation remains well-conditioned as the number of knots increases. We leverage the theory of optimal piecewise linear interpolants to improve the training procedure for ReLU NNs. Using the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Numerical Analysis Techniques · Advanced machining processes and optimization
