TL;DR
This paper investigates the fundamental barriers in training stable and accurate neural networks, demonstrating theoretical limitations and proposing FIRENETs as a stable, efficient solution for inverse problems.
Contribution
The paper reveals intrinsic computational barriers in neural network training and introduces FIRENETs, a new stable network architecture with provable efficiency for inverse problems.
Findings
Certain stable NNs cannot be computed by any algorithm with high probability.
Deterministic algorithms can approximate NNs with fewer training samples under specific conditions.
FIRENETs are stable and require logarithmic layers for epsilon-accuracy in inverse problems.
Abstract
Deep learning (DL) has had unprecedented success and is now entering scientific computing with full force. However, current DL methods typically suffer from instability, even when universal approximation properties guarantee the existence of stable neural networks (NNs). We address this paradox by demonstrating basic well-conditioned problems in scientific computing where one can prove the existence of NNs with great approximation qualities, however, there does not exist any algorithm, even randomised, that can train (or compute) such a NN. For any positive integers and , there are cases where simultaneously: (a) no randomised training algorithm can compute a NN correct to digits with probability greater than , (b) there exists a deterministic training algorithm that computes a NN with correct digits, but any such (even randomised) algorithm needs arbitrarily…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
