Can stable and accurate neural networks be computed? -- On the barriers   of deep learning and Smale's 18th problem

Matthew J. Colbrook; Vegard Antun; Anders C. Hansen

arXiv:2101.08286·cs.LG·April 4, 2022

Can stable and accurate neural networks be computed? -- On the barriers of deep learning and Smale's 18th problem

Matthew J. Colbrook, Vegard Antun, Anders C. Hansen

PDF

1 Repo

TL;DR

This paper investigates the fundamental barriers in training stable and accurate neural networks, demonstrating theoretical limitations and proposing FIRENETs as a stable, efficient solution for inverse problems.

Contribution

The paper reveals intrinsic computational barriers in neural network training and introduces FIRENETs, a new stable network architecture with provable efficiency for inverse problems.

Findings

01

Certain stable NNs cannot be computed by any algorithm with high probability.

02

Deterministic algorithms can approximate NNs with fewer training samples under specific conditions.

03

FIRENETs are stable and require logarithmic layers for epsilon-accuracy in inverse problems.

Abstract

Deep learning (DL) has had unprecedented success and is now entering scientific computing with full force. However, current DL methods typically suffer from instability, even when universal approximation properties guarantee the existence of stable neural networks (NNs). We address this paradox by demonstrating basic well-conditioned problems in scientific computing where one can prove the existence of NNs with great approximation qualities, however, there does not exist any algorithm, even randomised, that can train (or compute) such a NN. For any positive integers $K > 2$ and $L$ , there are cases where simultaneously: (a) no randomised training algorithm can compute a NN correct to $K$ digits with probability greater than $1/2$ , (b) there exists a deterministic training algorithm that computes a NN with $K - 1$ correct digits, but any such (even randomised) algorithm needs arbitrarily…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Comp-Foundations-and-Barriers-of-AI/firenet
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.