Complexity of Neural Network Training and ETR: Extensions with Effectively Continuous Functions
Teemu Hankala, Miika Hannula, Juha Kontinen, Jonni Virtema

TL;DR
This paper investigates the computational complexity of training neural networks with various activation functions, establishing links to the existential theory of the reals and highlighting decidability issues for sigmoid and sinusoidal functions.
Contribution
It extends the understanding of neural network training complexity to effectively continuous functions, connecting it to the existential theory of the reals with exponential functions.
Findings
Training with sigmoid functions relates to the existential theory of reals with exponentials.
Training with sinusoidal functions is undecidable.
Upper bounds are provided within the arithmetical hierarchy.
Abstract
We study the complexity of the problem of training neural networks defined via various activation functions. The training problem is known to be existsR-complete with respect to linear activation functions and the ReLU activation function. We consider the complexity of the problem with respect to the sigmoid activation function and other effectively continuous functions. We show that these training problems are polynomial-time many-one bireducible to the existential theory of the reals extended with the corresponding activation functions. In particular, we establish that the sigmoid activation function leads to the existential theory of the reals with the exponential function. It is thus open, and equivalent with the decidability of the existential theory of the reals with the exponential function, whether training neural networks using the sigmoid activation function is algorithmically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Machine Learning and Algorithms · Computability, Logic, AI Algorithms
MethodsSigmoid Activation
