On the Mathematical Impossibility of Safe Universal Approximators
Jasper Yao

TL;DR
This paper proves that any useful universal approximator inherently contains dense instabilities making perfect safety impossible, fundamentally challenging the pursuit of fully controllable AI systems.
Contribution
It establishes a rigorous mathematical proof that universal approximation systems cannot be both useful and perfectly safe due to unavoidable instabilities.
Findings
Catastrophic failure points are dense in practical neural networks.
Universal approximators must include dense singularities, leading to instabilities.
Adversarial examples are empirical evidence of inherent system instabilities.
Abstract
We establish fundamental mathematical limits on universal approximation theorem (UAT) system alignment by proving that catastrophic failures are an inescapable feature of any useful computational system. Our central thesis is that for any universal approximator, the expressive power required for useful computation is inextricably linked to a dense set of instabilities that make perfect, reliable control a mathematical impossibility. We prove this through a three-level argument that leaves no escape routes for any class of universal approximator architecture. i) Combinatorial Necessity: For the vast majority of practical universal approximators (e.g., those using ReLU activations), we prove that the density of catastrophic failure points is directly proportional to the network's expressive power. ii) Topological Necessity: For any theoretical universal approximator, we use singularity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
