Approximation Capabilities of Feedforward Neural Networks with GELU Activations

Konstantin Yakovlev; Nikita Puchkin

arXiv:2512.21749·cs.LG·December 29, 2025

Approximation Capabilities of Feedforward Neural Networks with GELU Activations

Konstantin Yakovlev, Nikita Puchkin

PDF

Open Access

TL;DR

This paper establishes error bounds for feedforward neural networks with GELU activations, demonstrating their ability to approximate a range of functions and their derivatives with controlled accuracy and network complexity.

Contribution

It provides the first comprehensive error bounds for GELU-activated networks approximating functions and derivatives, including elementary functions like polynomials, exponential, and reciprocal.

Findings

01

Error bounds hold for functions and derivatives up to any order.

02

Networks can approximate elementary functions with bounded derivatives.

03

Analysis includes network size, weight magnitudes, and behavior at infinity.

Abstract

We derive an approximation error bound that holds simultaneously for a function and all its derivatives up to any prescribed order. The bounds apply to elementary functions, including multivariate polynomials, the exponential function, and the reciprocal function, and are obtained using feedforward neural networks with the Gaussian Error Linear Unit (GELU) activation. In addition, we report the network size, weight magnitudes, and behavior at infinity. Our analysis begins with a constructive approximation of multiplication, where we prove the simultaneous validity of error bounds over domains of increasing size for a given approximator. Leveraging this result, we obtain approximation guarantees for division and the exponential function, ensuring that all higher-order derivatives of the resulting approximators remain globally bounded.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Machine Learning and ELM