IGLU: The Integrated Gaussian Linear Unit Activation Function
Mingi Kang, Zai Yang, Jeova Farias Sales Rocha Neto

TL;DR
This paper introduces IGLU, a new parametric activation function based on a Cauchy gate, which offers improved robustness and performance in deep neural networks across vision and language tasks.
Contribution
IGLU is a novel activation function derived as a scale mixture of GELU gates under a half-normal mixing distribution. The resulting gate is exactly the Cauchy CDF, giving a one-parameter family that continuously interpolates between identity-like and ReLU-like behavior.
Findings
IGLU outperforms ReLU and GELU on CIFAR and WikiText-103 datasets.
IGLU-Approx achieves similar performance with lower computational cost.
Heavy-tailed gating in IGLU improves results on imbalanced datasets.
Abstract
Activation functions are fundamental to deep neural networks, governing gradient flow, optimization stability, and representational capacity. While ReLU has been the dominant activation function in earlier deep architectures, modern transformer-based models are increasingly adopting smoother alternatives such as GELU and other self-gated functions. Despite their empirical success, the mathematical relationships among these functions and the principles underlying their effectiveness remain only partially understood. We introduce IGLU, a parametric activation function derived as a scale mixture of GELU gates under a half-normal mixing distribution. This derivation yields a closed-form expression whose gating component is exactly the Cauchy CDF, providing a principled one-parameter family that continuously interpolates between identity-like and ReLU-like behavior…
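The closed form described in the abstract can be sketched numerically. The snippet below is a minimal illustration, not the authors' implementation. It assumes the activation has the usual self-gated form x · gate(x), with the gate taken to be the Cauchy CDF at location 0 and scale sigma (the name sigma and the function names are hypothetical). It also assumes one particular reading of the mixture: the GELU gate Φ(x · w / sigma) is averaged over a half-normal weight w, which follows from the fact that the ratio of two independent standard normals is Cauchy distributed. The Monte Carlo average should then agree with the closed-form gate.

```python
import math
import random


def cauchy_cdf(x, sigma=1.0):
    """CDF of a Cauchy distribution with location 0 and scale sigma."""
    return 0.5 + math.atan(x / sigma) / math.pi


def iglu(x, sigma=1.0):
    """Assumed IGLU form: the input multiplied by a Cauchy-CDF gate."""
    return x * cauchy_cdf(x, sigma)


def normal_cdf(z):
    """Standard normal CDF, i.e. the GELU gate."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def mixture_gate(x, sigma=1.0, n_samples=100_000, seed=0):
    """Monte Carlo average of GELU gates over a half-normal weight w.

    Assumption: the mixing variable multiplies the input (an inverse scale).
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        w = abs(rng.gauss(0.0, 1.0))  # half-normal sample
        total += normal_cdf(x * w / sigma)
    return total / n_samples


if __name__ == "__main__":
    for x in (-2.0, 0.5, 1.0):
        print(x, cauchy_cdf(x), mixture_gate(x))
    # Small sigma: the gate approaches a step function (ReLU-like).
    print(iglu(2.0, sigma=1e-6), iglu(-2.0, sigma=1e-6))
    # Large sigma: the gate flattens toward 1/2, so the map is nearly linear.
    print(iglu(2.0, sigma=1e6))
```

Under this parameterization, a very small sigma makes the function behave like ReLU, while a very large sigma gives an almost linear map (x/2 here, so identity-like only up to a constant factor). How the paper normalizes or learns the parameter is not stated in the excerpt above.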
Taxonomy
Topics: Adversarial Robustness in Machine Learning · Advanced Neural Network Applications · Explainable Artificial Intelligence (XAI)
