Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics
Indrashis Das, Mahmoud Safari, Steven Adriaensen, Frank Hutter

TL;DR
The paper introduces Gompertz Linear Unit (GoLU), a novel activation function leveraging asymmetry to improve training dynamics and performance across various deep learning tasks.
Contribution
It proposes GoLU, a new self-gated activation function based on the Gompertz function, demonstrating improved variance reduction and gradient flow over existing functions.
Findings
GoLU outperforms GELU and Swish in multiple tasks.
GoLU reduces variance in latent space effectively.
Experiments cover diverse applications like image and language tasks.
Abstract
Activation functions are fundamental elements of deep learning architectures as they significantly influence training dynamics. ReLU, while widely used, is prone to the dying neuron problem, which has been mitigated by variants such as LeakyReLU, PReLU, and ELU that better handle negative neuron outputs. Recently, self-gated activations like GELU and Swish have emerged as state-of-the-art alternatives, leveraging their smoothness to ensure stable gradient flow and prevent neuron inactivity. In this work, we introduce the Gompertz Linear Unit (GoLU), a novel self-gated activation function defined as , where . The GoLU activation leverages the right-skewed asymmetry in the Gompertz function to reduce variance in the latent space more effectively compared to GELU and Swish, while preserving robust gradient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNeural Networks and Applications
Methods(FiLe@Against@Claim)How do I file a claim against Expedia? · Refunds@Expedia|||How do I get a full refund from Expedia? · Sigmoid Activation · Exponential Linear Unit · Gompertz Linear Unit · Diffusion · *Communicated@Fast*How Do I Communicate to Expedia? · Parameterized ReLU
