Learning Activation Functions: A new paradigm for understanding Neural Networks
Mohit Goyal, Rajan Goyal, Brejesh Lall

TL;DR
This paper introduces Self-Learnable Activation Functions (SLAF) that adapt during training to better understand neural networks, offering a flexible, mathematically grounded approach that can approximate existing activations and improve performance.
Contribution
The paper proposes a novel, generic form of learnable activation functions (SLAF) that can approximate all continuous activations and enhance neural network performance with minimal additional complexity.
Findings
SLAF can approximate any Lipschitz continuous activation function.
SLNNs with SLAF can be represented as finite-degree polynomials.
Using SLAF with standard activations improves performance with few extra parameters.
Abstract
The scope of research in the domain of activation functions remains limited and centered around improving the ease of optimization or generalization quality of neural networks (NNs). However, to develop a deeper understanding of deep learning, it becomes important to look at the non linear component of NNs more carefully. In this paper, we aim to provide a generic form of activation function along with appropriate mathematical grounding so as to allow for insights into the working of NNs in future. We propose "Self-Learnable Activation Functions" (SLAF), which are learned during training and are capable of approximating most of the existing activation functions. SLAF is given as a weighted sum of pre-defined basis elements which can serve for a good approximation of the optimal activation function. The coefficients for these basis elements allow a search in the entire space of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Neural Network Applications · Machine Learning and ELM
