Deep Learning Activation Functions: Fixed-Shape, Parametric, Adaptive, Stochastic, Miscellaneous, Non-Standard, Ensemble
M. M. Hammad

TL;DR
This paper provides a comprehensive review and classification of various activation functions used in deep learning, analyzing their properties, benefits, limitations, and performance through experimental evaluation.
Contribution
It introduces a systematic taxonomy of activation functions, compares 12 state-of-the-art AFs, and discusses strategies for combining multiple AFs to improve deep learning models.
Findings
Comparative evaluation of 12 activation functions shows varying performance across tasks.
Non-standard and ensemble AFs can offer enhanced adaptability and accuracy.
Systematic classification aids in selecting appropriate AFs for specific applications.
Abstract
In the architecture of deep learning models, inspired by biological neurons, activation functions (AFs) play a pivotal role. They significantly influence the performance of artificial neural networks. By modulating the non-linear properties essential for learning complex patterns, AFs are fundamental in both classification and regression tasks. This paper presents a comprehensive review of various types of AFs, including fixed-shape, parametric, adaptive, stochastic/probabilistic, non-standard, and ensemble/combining types. We begin with a systematic taxonomy and detailed classification frameworks that delineates the principal characteristics of AFs and organizes them based on their structural and functional distinctions. Our in-depth analysis covers primary groups such as sigmoid-based, ReLU-based, and ELU-based AFs, discussing their theoretical foundations, mathematical formulations,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
