Evolution of Activation Functions: An Empirical Investigation
Andrew Nader, Danielle Azar

TL;DR
This paper introduces an evolutionary algorithm to automatically discover new activation functions for neural networks, demonstrating improved performance over existing functions across multiple datasets and architectures.
Contribution
It presents a novel evolutionary approach to automate the search for activation functions, a task traditionally done manually or with predefined options.
Findings
Evolved activation functions outperform standard functions in experiments.
The approach is statistically robust across 10 datasets and 30 runs.
New activation functions show promising results for neural network performance.
Abstract
The hyper-parameters of a neural network are traditionally designed through a time consuming process of trial and error that requires substantial expert knowledge. Neural Architecture Search (NAS) algorithms aim to take the human out of the loop by automatically finding a good set of hyper-parameters for the problem at hand. These algorithms have mostly focused on hyper-parameters such as the architectural configurations of the hidden layers and the connectivity of the hidden neurons, but there has been relatively little work on automating the search for completely new activation functions, which are one of the most crucial hyper-parameters to choose. There are some widely used activation functions nowadays which are simple and work well, but nonetheless, there has been some interest in finding better activation functions. The work in the literature has mostly focused on designing new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
