Mining Generalizable Activation Functions
Alex Vitvitskyi, Michael Boratko, Matej Grcic, Razvan Pascanu, Deep Shah, Petar Veli\v{c}kovi\'c

TL;DR
This paper explores using evolutionary search, specifically AlphaEvolve, to discover new activation functions that improve neural network performance and encode inductive biases, leveraging large language models and flexible search spaces.
Contribution
It introduces a novel framework using AlphaEvolve for discovering activation functions over broad search spaces, including functions encoding inductive biases, with empirical validation on synthetic datasets.
Findings
AlphaEvolve can find meaningful activation functions within synthetic datasets.
Flexible search spaces enable discovering functions beyond manually designed ones.
Performance on out-of-distribution data can guide activation function discovery.
Abstract
The choice of activation function is an active area of research, with different proposals aimed at improving optimization, while maintaining expressivity. Additionally, the activation function can significantly alter the implicit inductive bias of the architecture, controlling its non-linear behavior. In this paper, in line with previous work, we argue that evolutionary search provides a useful framework for finding new activation functions, while we also make two novel observations. The first is that modern pipelines, such as AlphaEvolve, which relies on frontier LLMs as a mutator operator, allows for a much wider and flexible search space; e.g., over all possible python functions within a certain FLOP budget, eliminating the need for manually constructed search spaces. In addition, these pipelines will be biased towards meaningful activation functions, given their ability to represent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Machine Learning and Data Classification · Evolutionary Algorithms and Applications
