Adaptive Parametric Activation: Unifying and Generalising Activation Functions Across Tasks
Konstantinos Panagiotis Alexandridis, Jiankang Deng, Anh Nguyen, Shan Luo

TL;DR
This paper introduces the Adaptive Parametric Activation (APA), a versatile activation function that unifies and generalizes many existing functions, improving performance across diverse tasks and imbalanced datasets.
Contribution
The paper proposes the APA function, a novel unified activation that adapts to data distribution, outperforming existing functions on multiple benchmarks and tasks.
Findings
APA significantly outperforms state-of-the-art on imbalanced benchmarks
Aligning activation functions with data improves model performance
APA enhances various tasks including classification, detection, and image generation
Abstract
The activation function plays a crucial role in model optimisation, yet the optimal choice remains unclear. For example, the Sigmoid activation is the de-facto activation in balanced classification tasks, however, in imbalanced classification, it proves inappropriate due to bias towards frequent classes. In this work, we delve deeper in this phenomenon by performing a comprehensive statistical analysis in the classification and intermediate layers of both balanced and imbalanced networks and we empirically show that aligning the activation function with the data distribution, enhances the performance in both balanced and imbalanced tasks. To this end, we propose the Adaptive Parametric Activation (APA) function, a novel and versatile activation function that unifies most common activation functions under a single formula. APA can be applied in both intermediate layers and attention…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Measurement and Metrology Techniques
MethodsSoftmax · Attention Is All You Need · Adaptive Generalised Linear Unit · Sigmoid Activation · Adaptive Pseudo Augmentation
