Stochastic Adaptive Activation Function
Kyungsu Lee, Jaeseung Yang, Haeyun Lee, and Jae Youn Hwang

TL;DR
This paper introduces Adaptive Swish (ASH), a novel trainable activation function that models neuron variability, improves deep learning performance across tasks, and offers a generalized mathematical framework over existing functions.
Contribution
The study proposes ASH, a flexible, context-aware activation function that generalizes Swish and enhances deep learning models' accuracy and convergence.
Findings
ASH improves prediction accuracy across various tasks.
ASH enables earlier convergence in training.
ASH demonstrates robustness in multiple deep learning models.
Abstract
The simulation of human neurons and neurotransmission mechanisms has been realized in deep neural networks based on the theoretical implementations of activation functions. However, recent studies have reported that the threshold potential of neurons exhibits different values according to the locations and types of individual neurons, and that the activation functions have limitations in terms of representing this variability. Therefore, this study proposes a simple yet effective activation function that facilitates different thresholds and adaptive activations according to the positions of units and the contexts of inputs. Furthermore, the proposed activation function mathematically exhibits a more generalized form of Swish activation function, and thus we denoted it as Adaptive SwisH (ASH). ASH highlights informative features that exhibit large values in the top percentiles in an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNeural Networks and Applications · Advanced Neural Network Applications · Machine Learning and Data Classification
MethodsSigmoid Activation
