ProbAct: A Probabilistic Activation Function for Deep Neural Networks
Kumar Shridhar, Joonho Lee, Hideaki Hayashi, Purvanshi Mehta, Brian, Kenji Iwana, Seokjun Kang, Seiichi Uchida, Sheraz Ahmed, Andreas Dengel

TL;DR
This paper introduces ProbAct, a novel stochastic activation function for deep neural networks that improves classification accuracy, models uncertainty, and enhances robustness by sampling from a learned distribution during training.
Contribution
The paper proposes ProbAct, a probabilistic activation function that incorporates trainable mean and variance, acting as a form of feature augmentation and model ensemble within neural networks.
Findings
ProbAct improves classification accuracy by 2-3% over ReLU on multiple datasets.
ProbAct enables uncertainty estimation and robustness to noisy inputs.
ProbAct functions as an implicit ensemble of models through its stochastic nature.
Abstract
Activation functions play an important role in training artificial neural networks. The majority of currently used activation functions are deterministic in nature, with their fixed input-output relationship. In this work, we propose a novel probabilistic activation function, called ProbAct. ProbAct is decomposed into a mean and variance and the output value is sampled from the formed distribution, making ProbAct a stochastic activation function. The values of mean and variances can be fixed using known functions or trained for each element. In the trainable ProbAct, the mean and the variance of the activation distribution is trained within the back-propagation framework alongside other parameters. We show that the stochastic perturbation induced through ProbAct acts as a viable generalization technique for feature augmentation. In our experiments, we compare ProbAct with well-known…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning and Data Classification · Advanced Neural Network Applications
Methods*Communicated@Fast*How Do I Communicate to Expedia?
