ErfAct and Pserf: Non-monotonic Smooth Trainable Activation Functions

Koushik Biswas; Sandeep Kumar; Shilpak Banerjee; Ashish Kumar Pandey

arXiv:2109.04386·cs.NE·March 25, 2022

ErfAct and Pserf: Non-monotonic Smooth Trainable Activation Functions

Koushik Biswas, Sandeep Kumar, Shilpak Banerjee, Ashish Kumar Pandey

PDF

Open Access 1 Video

TL;DR

This paper introduces two novel non-monotonic smooth trainable activation functions, ErfAct and Pserf, which significantly enhance neural network performance across various models and datasets compared to traditional activations.

Contribution

The paper proposes ErfAct and Pserf, two new non-monotonic smooth trainable activation functions that outperform existing functions like ReLU, Swish, and Mish in multiple neural network benchmarks.

Findings

01

ErfAct and Pserf improve top-1 accuracy on CIFAR datasets by over 5%.

02

They enhance mean average precision on SSD300 by 1%.

03

The proposed functions outperform ReLU, Swish, and Mish in experiments.

Abstract

An activation function is a crucial component of a neural network that introduces non-linearity in the network. The state-of-the-art performance of a neural network depends also on the perfect choice of an activation function. We propose two novel non-monotonic smooth trainable activation functions, called ErfAct and Pserf. Experiments suggest that the proposed functions improve the network performance significantly compared to the widely used activations like ReLU, Swish, and Mish. Replacing ReLU by ErfAct and Pserf, we have 5.68% and 5.42% improvement for top-1 accuracy on Shufflenet V2 (2.0x) network in CIFAR100 dataset, 2.11% and 1.96% improvement for top-1 accuracy on Shufflenet V2 (2.0x) network in CIFAR10 dataset, 1.0%, and 1.0% improvement on mean average precision (mAP) on SSD300 model in Pascal VOC dataset.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

ErfAct and Pserf: Non-Monotonic Smooth Trainable Activation Functions· underline

Taxonomy

TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Neural Networks and Applications

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Grouped Convolution · Channel Shuffle · Groupwise Point Convolution · Batch Normalization · Pointwise Convolution · Average Pooling · Depthwise Convolution · Residual Connection · ShuffleNet V2 Downsampling Block