An Analysis of State-of-the-art Activation Functions For Supervised Deep   Neural Network

Anh Nguyen; Khoa Pham; Dat Ngo; Thanh Ngo; Lam Pham

arXiv:2104.02523·cs.LG·April 7, 2021

An Analysis of State-of-the-art Activation Functions For Supervised Deep Neural Network

Anh Nguyen, Khoa Pham, Dat Ngo, Thanh Ngo, Lam Pham

PDF

Open Access

TL;DR

This paper analyzes various state-of-the-art activation functions in deep neural networks, evaluating their performance across different architectures and datasets for supervised classification tasks.

Contribution

It provides a comparative analysis of multiple activation functions using two different deep learning architectures and datasets.

Findings

01

ReLU performs well on MNIST with MLP.

02

ELU and SELU show advantages in certain architectures.

03

Activation function choice impacts performance across datasets.

Abstract

This paper provides an analysis of state-of-the-art activation functions with respect to supervised classification of deep neural network. These activation functions comprise of Rectified Linear Units (ReLU), Exponential Linear Unit (ELU), Scaled Exponential Linear Unit (SELU), Gaussian Error Linear Unit (GELU), and the Inverse Square Root Linear Unit (ISRLU). To evaluate, experiments over two deep learning network architectures integrating these activation functions are conducted. The first model, basing on Multilayer Perceptron (MLP), is evaluated with MNIST dataset to perform these activation functions. Meanwhile, the second model, likely VGGish-based architecture, is applied for Acoustic Scene Classification (ASC) Task 1A in DCASE 2018 challenge, thus evaluate whether these activation functions work well in different datasets as well as different network architectures.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech and Audio Processing · Speech Recognition and Synthesis