Flatten-T Swish: a thresholded ReLU-Swish-like activation function for   deep learning

Hock Hung Chieng; Noorhaniza Wahid; Pauline Ong; Sai Raj Kishore; Perla

arXiv:1812.06247·cs.NE·December 18, 2018

Flatten-T Swish: a thresholded ReLU-Swish-like activation function for deep learning

Hock Hung Chieng, Noorhaniza Wahid, Pauline Ong, Sai Raj Kishore, Perla

PDF

2 Repos

TL;DR

This paper introduces Flatten-T Swish, a new activation function that incorporates negative values to improve deep neural network performance, demonstrating better accuracy and faster convergence than ReLU on MNIST classification tasks.

Contribution

The paper proposes Flatten-T Swish, a novel activation function that leverages negative values, and empirically evaluates its superior performance over ReLU in deep neural networks.

Findings

01

FTS with T=-0.20 achieves the best overall performance.

02

FTS improves MNIST accuracy by up to 1.15% over ReLU.

03

FTS converges twice as fast as ReLU.

Abstract

Activation functions are essential for deep learning methods to learn and perform complex tasks such as image classification. Rectified Linear Unit (ReLU) has been widely used and become the default activation function across the deep learning community since 2012. Although ReLU has been popular, however, the hard zero property of the ReLU has heavily hindered the negative values from propagating through the network. Consequently, the deep neural network has not been benefited from the negative representations. In this work, an activation function called Flatten-T Swish (FTS) that leverage the benefit of the negative values is proposed. To verify its performance, this study evaluates FTS with ReLU and several recent activation functions. Each activation function is trained using MNIST dataset on five different deep fully connected neural networks (DFNNs) with depth vary from five to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · (FiLe@Against@Claim)How do I file a claim against Expedia? · *Communicated@Fast*How Do I Communicate to Expedia?