Activation Functions: Do They Represent A Trade-Off Between Modular   Nature of Neural Networks And Task Performance

Himanshu Pradeep Aswani; Amit Sethi

arXiv:2009.07793·cs.LG·September 17, 2020

Activation Functions: Do They Represent A Trade-Off Between Modular Nature of Neural Networks And Task Performance

Himanshu Pradeep Aswani, Amit Sethi

PDF

Open Access 1 Repo

TL;DR

This paper investigates whether ReLU activation functions optimize the balance between neural network modularity and task performance, questioning if alternative functions could enhance modularity without sacrificing efficiency.

Contribution

It explores the potential trade-offs between activation functions, particularly ReLU, in terms of modularity and overall neural network performance.

Findings

01

ReLU accelerates training convergence.

02

Trade-offs exist between modularity and performance.

03

Alternative activation functions may improve modularity.

Abstract

Current research suggests that the key factors in designing neural network architectures involve choosing number of filters for every convolution layer, number of hidden neurons for every fully connected layer, dropout and pruning. The default activation function in most cases is the ReLU, as it has empirically shown faster training convergence. We explore whether ReLU is the best choice if one is aiming to desire better modularity structure within a neural network.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

himanshuaswani/ActivationFunctionsPaperImplementation
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning

MethodsConvolution · Dropout · *Communicated@Fast*How Do I Communicate to Expedia?