Activation Functions: Do They Represent A Trade-Off Between Modular Nature of Neural Networks And Task Performance
Himanshu Pradeep Aswani, Amit Sethi

TL;DR
This paper investigates whether ReLU activation functions optimize the balance between neural network modularity and task performance, questioning if alternative functions could enhance modularity without sacrificing efficiency.
Contribution
It explores the potential trade-offs between activation functions, particularly ReLU, in terms of modularity and overall neural network performance.
Findings
ReLU accelerates training convergence.
Trade-offs exist between modularity and performance.
Alternative activation functions may improve modularity.
Abstract
Current research suggests that the key factors in designing neural network architectures involve choosing number of filters for every convolution layer, number of hidden neurons for every fully connected layer, dropout and pruning. The default activation function in most cases is the ReLU, as it has empirically shown faster training convergence. We explore whether ReLU is the best choice if one is aiming to desire better modularity structure within a neural network.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
MethodsConvolution · Dropout · *Communicated@Fast*How Do I Communicate to Expedia?
