Learning Activation Functions for Sparse Neural Networks
Mohammad Loni, Aditya Mohan, Mehdi Asadi, Marius Lindauer

TL;DR
This paper introduces SAFS, a method for optimizing activation functions and hyperparameters in sparse neural networks, significantly improving accuracy at high pruning ratios across multiple models and datasets.
Contribution
It proposes a novel combined approach for tuning activation functions and hyperparameters specifically for sparse neural networks, addressing accuracy drops.
Findings
SAFS improves accuracy by up to 15.53% on LeNet-5.
SAFS achieves up to 8.88% improvement on VGG-16.
SAFS enhances ResNet-18 accuracy by 6.33% at high pruning ratios.
Abstract
Sparse Neural Networks (SNNs) can potentially demonstrate similar performance to their dense counterparts while saving significant energy and memory at inference. However, the accuracy drop incurred by SNNs, especially at high pruning ratios, can be an issue in critical deployment conditions. While recent works mitigate this issue through sophisticated pruning techniques, we shift our focus to an overlooked factor: hyperparameters and activation functions. Our analyses have shown that the accuracy drop can additionally be attributed to (i) Using ReLU as the default choice for activation functions unanimously, and (ii) Fine-tuning SNNs with the same hyperparameters as dense counterparts. Thus, we focus on learning a novel way to tune activation functions for sparse networks and combining these with a separate hyperparameter optimization (HPO) regime for sparse networks. By conducting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM
MethodsPruning
