Learning Activation Functions for Sparse Neural Networks

Mohammad Loni; Aditya Mohan; Mehdi Asadi; Marius Lindauer

arXiv:2305.10964·cs.LG·June 6, 2023·5 cites

Learning Activation Functions for Sparse Neural Networks

Mohammad Loni, Aditya Mohan, Mehdi Asadi, Marius Lindauer

PDF

Open Access 1 Repo

TL;DR

This paper introduces SAFS, a method for optimizing activation functions and hyperparameters in sparse neural networks, significantly improving accuracy at high pruning ratios across multiple models and datasets.

Contribution

It proposes a novel combined approach for tuning activation functions and hyperparameters specifically for sparse neural networks, addressing accuracy drops.

Findings

01

SAFS improves accuracy by up to 15.53% on LeNet-5.

02

SAFS achieves up to 8.88% improvement on VGG-16.

03

SAFS enhances ResNet-18 accuracy by 6.33% at high pruning ratios.

Abstract

Sparse Neural Networks (SNNs) can potentially demonstrate similar performance to their dense counterparts while saving significant energy and memory at inference. However, the accuracy drop incurred by SNNs, especially at high pruning ratios, can be an issue in critical deployment conditions. While recent works mitigate this issue through sophisticated pruning techniques, we shift our focus to an overlooked factor: hyperparameters and activation functions. Our analyses have shown that the accuracy drop can additionally be attributed to (i) Using ReLU as the default choice for activation functions unanimously, and (ii) Fine-tuning SNNs with the same hyperparameters as dense counterparts. Thus, we focus on learning a novel way to tune activation functions for sparse networks and combining these with a separate hyperparameter optimization (HPO) regime for sparse networks. By conducting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

automl/safs
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM

MethodsPruning