Efficient and Sparse Neural Networks by Pruning Weights in a Multiobjective Learning Approach
Malena Reiners, Kathrin Klamroth, Michael Stiglmayr

TL;DR
This paper introduces a multiobjective optimization framework for neural network training that balances accuracy and complexity, enabling effective pruning with minimal accuracy loss and reduced computational costs.
Contribution
It proposes a novel multiobjective approach combining scalarization and stochastic multi-gradient descent for neural network pruning and complexity reduction.
Findings
Significant network complexity reduction with negligible accuracy loss.
Effective Pareto optimal solutions identified through multiobjective optimization.
Intra-training pruning enhances efficiency with minimal additional computational cost.
Abstract
Overparameterization and overfitting are common concerns when designing and training deep neural networks, that are often counteracted by pruning and regularization strategies. However, these strategies remain secondary to most learning approaches and suffer from time and computational intensive procedures. We suggest a multiobjective perspective on the training of neural networks by treating its prediction accuracy and the network complexity as two individual objective functions in a biobjective optimization problem. As a showcase example, we use the cross entropy as a measure of the prediction accuracy while adopting an l1-penalty function to assess the total cost (or complexity) of the network parameters. The latter is combined with an intra-training pruning approach that reinforces complexity reduction and requires only marginal extra computational cost. From the perspective of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Neural Networks and Applications · Machine Learning and Data Classification
MethodsPruning
