Learning a Sparse Neural Network using IHT
Saeed Damadi, Soroush Zolfaghari, Mahdi Rezaie, Jinglai Shen

TL;DR
This paper explores the theoretical foundations of training sparse neural networks using iterative hard thresholding (IHT), validating conditions for convergence through experiments on a simple dataset.
Contribution
It provides a theoretical analysis of IHT convergence conditions in neural network training and validates these conditions experimentally.
Findings
IHT can effectively identify sparse solutions in neural networks.
Theoretical convergence conditions are applicable to neural network training.
Experimental validation confirms the practicality of the theoretical conditions.
Abstract
The core of a good model is in its ability to focus only on important information that reflects the basic patterns and consistencies, thus pulling out a clear, noise-free signal from the dataset. This necessitates using a simplified model defined by fewer parameters. The importance of theoretical foundations becomes clear in this context, as this paper relies on established results from the domain of advanced sparse optimization, particularly those addressing nonlinear differentiable functions. The need for such theoretical foundations is further highlighted by the trend that as computational power for training NNs increases, so does the complexity of the models in terms of a higher number of parameters. In practical scenarios, these large models are often simplified to more manageable versions with fewer parameters. Understanding why these simplified models with less number of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsFocus
