Learning a Sparse Neural Network using IHT

Saeed Damadi; Soroush Zolfaghari; Mahdi Rezaie; Jinglai Shen

arXiv:2404.18414·cs.LG·July 18, 2024

Learning a Sparse Neural Network using IHT

Saeed Damadi, Soroush Zolfaghari, Mahdi Rezaie, Jinglai Shen

PDF

Open Access

TL;DR

This paper explores the theoretical foundations of training sparse neural networks using iterative hard thresholding (IHT), validating conditions for convergence through experiments on a simple dataset.

Contribution

It provides a theoretical analysis of IHT convergence conditions in neural network training and validates these conditions experimentally.

Findings

01

IHT can effectively identify sparse solutions in neural networks.

02

Theoretical convergence conditions are applicable to neural network training.

03

Experimental validation confirms the practicality of the theoretical conditions.

Abstract

The core of a good model is in its ability to focus only on important information that reflects the basic patterns and consistencies, thus pulling out a clear, noise-free signal from the dataset. This necessitates using a simplified model defined by fewer parameters. The importance of theoretical foundations becomes clear in this context, as this paper relies on established results from the domain of advanced sparse optimization, particularly those addressing nonlinear differentiable functions. The need for such theoretical foundations is further highlighted by the trend that as computational power for training NNs increases, so does the complexity of the models in terms of a higher number of parameters. In practical scenarios, these large models are often simplified to more manageable versions with fewer parameters. Understanding why these simplified models with less number of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsFocus