Computation on Sparse Neural Networks: an Inspiration for Future Hardware
Fei Sun, Minghai Qin, Tianyun Zhang, Liu Liu, Yen-Kuang Chen, Yuan Xie

TL;DR
This paper reviews the current state of sparse neural network computation, emphasizing algorithms, software, and hardware, and discusses how sparsity can improve model efficiency and accuracy in resource-constrained environments.
Contribution
It provides a comprehensive overview of sparse neural network computation, highlighting the importance of structure and weight sparsity, and suggests directions for future hardware and algorithm development.
Findings
Sparse structures can enhance model efficiency and accuracy.
Large, sparse models are more beneficial for complex problems.
New algorithms and hardware are needed for effective sparse model training.
Abstract
Neural network models are widely used in solving many challenging problems, such as computer vision, personalized recommendation, and natural language processing. Those models are very computationally intensive and reach the hardware limit of the existing server and IoT devices. Thus, finding better model architectures with much less amount of computation while maximally preserving the accuracy is a popular research topic. Among various mechanisms that aim to reduce the computation complexity, identifying the zero values in the model weights and in the activations to avoid computing them is a promising direction. In this paper, we summarize the current status of the research on the computation of sparse neural networks, from the perspective of the sparse algorithms, the software frameworks, and the hardware accelerations. We observe that the search for the sparse structure can be a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Stochastic Gradient Optimization Techniques · Machine Learning and ELM
