Compact Multi-level Sparse Neural Networks with Input Independent Dynamic Rerouting
Minghai Qin, Tianyun Zhang, Fei Sun, Yen-Kuang Chen, Makan Fardad,, Yanzhi Wang, Yuan Xie

TL;DR
This paper introduces a hierarchical sparse neural network framework enabling dynamic adjustment of sparsity levels during inference, significantly reducing computation and storage while maintaining accuracy.
Contribution
It proposes a novel multi-level sparse neural network training method that supports input-independent dynamic sparsity adjustment, improving efficiency and adaptability in resource-constrained environments.
Findings
Achieved 13.38% weights and 14.97% FLOPs with comparable accuracy to dense models.
Generated more-sparse sub-models with only 3.25% accuracy loss.
Validated on various models including ResNet-50, PointNet++, GNMT, and graph attention networks.
Abstract
Deep neural networks (DNNs) have shown to provide superb performance in many real life applications, but their large computation cost and storage requirement have prevented them from being deployed to many edge and internet-of-things (IoT) devices. Sparse deep neural networks, whose majority weight parameters are zeros, can substantially reduce the computation complexity and memory consumption of the models. In real-use scenarios, devices may suffer from large fluctuations of the available computation and memory resources under different environment, and the quality of service (QoS) is difficult to maintain due to the long tail inferences with large latency. Facing the real-life challenges, we propose to train a sparse model that supports multiple sparse levels. That is, a hierarchical structure of weights are satisfied such that the locations and the values of the non-zero parameters…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Graph Neural Networks · Machine Learning and ELM
Methodstravel james
