GPU Acceleration of Sparse Neural Networks
Aavaas Gajurel, Sushil J. Louis, Frederick C Harris

TL;DR
This paper demonstrates that GPU acceleration can significantly speed up the activation process of sparse and arbitrarily structured neural networks, enabling more efficient machine learning workflows.
Contribution
The paper introduces a GPU-based method for accelerating sparse neural network activation, including a preprocessing step for dependency grouping and parallel computation using CUDA.
Findings
Significant speedup in neural network activation using GPU
Effective parallelization for sparse and arbitrary structured networks
Applicable to neural network pruning and evolutionary strategies
Abstract
In this paper, we use graphics processing units(GPU) to accelerate sparse and arbitrary structured neural networks. Sparse networks have nodes in the network that are not fully connected with nodes in preceding and following layers, and arbitrary structure neural networks have different number of nodes in each layers. Sparse Neural networks with arbitrary structures are generally created in the processes like neural network pruning and evolutionary machine learning strategies. We show that we can gain significant speedup for full activation of such neural networks using graphical processing units. We do a prepossessing step to determine dependency groups for all the nodes in a network, and use that information to guide the progression of activation in the neural network. Then we compute activation for each nodes in its own separate thread in the GPU, which allows for massive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Neural Networks and Reservoir Computing · Advanced Memory and Neural Computing
MethodsPruning · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
