GPU Acceleration of Sparse Neural Networks

Aavaas Gajurel; Sushil J. Louis; Frederick C Harris

arXiv:2005.04347·cs.DC·May 12, 2020·1 cites

GPU Acceleration of Sparse Neural Networks

Aavaas Gajurel, Sushil J. Louis, Frederick C Harris

PDF

Open Access

TL;DR

This paper demonstrates that GPU acceleration can significantly speed up the activation process of sparse and arbitrarily structured neural networks, enabling more efficient machine learning workflows.

Contribution

The paper introduces a GPU-based method for accelerating sparse neural network activation, including a preprocessing step for dependency grouping and parallel computation using CUDA.

Findings

01

Significant speedup in neural network activation using GPU

02

Effective parallelization for sparse and arbitrary structured networks

03

Applicable to neural network pruning and evolutionary strategies

Abstract

In this paper, we use graphics processing units(GPU) to accelerate sparse and arbitrary structured neural networks. Sparse networks have nodes in the network that are not fully connected with nodes in preceding and following layers, and arbitrary structure neural networks have different number of nodes in each layers. Sparse Neural networks with arbitrary structures are generally created in the processes like neural network pruning and evolutionary machine learning strategies. We show that we can gain significant speedup for full activation of such neural networks using graphical processing units. We do a prepossessing step to determine dependency groups for all the nodes in a network, and use that information to guide the progression of activation in the neural network. Then we compute activation for each nodes in its own separate thread in the GPU, which allows for massive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Neural Networks and Reservoir Computing · Advanced Memory and Neural Computing

MethodsPruning · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings