Ramanujan Bipartite Graph Products for Efficient Block Sparse Neural   Networks

Dharma Teja Vooturi; Girish Varma; Kishore Kothapalli

arXiv:2006.13486·cs.LG·July 3, 2020

Ramanujan Bipartite Graph Products for Efficient Block Sparse Neural Networks

Dharma Teja Vooturi, Girish Varma, Kishore Kothapalli

PDF

TL;DR

This paper introduces RBGP, a framework for creating structured block sparse neural networks using Ramanujan bipartite graph products, leading to efficient GPU implementation and significant runtime improvements without sacrificing accuracy.

Contribution

The paper proposes a novel graph-theoretic approach to generate structured sparsity in neural networks, improving runtime efficiency and memory usage while maintaining high prediction accuracy.

Findings

01

Achieved 5-9x runtime speedup over unstructured sparsity.

02

Achieved 2-5x runtime speedup over block sparsity.

03

Maintained comparable accuracy on CIFAR dataset.

Abstract

Sparse neural networks are shown to give accurate predictions competitive to denser versions, while also minimizing the number of arithmetic operations performed. However current hardware like GPU's can only exploit structured sparsity patterns for better efficiency. Hence the run time of a sparse neural network may not correspond to the arithmetic operations required. In this work, we propose RBGP( Ramanujan Bipartite Graph Product) framework for generating structured multi level block sparse neural networks by using the theory of Graph products. We also propose to use products of Ramanujan graphs which gives the best connectivity for a given level of sparsity. This essentially ensures that the i.) the networks has the structured block sparsity for which runtime efficient algorithms exists ii.) the model gives high prediction accuracy, due to the better expressive power derived from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.