Ramanujan Bipartite Graph Products for Efficient Block Sparse Neural Networks
Dharma Teja Vooturi, Girish Varma, Kishore Kothapalli

TL;DR
This paper introduces RBGP, a framework for creating structured block sparse neural networks using Ramanujan bipartite graph products, leading to efficient GPU implementation and significant runtime improvements without sacrificing accuracy.
Contribution
The paper proposes a novel graph-theoretic approach to generate structured sparsity in neural networks, improving runtime efficiency and memory usage while maintaining high prediction accuracy.
Findings
Achieved 5-9x runtime speedup over unstructured sparsity.
Achieved 2-5x runtime speedup over block sparsity.
Maintained comparable accuracy on CIFAR dataset.
Abstract
Sparse neural networks are shown to give accurate predictions competitive to denser versions, while also minimizing the number of arithmetic operations performed. However current hardware like GPU's can only exploit structured sparsity patterns for better efficiency. Hence the run time of a sparse neural network may not correspond to the arithmetic operations required. In this work, we propose RBGP( Ramanujan Bipartite Graph Product) framework for generating structured multi level block sparse neural networks by using the theory of Graph products. We also propose to use products of Ramanujan graphs which gives the best connectivity for a given level of sparsity. This essentially ensures that the i.) the networks has the structured block sparsity for which runtime efficient algorithms exists ii.) the model gives high prediction accuracy, due to the better expressive power derived from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
