ClosNets: a Priori Sparse Topologies for Faster DNN Training
Mihailo Isakov, Michel A. Kinsy

TL;DR
ClosNets introduce a novel sparse topology for fully-connected layers in DNNs, enabling smaller, faster training without accuracy loss by using pre-selected sparse matrices, thus reducing memory and computation requirements.
Contribution
This work proposes a new fully-connected layer design with a priori sparse topologies, allowing training of smaller networks with reduced memory and computation without sacrificing accuracy.
Findings
ClosNets reduce dense layer sizes by up to an order of magnitude.
They achieve significant training speedups due to smaller network size.
High path diversity and shallowness contribute to performance.
Abstract
Fully-connected layers in deep neural networks (DNN) are often the throughput and power bottleneck during training. This is due to their large size and low data reuse. Pruning dense layers can significantly reduce the size of these networks, but this approach can only be applied after training. In this work we propose a novel fully-connected layer that reduces the memory requirements of DNNs without sacrificing accuracy. We replace a dense matrix with products of sparse matrices whose topologies we pick in advance. This allows us to: (1) train significantly smaller networks without a loss in accuracy, and (2) store the network weights without having to store connection indices. We therefore achieve significant training speedups due to the smaller network size, and a reduced amount of computation per epoch. We tested several sparse layer topologies and found that Clos networks perform…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Parallel Computing and Optimization Techniques
MethodsPruning
