NeuroFabric: Identifying Ideal Topologies for Training A Priori Sparse Networks
Mihailo Isakov, Michel A. Kinsy

TL;DR
This paper introduces a theoretical framework for selecting optimal sparse network topologies, proposing a new initialization scheme and a data-free heuristic to identify the most effective sparse structures for training deep neural networks efficiently.
Contribution
It provides a novel theoretical foundation and a heuristic method for choosing ideal intra-layer sparse topologies, improving training efficiency and accuracy.
Findings
Certain topologies significantly outperform others in accuracy.
A new initialization scheme enables exploration of very deep sparse networks.
A data-free heuristic effectively evaluates topologies independently of datasets.
Abstract
Long training times of deep neural networks are a bottleneck in machine learning research. The major impediment to fast training is the quadratic growth of both memory and compute requirements of dense and convolutional layers with respect to their information bandwidth. Recently, training `a priori' sparse networks has been proposed as a method for allowing layers to retain high information bandwidth, while keeping memory and compute low. However, the choice of which sparse topology should be used in these networks is unclear. In this work, we provide a theoretical foundation for the choice of intra-layer topology. First, we derive a new sparse neural network initialization scheme that allows us to explore the space of very deep sparse networks. Next, we evaluate several topologies and show that seemingly similar topologies can often have a large difference in attainable accuracy. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Computational Physics and Python Applications · Advanced Data Processing Techniques
