On the training of sparse and dense deep neural networks: less parameters, same performance
Lorenzo Chicchi, Lorenzo Giambagli, Lorenzo Buffoni, Timoteo Carletti,, Marco Ciavarella, Duccio Fanelli

TL;DR
This paper introduces a spectral learning method for deep neural networks that reduces parameters and achieves performance close to traditional training, enabling the creation of highly sparse networks with strong classification capabilities.
Contribution
The authors propose a spectral training approach using two sets of eigenvalues, improving performance and sparsity compared to previous spectral methods and reducing computational costs.
Findings
Spectral training achieves near-conventional performance with fewer parameters.
The method enables the creation of highly sparse neural networks.
Spectral parameters influence node contribution and excitability effectively.
Abstract
Deep neural networks can be trained in reciprocal space, by acting on the eigenvalues and eigenvectors of suitable transfer operators in direct space. Adjusting the eigenvalues, while freezing the eigenvectors, yields a substantial compression of the parameter space. This latter scales by definition with the number of computing neurons. The classification scores, as measured by the displayed accuracy, are however inferior to those attained when the learning is carried in direct space, for an identical architecture and by employing the full set of trainable parameters (with a quadratic dependence on the size of neighbor layers). In this Letter, we propose a variant of the spectral learning method as appeared in Giambagli et al {Nat. Comm.} 2021, which leverages on two sets of eigenvalues, for each mapping between adjacent layers. The eigenvalues act as veritable knobs which can be freely…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Neural Network Applications · Machine Learning and ELM
