On the training of sparse and dense deep neural networks: less   parameters, same performance

Lorenzo Chicchi; Lorenzo Giambagli; Lorenzo Buffoni; Timoteo Carletti,; Marco Ciavarella; Duccio Fanelli

arXiv:2106.09021·cs.LG·December 8, 2021·1 cites

On the training of sparse and dense deep neural networks: less parameters, same performance

Lorenzo Chicchi, Lorenzo Giambagli, Lorenzo Buffoni, Timoteo Carletti,, Marco Ciavarella, Duccio Fanelli

PDF

Open Access

TL;DR

This paper introduces a spectral learning method for deep neural networks that reduces parameters and achieves performance close to traditional training, enabling the creation of highly sparse networks with strong classification capabilities.

Contribution

The authors propose a spectral training approach using two sets of eigenvalues, improving performance and sparsity compared to previous spectral methods and reducing computational costs.

Findings

01

Spectral training achieves near-conventional performance with fewer parameters.

02

The method enables the creation of highly sparse neural networks.

03

Spectral parameters influence node contribution and excitability effectively.

Abstract

Deep neural networks can be trained in reciprocal space, by acting on the eigenvalues and eigenvectors of suitable transfer operators in direct space. Adjusting the eigenvalues, while freezing the eigenvectors, yields a substantial compression of the parameter space. This latter scales by definition with the number of computing neurons. The classification scores, as measured by the displayed accuracy, are however inferior to those attained when the learning is carried in direct space, for an identical architecture and by employing the full set of trainable parameters (with a quadratic dependence on the size of neighbor layers). In this Letter, we propose a variant of the spectral learning method as appeared in Giambagli et al {Nat. Comm.} 2021, which leverages on two sets of eigenvalues, for each mapping between adjacent layers. The eigenvalues act as veritable knobs which can be freely…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Advanced Neural Network Applications · Machine Learning and ELM