ReLaX-Net: Reusing Layers for Parameter-Efficient Physical Neural Networks
Kohei Tsuchiyama, Andre Roehm, Takatomo Mihana, Ryoichi Horisaki

TL;DR
ReLaX-Net introduces a parameter-efficient layer reuse architecture for physical neural networks, leveraging time-multiplexing to enhance performance with minimal hardware modifications, validated on image and language tasks.
Contribution
It proposes a novel layer reuse scheme for PNNs using time-multiplexing, enabling scalable performance improvements with simple hardware additions.
Findings
ReLaX-Net outperforms traditional RNNs and DNNs with same parameters.
Minor hardware modifications enable significant performance gains.
Scales favorably across different tasks.
Abstract
Physical Neural Networks (PNN) are promising platforms for next-generation computing systems. However, recent advances in digital neural network performance are largely driven by the rapid growth in the number of trainable parameters and, so far, demonstrated PNNs are lagging behind by several orders of magnitude in terms of scale. This mirrors size and performance constraints found in early digital neural networks. In that period, efficient reuse of parameters contributed to the development of parameter-efficient architectures such as convolutional neural networks. In this work, we numerically investigate hardware-friendly weight-tying for PNNs. Crucially, with many PNN systems, there is a time-scale separation between the fast dynamic active elements of the forward pass and the only slowly trainable elements implementing weights and biases. With this in mind,we propose the Reuse of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Neural Networks and Reservoir Computing · Advanced Memory and Neural Computing
