Efficient training for large-scale optical neural network using an evolutionary strategy and attention pruning
Zhiwei Yang, Zeyang Fan, Yihang Lai, Qi Chen, Tian Zhang, Jian Dai, and Kun Xu

TL;DR
This paper introduces the CAP algorithm, an evolutionary and attention pruning method for large-scale optical neural networks, significantly reducing parameters while maintaining performance and robustness against noise.
Contribution
The paper proposes a novel CAP algorithm that combines pruning and evolutionary strategies to efficiently train large-scale BONNs with improved robustness and reduced parameters.
Findings
Prunes up to 80% of parameters with minimal performance loss.
Demonstrates robustness against phase shifter noise.
Achieves high accuracy with significantly fewer parameters.
Abstract
MZI-based block optical neural networks (BONNs), which can achieve large-scale network models, have increasingly drawn attentions. However, the robustness of the current training algorithm is not high enough. Moreover, large-scale BONNs usually contain numerous trainable parameters, resulting in expensive computation and power consumption. In this article, by pruning matrix blocks and directly optimizing the individuals in population, we propose an on-chip covariance matrix adaptation evolution strategy and attention-based pruning (CAP) algorithm for large-scale BONNs. The calculated results demonstrate that the CAP algorithm can prune 60% and 80% of the parameters for MNIST and Fashion-MNIST datasets, respectively, while only degrades the performance by 3.289% and 4.693%. Considering the influence of dynamic noise in phase shifters, our proposed CAP algorithm (performance degradation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Neural Networks and Applications · Optical Network Technologies
MethodsPruning
