Efficient Training for Optical Computing
Manon P. Bart, Nick Sparks, Ryan T. Glasser

TL;DR
This paper presents a novel, efficient training method for optical computing systems that leverages the structured nature of diffractive optical processors, significantly reducing training time for large-scale machine learning tasks.
Contribution
The authors introduce a new backpropagation algorithm utilizing Fourier transforms to compute gradients efficiently in optical systems, enabling scalable training of complex linear transformations.
Findings
Significant reduction in training time for optical systems.
Effective gradient computation across all trainable elements.
Broad applicability to wavefront shaping and signal processing.
Abstract
Diffractive optical information processors have demonstrated significant promise in delivering high-speed, parallel, and energy efficient inference for scaling machine learning tasks. Training, however, remains a major computational bottleneck, compounded by large datasets and many simulations required for state-of-the-art classification models. The underlying linear transformations in such systems are inherently constrained to compositions of circulant and diagonal matrix factors, representing free-space propagation and phase and/or amplitude modulation of light, respectively. While theoretically established that an arbitrary linear transformation can be generated by such factors, only upper bounds on the number of factors exist, which are experimentally unfeasible. Additionally, physical parameters such as inter-layer distance, number of layers, and phase-only modulation further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemiconductor Lasers and Optical Devices · Photonic and Optical Devices
