Training Hybrid Neural Networks with Multimode Optical Nonlinearities Using Digital Twins
Ilker Oguz, Louis J. E. Suter, Jih-Liang Hsieh, Mustafa Yildirim,, Niyazi Ulas Dinc, Christophe Moser, Demetri Psaltis

TL;DR
This paper presents a hybrid neural network training method that integrates multimode optical nonlinearities via digital twins, achieving high accuracy and energy efficiency in large-scale AI models.
Contribution
It introduces a differentiable optical system model for training hybrid neural networks, combining physical optical transformations with neural computation.
Findings
Achieved state-of-the-art image classification accuracy.
Demonstrated high simulation fidelity and robustness to experimental drifts.
Reduced computational and energy demands for training large neural networks.
Abstract
The ability to train ever-larger neural networks brings artificial intelligence to the forefront of scientific and technical discoveries. However, their exponentially increasing size creates a proportionally greater demand for energy and computational hardware. Incorporating complex physical events in networks as fixed, efficient computation modules can address this demand by decreasing the complexity of trainable layers. Here, we utilize ultrashort pulse propagation in multimode fibers, which perform large-scale nonlinear transformations, for this purpose. Training the hybrid architecture is achieved through a neural model that differentiably approximates the optical system. The training algorithm updates the neural simulator and backpropagates the error signal over this proxy to optimize layers preceding the optical one. Our experimental results achieve state-of-the-art image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpectroscopy Techniques in Biomedical and Chemical Research · Neural Networks and Reservoir Computing · Photonic and Optical Devices
