Scalable Photonic Neural Networks via Surrogate Scattering-Matrix Inverse Design
Azka Maula Iskandar Muda, U\u{g}ur Te\u{g}in

TL;DR
This paper presents a scalable method for designing optical neural networks using surrogate inverse design, significantly reducing simulation costs and enabling efficient training of compact photonic processors.
Contribution
The authors introduce a two-stage surrogate workflow and a banded-router architecture that decouple task learning from electromagnetic realization, improving efficiency and scalability.
Findings
Achieved near-accurate all-optical classification on MedMNIST after 20 epochs.
Improved test accuracy by over 15 percentage points on RSSCN7 with the new architecture.
Validated the framework on nonlinear decision tasks like Yin-Yang.
Abstract
Inverse-designed nanophotonic media are a promising platform for compact optical neural networks, but training them end to end is expensive because each adjoint iteration couples the full-wave solver to the dataset minibatch, so the number of electromagnetic simulations scales with both the network depth and the batch size. We introduce a two-stage surrogate workflow that decouples task learning from electromagnetic realization. In the first stage, the trainable optical block is represented as a passive complex matrix with bounded singular values and the classification task is solved directly in matrix space at negligible cost. In the second stage, the selected target operator is transferred to a fabrication-aware freeform device through an adjoint problem driven by a Frobenius-norm transmission residual and a reflection penalty, which removes the minibatch dependence from the full-wave…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
