Scalable Photonic Neural Networks via Surrogate Scattering-Matrix Inverse Design

Azka Maula Iskandar Muda; Uğur Teğin

arXiv:2604.21301·physics.optics·April 24, 2026

TL;DR

This paper presents a scalable method for designing optical neural networks using surrogate inverse design, significantly reducing simulation costs and enabling efficient training of compact photonic processors.

Contribution

The authors introduce a two-stage surrogate workflow and a banded-router architecture that decouple task learning from electromagnetic realization, improving efficiency and scalability.

Findings

01

Achieved near-accurate all-optical classification on MedMNIST after 20 epochs.

02

Improved test accuracy by over 15 percentage points on RSSCN7 with the new architecture.

03

Validated the framework on nonlinear decision tasks like Yin-Yang.

Abstract

Inverse-designed nanophotonic media are a promising platform for compact optical neural networks, but training them end to end is expensive because each adjoint iteration couples the full-wave solver to the dataset minibatch, so the number of electromagnetic simulations scales with both the network depth and the batch size. We introduce a two-stage surrogate workflow that decouples task learning from electromagnetic realization. In the first stage, the trainable optical block is represented as a passive complex matrix with bounded singular values and the classification task is solved directly in matrix space at negligible cost. In the second stage, the selected target operator is transferred to a fabrication-aware freeform device through an adjoint problem driven by a Frobenius-norm transmission residual and a reflection penalty, which removes the minibatch dependence from the full-wave…
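The two stages described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names `project_passive` and `realization_loss`, the unit singular-value bound `s_max`, and the penalty weight `reflection_weight` are all assumptions chosen for illustration, and the "simulated" matrices here are stand-ins for the output of a full-wave solver.

```python
import numpy as np

def project_passive(W, s_max=1.0):
    """Clip the singular values of a complex matrix to at most s_max.

    Assumption: passivity is modeled as a bound on singular values
    (no gain), enforced here by SVD clipping.
    """
    U, s, Vh = np.linalg.svd(W, full_matrices=False)
    # Scale each left singular vector by its clipped singular value.
    return (U * np.minimum(s, s_max)) @ Vh

def realization_loss(T_sim, T_target, R_sim, reflection_weight=0.1):
    """Squared Frobenius-norm transmission residual plus a reflection penalty.

    T_sim and R_sim stand in for the transmission and reflection blocks
    of a simulated scattering matrix; reflection_weight is hypothetical.
    """
    residual = np.linalg.norm(T_sim - T_target, ord="fro") ** 2
    reflection = np.linalg.norm(R_sim, ord="fro") ** 2
    return residual + reflection_weight * reflection

# Stage 1 stand-in: a random complex operator projected onto the passive set.
rng = np.random.default_rng(0)
W = rng.normal(size=(4, 6)) + 1j * rng.normal(size=(4, 6))
T_target = project_passive(W)
max_sv = np.linalg.svd(T_target, compute_uv=False).max()

# Stage 2 stand-in: the loss vanishes for a device that matches the target
# exactly and reflects nothing.
perfect = realization_loss(T_target, T_target, np.zeros((6, 6), dtype=complex))
```

Note that the loss depends only on the target operator and the simulated scattering blocks, not on any training samples, which is the sense in which the second stage has no minibatch dependence.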
