Physics-aware Roughness Optimization for Diffractive Optical Neural Networks
Shanglin Zhou, Yingjie Li, Minhan Lou, Weilu Gao, Zhijie Shi, Cunxi, Yu, Caiwen Ding

TL;DR
This paper introduces a physics-aware training framework for diffractive optical neural networks that reduces phase mask roughness and improves deployment accuracy by integrating regularization, sparsification, and periodic optimization.
Contribution
It proposes a novel physics-aware training method with regularization and sparsification to align numerical models with physical optical device performance.
Findings
Achieves up to 35.7% reduction in phase mask roughness.
Maintains comparable accuracy on multiple datasets.
Enhances practical deployment of DONNs with minimal performance loss.
Abstract
As a representative next-generation device/circuit technology beyond CMOS, diffractive optical neural networks (DONNs) have shown promising advantages over conventional deep neural networks due to extreme fast computation speed (light speed) and low energy consumption. However, there is a mismatch, i.e., significant prediction accuracy loss, between the DONN numerical modelling and physical optical device deployment, because of the interpixel interaction within the diffractive layers. In this work, we propose a physics-aware diffractive optical neural network training framework to reduce the performance difference between numerical modeling and practical deployment. Specifically, we propose the roughness modeling regularization in the training process and integrate the physics-aware sparsification method to introduce sparsity to the phase masks to reduce sharp phase changes between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Photonic and Optical Devices · Optical Network Technologies
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
