Dual adaptive training of photonic neural networks
Ziyang Zheng, Zhengyang Duan, Hang Chen, Rui Yang, Sheng Gao, Haiou, Zhang, Hongkai Xiong, Xing Lin

TL;DR
This paper introduces dual adaptive training (DAT) for photonic neural networks, enabling them to adapt to systematic errors and maintain high performance in large-scale physical implementations, surpassing existing training methods.
Contribution
The paper proposes a novel dual adaptive training method that predicts systematic errors and optimizes model accuracy during deployment of photonic neural networks.
Findings
DAT achieves high similarity mapping between models and physical systems.
DAT maintains classification accuracy in large-scale PNNs with systematic errors.
DAT outperforms state-of-the-art in situ training approaches.
Abstract
Photonic neural network (PNN) is a remarkable analog artificial intelligence (AI) accelerator that computes with photons instead of electrons to feature low latency, high energy efficiency, and high parallelism. However, the existing training approaches cannot address the extensive accumulation of systematic errors in large-scale PNNs, resulting in a significant decrease in model performance in physical systems. Here, we propose dual adaptive training (DAT) that allows the PNN model to adapt to substantial systematic errors and preserves its performance during the deployment. By introducing the systematic error prediction networks with task-similarity joint optimization, DAT achieves the high similarity mapping between the PNN numerical models and physical systems and high-accurate gradient calculations during the dual backpropagation training. We validated the effectiveness of DAT by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Photonic and Optical Devices · Optical Network Technologies
