Single-Shot Optical Neural Network
Liane Bernstein, Alexander Sludds, Christopher Panuski, Sivan, Trajtenberg-Mills, Ryan Hamerly, Dirk Englund

TL;DR
This paper introduces a scalable, single-shot optical neural network processor that leverages free-space optics and integrated optoelectronics, achieving high accuracy on MNIST and a theoretical throughput of 0.9 exaMAC/s.
Contribution
It presents a novel scalable optical neural network architecture capable of processing approximately 1,000 input elements per layer with high efficiency and accuracy.
Findings
Achieved 94.7% accuracy on MNIST without retraining.
Demonstrated a theoretical throughput of ~0.9 exaMAC/s.
Scalable design with $K \\approx 1,000$ and beyond.
Abstract
As deep neural networks (DNNs) grow to solve increasingly complex problems, they are becoming limited by the latency and power consumption of existing digital processors. For improved speed and energy efficiency, specialized analog optical and electronic hardware has been proposed, however, with limited scalability (input vector length of hundreds of elements). Here, we present a scalable, single-shot-per-layer analog optical processor that uses free-space optics to reconfigurably distribute an input vector and integrated optoelectronics for static, updatable weighting and the nonlinearity -- with and beyond. We experimentally test classification accuracy of the MNIST handwritten digit dataset, achieving 94.7% (ground truth 96.3%) without data preprocessing or retraining on the hardware. We also determine the fundamental upper bound on throughput (0.9…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Photonic and Optical Devices · Optical Network Technologies
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
