Freely scalable and reconfigurable optical hardware for deep learning
Liane Bernstein (1), Alexander Sludds (1), Ryan Hamerly (1, 2),, Vivienne Sze (1), Joel Emer (1, 3), Dirk Englund (1) ((1) Massachusetts, Institute of Technology, (2) NTT Research Inc., (3) NVIDIA)

TL;DR
This paper introduces a digital optical neural network (DONN) architecture that enhances scalability and reconfigurability for deep learning by leveraging optical interconnects, demonstrated through a proof-of-concept experiment on MNIST classification.
Contribution
The paper presents a novel optical neural network design with intralayer optical interconnects and reconfigurable inputs, addressing scalability limitations of electronic processors.
Findings
Optical multicast successfully classified 500 MNIST images.
Optical data transfer reduces energy consumption for unit spacing over 10 micrometers.
DONN architecture offers greater flexibility in neural network design.
Abstract
As deep neural network (DNN) models grow ever-larger, they can achieve higher accuracy and solve more complex problems. This trend has been enabled by an increase in available compute power; however, efforts to continue to scale electronic processors are impeded by the costs of communication, thermal management, power delivery and clocking. To improve scalability, we propose a digital optical neural network (DONN) with intralayer optical interconnects and reconfigurable input values. The near path-length-independence of optical energy consumption enables information locality between a transmitter and arbitrarily arranged receivers, which allows greater flexibility in architecture design to circumvent scaling limitations. In a proof-of-concept experiment, we demonstrate optical multicast in the classification of 500 MNIST images with a 3-layer, fully-connected network. We also analyze…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
