All-Optical Machine Learning Using Diffractive Deep Neural Networks

Xing Lin; Yair Rivenson; Nezih T. Yardimci; Muhammed Veli; Mona; Jarrahi; Aydogan Ozcan

arXiv:1804.08711·cs.NE·September 26, 2018

All-Optical Machine Learning Using Diffractive Deep Neural Networks

Xing Lin, Yair Rivenson, Nezih T. Yardimci, Muhammed Veli, Mona, Jarrahi, Aydogan Ozcan

PDF

TL;DR

This paper presents a novel all-optical deep neural network architecture, D2NN, that uses passive diffractive layers to perform complex functions like image classification and lens functions at the speed of light, enabling rapid optical processing.

Contribution

The paper introduces the D2NN framework, demonstrating its ability to implement neural network functions optically using 3D-printed diffractive layers, a significant advancement in all-optical computing.

Findings

01

Successfully demonstrated handwritten digit classification using D2NN

02

Implemented optical lens functions at terahertz spectrum with D2NN

03

Achieved real-time, all-optical processing at the speed of light

Abstract

We introduce an all-optical Diffractive Deep Neural Network (D2NN) architecture that can learn to implement various functions after deep learning-based design of passive diffractive layers that work collectively. We experimentally demonstrated the success of this framework by creating 3D-printed D2NNs that learned to implement handwritten digit classification and the function of an imaging lens at terahertz spectrum. With the existing plethora of 3D-printing and other lithographic fabrication methods as well as spatial-light-modulators, this all-optical deep learning framework can perform, at the speed of light, various complex functions that computer-based neural networks can implement, and will find applications in all-optical image analysis, feature detection and object classification, also enabling new camera designs and optical components that can learn to perform unique tasks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings