Time-lapse image classification using a diffractive neural network
Md Sadman Sakib Rahman, Aydogan Ozcan

TL;DR
This paper introduces a novel time-lapse diffractive neural network that enhances optical image classification accuracy by leveraging lateral movements of objects and the network, achieving the highest accuracy to date on CIFAR-10 with a single diffractive network.
Contribution
It presents the first implementation of a time-lapse diffractive neural network, improving classification accuracy and generalization by utilizing lateral shifts of input objects.
Findings
Achieved 62.03% accuracy on CIFAR-10 classification.
Demonstrated the highest inference accuracy with a single diffractive network on CIFAR-10.
Showed that lateral movements improve optical classification performance.
Abstract
Diffractive deep neural networks (D2NNs) define an all-optical computing framework comprised of spatially engineered passive surfaces that collectively process optical input information by modulating the amplitude and/or the phase of the propagating light. Diffractive optical networks complete their computational tasks at the speed of light propagation through a thin diffractive volume, without any external computing power while exploiting the massive parallelism of optics. Diffractive networks were demonstrated to achieve all-optical classification of objects and perform universal linear transformations. Here we demonstrate, for the first time, a "time-lapse" image classification scheme using a diffractive network, significantly advancing its classification accuracy and generalization performance on complex input objects by using the lateral movements of the input objects and/or the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Optical Network Technologies · Photonic and Optical Devices
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
