Scale-, shift- and rotation-invariant diffractive optical networks

Deniz Mengu; Yair Rivenson; Aydogan Ozcan

arXiv:2010.12747·physics.optics·December 25, 2020

Scale-, shift- and rotation-invariant diffractive optical networks

Deniz Mengu, Yair Rivenson, Aydogan Ozcan

PDF

TL;DR

This paper introduces a novel training method for diffractive optical neural networks that makes them invariant to scale, shift, and rotation of input objects, enhancing their robustness for real-world applications.

Contribution

The authors propose a new training strategy that incorporates object transformations as random variables, enabling diffractive networks to achieve invariance to scale, shift, and rotation.

Findings

01

Achieved scale-, shift-, and rotation-invariance in diffractive optical networks.

02

Improved robustness in optical neural networks for dynamic vision tasks.

03

Potential applications in autonomous vehicles and biomedical imaging.

Abstract

Recent research efforts in optical computing have gravitated towards developing optical neural networks that aim to benefit from the processing speed and parallelism of optics/photonics in machine learning applications. Among these endeavors, Diffractive Deep Neural Networks (D2NNs) harness light-matter interaction over a series of trainable surfaces, designed using deep learning, to compute a desired statistical inference task as the light waves propagate from the input plane to the output field-of-view. Although, earlier studies have demonstrated the generalization capability of diffractive optical networks to unseen data, achieving e.g., >98% image classification accuracy for handwritten digits, these previous designs are in general sensitive to the spatial scaling, translation and rotation of the input objects. Here, we demonstrate a new training strategy for diffractive networks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.