All-Optical Image Identification with Programmable Matrix Transformation

Shikang Li; Baohua Ni; Xue Feng; Kaiyu Cui; Fang Liu; Wei Zhang; and; Yidong Huang

arXiv:2104.02474·physics.optics·August 20, 2021

All-Optical Image Identification with Programmable Matrix Transformation

Shikang Li, Baohua Ni, Xue Feng, Kaiyu Cui, Fang Liu, Wei Zhang, and, Yidong Huang

PDF

TL;DR

This paper presents a programmable all-optical neural network capable of high-speed image classification using matrix transformations and nonlinear photodetection, achieving high accuracy and potential processing speeds up to 74 trillion FLOPs per second.

Contribution

It introduces a novel all-optical neural network architecture with programmable matrix operations and nonlinear activation, enabling high-speed, high-accuracy image classification on a single platform.

Findings

01

Achieved high accuracy in classifying handwritten digits, objects, and depth images.

02

Demonstrated potential processing speeds up to 74T FLOPs/sec.

03

Implemented programmable matrix transformations using spatial light modulators.

Abstract

An optical neural network is proposed and demonstrated with programmable matrix transformation and nonlinear activation function of photodetection (square-law detection). Based on discrete phase-coherent spatial modes, the dimensionality of programmable optical matrix operations is 30~37, which is implemented by spatial light modulators. With this architecture, all-optical classification tasks of handwritten digits, objects and depth images are performed on the same platform with high accuracy. Due to the parallel nature of matrix multiplication, the processing speed of our proposed architecture is potentially as high as7.4T~74T FLOPs per second (with 10~100GHz detector)

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.