An Optical Frontend for a Convolutional Neural Network

Shane Colburn; Yi Chu; Eli Shlizerman; Arka Majumdar

arXiv:1901.03661·cs.CV·May 22, 2019

An Optical Frontend for a Convolutional Neural Network

Shane Colburn, Yi Chu, Eli Shlizerman, Arka Majumdar

PDF

TL;DR

This paper presents a hybrid optical-electronic neural network architecture that leverages nanophotonic components for efficient linear operations, reducing power consumption and increasing speed for large-scale image processing tasks.

Contribution

It introduces a novel optical frontend for the first layer of a CNN, minimizing electronic-optical conversions and enhancing performance over purely electronic systems.

Findings

01

Outperforms electronic architectures in speed and power for large images and kernels.

02

Achieves 87% accuracy on Kaggle Cats and Dogs dataset with modified AlexNet.

03

Design demonstrates feasibility of hybrid photonic-electronic CNNs.

Abstract

The parallelism of optics and the miniaturization of optical components using nanophotonic structures, such as metasurfaces present a compelling alternative to electronic implementations of convolutional neural networks. The lack of a low-power optical nonlinearity, however, requires slow and energy-inefficient conversions between the electronic and optical domains. Here, we design an architecture which utilizes a single electrical to optical conversion by designing a free-space optical frontend unit that implements the linear operations of the first layer with the subsequent layers realized electronically. Speed and power analysis of the architecture indicates that the hybrid photonic-electronic architecture outperforms sole electronic architecture for large image sizes and kernels. Benchmarking of the photonic-electronic architecture on a modified version of AlexNet achieves a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · 1x1 Convolution · Convolution · Local Response Normalization · Grouped Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout · Dense Connections · Max Pooling · Softmax