FatNet: High Resolution Kernels for Classification Using Fully   Convolutional Optical Neural Networks

Riad Ibadulla; Thomas M. Chen; Constantino Carlos Reyes-Aldasoro

arXiv:2210.16914·cs.CV·November 1, 2022

FatNet: High Resolution Kernels for Classification Using Fully Convolutional Optical Neural Networks

Riad Ibadulla, Thomas M. Chen, Constantino Carlos Reyes-Aldasoro

PDF

Open Access 1 Repo

TL;DR

FatNet is a fully convolutional optical neural network leveraging high-resolution kernels and feature maps, optimized for free-space 4f systems, achieving faster inference with minimal accuracy loss compared to traditional models.

Contribution

This paper introduces FatNet, a novel fully convolutional optical neural network architecture that enhances inference speed using high-resolution kernels in free-space optical systems.

Findings

01

8.2 times fewer convolution operations than ResNet-18

02

Only 6% lower accuracy compared to ResNet-18

03

Faster inference in optical systems with high-resolution kernels

Abstract

This paper describes the transformation of a traditional in-silico classification network into an optical fully convolutional neural network with high-resolution feature maps and kernels. When using the free-space 4f system to accelerate the inference speed of neural networks, higher resolutions of feature maps and kernels can be used without the loss in frame rate. We present FatNet for the classification of images, which is more compatible with free-space acceleration than standard convolutional classifiers. It neglects the standard combination of convolutional feature extraction and classifier dense layers by performing both in one fully convolutional network. This approach takes full advantage of the parallelism in the 4f free-space system and performs fewer conversions between electronics and optics by reducing the number of channels and increasing the resolution, making the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

riadibadulla/simulator
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Reservoir Computing · Optical Network Technologies · Optical Coherence Tomography Applications

MethodsConvolution · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings