ExtremeMETA: High-speed Lightweight Image Segmentation Model by   Remodeling Multi-channel Metamaterial Imagers

Quan Liu; Brandon T. Swartz; Ivan Kravchenko; Jason G. Valentine,; Yuankai Huo

arXiv:2405.17568·cs.CV·May 29, 2024

ExtremeMETA: High-speed Lightweight Image Segmentation Model by Remodeling Multi-channel Metamaterial Imagers

Quan Liu, Brandon T. Swartz, Ivan Kravchenko, Jason G. Valentine,, Yuankai Huo

PDF

Open Access

TL;DR

ExtremeMETA is a high-speed, lightweight image segmentation model that combines optical neural network principles with digital optimization, achieving high accuracy with reduced computational costs.

Contribution

The paper introduces ExtremeMETA, a novel large kernel lightweight segmentation model that extends optical neural network applications and employs model compression for efficiency.

Findings

01

Improved segmentation accuracy from 92.45 to 95.97 mIoU.

02

Reduced FLOPs from 461.07 MMacs to 166.03 MMacs.

03

Demonstrated effectiveness on three public datasets.

Abstract

Deep neural networks (DNNs) have heavily relied on traditional computational units like CPUs and GPUs. However, this conventional approach brings significant computational burdens, latency issues, and high power consumption, limiting their effectiveness. This has sparked the need for lightweight networks like ExtremeC3Net. On the other hand, there have been notable advancements in optical computational units, particularly with metamaterials, offering the exciting prospect of energy-efficient neural networks operating at the speed of light. Yet, the digital design of metamaterial neural networks (MNNs) faces challenges such as precision, noise, and bandwidth, limiting their application to intuitive tasks and low-resolution images. In this paper, we propose a large kernel lightweight segmentation model, ExtremeMETA. Based on the ExtremeC3Net, the ExtremeMETA maximizes the ability of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBrain Tumor Detection and Classification

MethodsSparse Evolutionary Training · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Convolution