ExtremeMETA: High-speed Lightweight Image Segmentation Model by Remodeling Multi-channel Metamaterial Imagers
Quan Liu, Brandon T. Swartz, Ivan Kravchenko, Jason G. Valentine,, Yuankai Huo

TL;DR
ExtremeMETA is a high-speed, lightweight image segmentation model that combines optical neural network principles with digital optimization, achieving high accuracy with reduced computational costs.
Contribution
The paper introduces ExtremeMETA, a novel large kernel lightweight segmentation model that extends optical neural network applications and employs model compression for efficiency.
Findings
Improved segmentation accuracy from 92.45 to 95.97 mIoU.
Reduced FLOPs from 461.07 MMacs to 166.03 MMacs.
Demonstrated effectiveness on three public datasets.
Abstract
Deep neural networks (DNNs) have heavily relied on traditional computational units like CPUs and GPUs. However, this conventional approach brings significant computational burdens, latency issues, and high power consumption, limiting their effectiveness. This has sparked the need for lightweight networks like ExtremeC3Net. On the other hand, there have been notable advancements in optical computational units, particularly with metamaterials, offering the exciting prospect of energy-efficient neural networks operating at the speed of light. Yet, the digital design of metamaterial neural networks (MNNs) faces challenges such as precision, noise, and bandwidth, limiting their application to intuitive tasks and low-resolution images. In this paper, we propose a large kernel lightweight segmentation model, ExtremeMETA. Based on the ExtremeC3Net, the ExtremeMETA maximizes the ability of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification
MethodsSparse Evolutionary Training · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Convolution
