Memory-Optimized Once-For-All Network

Maxime Girard; Victor Qu\'etu; Samuel Tardieu; Van-Tam Nguyen; Enzo; Tartaglione

arXiv:2409.05900·cs.LG·September 11, 2024

Memory-Optimized Once-For-All Network

Maxime Girard, Victor Qu\'etu, Samuel Tardieu, Van-Tam Nguyen, Enzo, Tartaglione

PDF

Open Access 2 Repos

TL;DR

This paper introduces Memory-Optimized OFA (MOOFA), a supernet that better utilizes device memory to improve DNN deployment efficiency and accuracy on resource-constrained hardware.

Contribution

We propose MOOFA, a novel supernet that maximizes memory usage for improved model generalizability and performance on limited-resource devices.

Findings

01

MOOFA outperforms original OFA in memory utilization.

02

MOOFA achieves higher accuracy on ImageNet.

03

Enhanced features diversity in MOOFA improves deployment efficiency.

Abstract

Deploying Deep Neural Networks (DNNs) on different hardware platforms is challenging due to varying resource constraints. Besides handcrafted approaches aiming at making deep models hardware-friendly, Neural Architectures Search is rising as a toolbox to craft more efficient DNNs without sacrificing performance. Among these, the Once-For-All (OFA) approach offers a solution by allowing the sampling of well-performing sub-networks from a single supernet -- this leads to evident advantages in terms of computation. However, OFA does not fully utilize the potential memory capacity of the target device, focusing instead on limiting maximum memory usage per layer. This leaves room for an unexploited potential in terms of model generalizability. In this paper, we introduce a Memory-Optimized OFA (MOOFA) supernet, designed to enhance DNN deployment on resource-limited devices by maximizing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInterconnection Networks and Systems · IoT and Edge/Fog Computing · Cloud Computing and Resource Management

MethodsOFA