Does Interference Exist When Training a Once-For-All Network?

Jordan Shipard; Arnold Wiliem; Clinton Fookes

arXiv:2204.09210·cs.LG·April 21, 2022·1 cites

Does Interference Exist When Training a Once-For-All Network?

Jordan Shipard, Arnold Wiliem, Clinton Fookes

PDF

Open Access 1 Repo

TL;DR

This paper reevaluates the impact of interference during training of Once-For-All networks, finding that architecture bias is more critical than interference mitigation, and introduces a simple sampling method that improves efficiency.

Contribution

It challenges the belief that interference mitigation is crucial in OFA training and proposes Random Subnet Sampling, which simplifies training and enhances performance.

Findings

01

Interference mitigation strategies have limited impact on subnet performance.

02

Random Subnet Sampling outperforms Progressive Shrinking in several datasets.

03

RSS significantly reduces training time with minimal performance loss.

Abstract

The Once-For-All (OFA) method offers an excellent pathway to deploy a trained neural network model into multiple target platforms by utilising the supernet-subnet architecture. Once trained, a subnet can be derived from the supernet (both architecture and trained weights) and deployed directly to the target platform with little to no retraining or fine-tuning. To train the subnet population, OFA uses a novel training method called Progressive Shrinking (PS) which is designed to limit the negative impact of interference during training. It is believed that higher interference during training results in lower subnet population accuracies. In this work we take a second look at this interference effect. Surprisingly, we find that interference mitigation strategies do not have a large impact on the overall subnet population performance. Instead, we find the subnet architecture selection bias…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jordan-hs/rss-interference-cvprw2022
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Age of Information Optimization · IoT and Edge/Fog Computing

MethodsOFA