Does Interference Exist When Training a Once-For-All Network?
Jordan Shipard, Arnold Wiliem, Clinton Fookes

TL;DR
This paper reevaluates the impact of interference during training of Once-For-All networks, finding that architecture bias is more critical than interference mitigation, and introduces a simple sampling method that improves efficiency.
Contribution
It challenges the belief that interference mitigation is crucial in OFA training and proposes Random Subnet Sampling, which simplifies training and enhances performance.
Findings
Interference mitigation strategies have limited impact on subnet performance.
Random Subnet Sampling outperforms Progressive Shrinking in several datasets.
RSS significantly reduces training time with minimal performance loss.
Abstract
The Once-For-All (OFA) method offers an excellent pathway to deploy a trained neural network model into multiple target platforms by utilising the supernet-subnet architecture. Once trained, a subnet can be derived from the supernet (both architecture and trained weights) and deployed directly to the target platform with little to no retraining or fine-tuning. To train the subnet population, OFA uses a novel training method called Progressive Shrinking (PS) which is designed to limit the negative impact of interference during training. It is believed that higher interference during training results in lower subnet population accuracies. In this work we take a second look at this interference effect. Surprisingly, we find that interference mitigation strategies do not have a large impact on the overall subnet population performance. Instead, we find the subnet architecture selection bias…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Age of Information Optimization · IoT and Edge/Fog Computing
MethodsOFA
