A Progressive Sub-Network Searching Framework for Dynamic Inference
Li Yang, Zhezhi He, Yu Cao, Deliang Fan

TL;DR
This paper introduces a progressive sub-network searching framework that enhances the dynamic inference capability of deep neural networks, allowing better accuracy-complexity trade-offs post-deployment through efficient sub-net sampling and selection techniques.
Contribution
It proposes a novel progressive sub-net searching framework with techniques like trainable noise ranking and sub-net re-selection to improve dynamic inference performance.
Findings
Outperforms prior methods on CIFAR-10 and ImageNet datasets.
Achieves 4.4% better accuracy in dynamic inference over Universally-Slimmable-Network.
Demonstrates effectiveness across different network structures.
Abstract
Many techniques have been developed, such as model compression, to make Deep Neural Networks (DNNs) inference more efficiently. Nevertheless, DNNs still lack excellent run-time dynamic inference capability to enable users trade-off accuracy and computation complexity (i.e., latency on target hardware) after model deployment, based on dynamic requirements and environments. Such research direction recently draws great attention, where one realization is to train the target DNN through a multiple-term objective function, which consists of cross-entropy terms from multiple sub-nets. Our investigation in this work show that the performance of dynamic inference highly relies on the quality of sub-net sampling. With objective to construct a dynamic DNN and search multiple high quality sub-nets with minimal searching cost, we propose a progressive sub-net searching framework, which is embedded…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
