Any-Width Networks
Thanh Vu, Marc Eder, True Price, Jan-Michael Frahm

TL;DR
This paper introduces Any-Width Networks (AWNs), a flexible CNN architecture that enables dynamic adjustment of model width during inference, balancing speed and accuracy for resource-constrained applications.
Contribution
The paper proposes a novel adjustable-width CNN architecture with a training routine that allows fine-grained control over inference speed and accuracy.
Findings
AWNs outperform existing methods in flexibility and efficiency.
AWNs enable real-time adjustment of model width during inference.
The use of lower-triangular weight matrices facilitates multi-width operations.
Abstract
Despite remarkable improvements in speed and accuracy, convolutional neural networks (CNNs) still typically operate as monolithic entities at inference time. This poses a challenge for resource-constrained practical applications, where both computational budgets and performance needs can vary with the situation. To address these constraints, we propose the Any-Width Network (AWN), an adjustable-width CNN architecture and associated training routine that allow for fine-grained control over speed and accuracy during inference. Our key innovation is the use of lower-triangular weight matrices which explicitly address width-varying batch statistics while being naturally suited for multi-width operations. We also show that this design facilitates an efficient training routine based on random width sampling. We empirically demonstrate that our proposed AWNs compare favorably to existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCooperative Communication and Network Coding · Interconnection Networks and Systems · Advanced MIMO Systems Optimization
