Dynamic Routing Networks
Shaofeng Cai, Yao Shu, Wei Wang, Beng Chin Ooi

TL;DR
Dynamic Routing Networks (DRNets) enable instance-aware inference by dynamically routing inputs through selected branches, significantly reducing computational costs while maintaining high prediction accuracy.
Contribution
The paper introduces DRNets, a novel architecture that dynamically routes inputs through selected branches for efficient inference, addressing the inefficiency of static models.
Findings
DRNets reduce parameters and FLOPs during inference.
DRNets achieve comparable accuracy to state-of-the-art models.
Dynamic routing improves inference efficiency significantly.
Abstract
The deployment of deep neural networks in real-world applications is mostly restricted by their high inference costs. Extensive efforts have been made to improve the accuracy with expert-designed or algorithm-searched architectures. However, the incremental improvement is typically achieved with increasingly more expensive models that only a small portion of input instances really need. Inference with a static architecture that processes all input instances via the same transformation would thus incur unnecessary computational costs. Therefore, customizing the model capacity in an instance-aware manner is much needed for higher inference efficiency. In this paper, we propose Dynamic Routing Networks (DRNets), which support efficient instance-aware inference by routing the input instance to only necessary transformation branches selected from a candidate set of branches for each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning
MethodsSigmoid Activation · Tanh Activation · Depthwise Convolution · Pointwise Convolution · Depthwise Separable Convolution · Softmax · Batch Normalization · Long Short-Term Memory · Inverted Residual Block · Average Pooling
