Adaptive Neural Networks for Efficient Inference
Tolga Bolukbasi, Joseph Wang, Ofer Dekel, Venkatesh Saligrama

TL;DR
This paper introduces adaptive neural network evaluation methods that selectively utilize network components or different networks for each example, significantly reducing inference time with minimal accuracy loss.
Contribution
It proposes novel adaptive evaluation schemes that dynamically select network components or networks per example, improving efficiency without sacrificing accuracy.
Findings
Achieved up to 2.8x speedup on ImageNet networks
Reduced computational cost with less than 1% accuracy loss
Demonstrated effectiveness of adaptive early exit and network selection
Abstract
We present an approach to adaptively utilize deep neural networks in order to reduce the evaluation time on new examples without loss of accuracy. Rather than attempting to redesign or approximate existing networks, we propose two schemes that adaptively utilize networks. We first pose an adaptive network evaluation scheme, where we learn a system to adaptively choose the components of a deep network to be evaluated for each example. By allowing examples correctly classified using early layers of the system to exit, we avoid the computational time associated with full evaluation of the network. We extend this to learn a network selection system that adaptively selects the network to be evaluated for each example. We show that computational time can be dramatically reduced by exploiting the fact that many examples can be correctly classified using relatively efficient networks and that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Explainable Artificial Intelligence (XAI)
