Adaptive Neural Networks for Efficient Inference

Tolga Bolukbasi; Joseph Wang; Ofer Dekel; Venkatesh Saligrama

arXiv:1702.07811·cs.LG·September 20, 2017·134 cites

Adaptive Neural Networks for Efficient Inference

Tolga Bolukbasi, Joseph Wang, Ofer Dekel, Venkatesh Saligrama

PDF

Open Access 2 Repos

TL;DR

This paper introduces adaptive neural network evaluation methods that selectively utilize network components or different networks for each example, significantly reducing inference time with minimal accuracy loss.

Contribution

It proposes novel adaptive evaluation schemes that dynamically select network components or networks per example, improving efficiency without sacrificing accuracy.

Findings

01

Achieved up to 2.8x speedup on ImageNet networks

02

Reduced computational cost with less than 1% accuracy loss

03

Demonstrated effectiveness of adaptive early exit and network selection

Abstract

We present an approach to adaptively utilize deep neural networks in order to reduce the evaluation time on new examples without loss of accuracy. Rather than attempting to redesign or approximate existing networks, we propose two schemes that adaptively utilize networks. We first pose an adaptive network evaluation scheme, where we learn a system to adaptively choose the components of a deep network to be evaluated for each example. By allowing examples correctly classified using early layers of the system to exit, we avoid the computational time associated with full evaluation of the network. We extend this to learn a network selection system that adaptively selects the network to be evaluated for each example. We show that computational time can be dramatically reduced by exploiting the fact that many examples can be correctly classified using relatively efficient networks and that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Explainable Artificial Intelligence (XAI)