BlockDrop: Dynamic Inference Paths in Residual Networks

Zuxuan Wu; Tushar Nagarajan; Abhishek Kumar; Steven Rennie; Larry S.; Davis; Kristen Grauman; Rogerio Feris

arXiv:1711.08393·cs.CV·January 29, 2019·40 cites

BlockDrop: Dynamic Inference Paths in Residual Networks

Zuxuan Wu, Tushar Nagarajan, Abhishek Kumar, Steven Rennie, Larry S., Davis, Kristen Grauman, Rogerio Feris

PDF

Open Access 1 Repo

TL;DR

BlockDrop is a method that dynamically selects which residual blocks to evaluate in a deep neural network during inference, significantly reducing computation while maintaining high accuracy, by learning policies through reinforcement learning.

Contribution

It introduces a reinforcement learning-based framework to adaptively drop residual blocks in ResNets during inference, achieving faster predictions without accuracy loss.

Findings

01

Achieves up to 36% speedup on ImageNet with no accuracy degradation.

02

Learned policies encode meaningful visual information.

03

Method is effective on CIFAR and ImageNet datasets.

Abstract

Very deep convolutional neural networks offer excellent recognition results, yet their computational expense limits their impact for many real-world applications. We introduce BlockDrop, an approach that learns to dynamically choose which layers of a deep network to execute during inference so as to best reduce total computation without degrading prediction accuracy. Exploiting the robustness of Residual Networks (ResNets) to layer dropping, our framework selects on-the-fly which residual blocks to evaluate for a given novel image. In particular, given a pretrained ResNet, we train a policy network in an associative reinforcement learning setting for the dual reward of utilizing a minimal number of blocks while preserving recognition accuracy. We conduct extensive experiments on CIFAR and ImageNet. The results provide strong quantitative and qualitative evidence that these learned…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Tushar-N/blockdrop
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning

MethodsAverage Pooling · Global Average Pooling · 1x1 Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Bottleneck Residual Block · Max Pooling · Kaiming Initialization · Residual Connection · Convolution