Batch-Shaping for Learning Conditional Channel Gated Networks
Babak Ehteshami Bejnordi, Tijmen Blankevoort, Max Welling

TL;DR
This paper introduces a novel gating mechanism and batch-shaping technique for neural networks, enabling conditional computation that improves accuracy while reducing average computational cost on image classification and segmentation tasks.
Contribution
It proposes a new residual block architecture with fine-grained channel gating and a batch-shaping method to align feature posteriors with prior distributions, enhancing conditional computation.
Findings
Gated networks achieve higher accuracy with similar or lower computational cost.
Networks learn to allocate more features to difficult examples and fewer to simple ones.
Significant accuracy improvements on ImageNet and CIFAR-10 datasets.
Abstract
We present a method that trains large capacity neural networks with significantly improved accuracy and lower dynamic computational cost. We achieve this by gating the deep-learning architecture on a fine-grained-level. Individual convolutional maps are turned on/off conditionally on features in the network. To achieve this, we introduce a new residual block architecture that gates convolutional channels in a fine-grained manner. We also introduce a generally applicable tool - that matches the marginal aggregate posteriors of features in a neural network to a pre-specified prior distribution. We use this novel technique to force gates to be more conditional on the data. We present results on CIFAR-10 and ImageNet datasets for image classification, and Cityscapes for semantic segmentation. Our results show that our method can slim down large architectures conditionally,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Convolution · Batch Normalization · Residual Block · Residual Connection
