Channel Gating Neural Networks
Weizhe Hua, Yuan Zhou, Christopher De Sa, Zhiru Zhang, and G. Edward, Suh

TL;DR
This paper proposes channel gating, a dynamic pruning method for CNNs that reduces computation by skipping less important channels at run-time, achieving significant FLOP reduction with minimal accuracy loss.
Contribution
It introduces a novel input-dependent channel gating scheme that enhances CNN efficiency and demonstrates hardware-friendly implementation and acceleration.
Findings
Achieves 2.7-8.0× FLOP reduction on CIFAR-10.
Reduces ResNet-18 compute cost by 2.6× on ImageNet.
Provides hardware accelerator design for channel gating networks.
Abstract
This paper introduces channel gating, a dynamic, fine-grained, and hardware-efficient pruning scheme to reduce the computation cost for convolutional neural networks (CNNs). Channel gating identifies regions in the features that contribute less to the classification result, and skips the computation on a subset of the input channels for these ineffective regions. Unlike static network pruning, channel gating optimizes CNN inference at run-time by exploiting input-specific characteristics, which allows substantially reducing the compute cost with almost no accuracy loss. We experimentally show that applying channel gating in state-of-the-art networks achieves 2.7-8.0 reduction in floating-point operations (FLOPs) and 2.0-4.4 reduction in off-chip memory accesses with a minimal accuracy loss on CIFAR-10. Combining our method with knowledge distillation reduces the compute…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Video Surveillance and Tracking Methods · Anomaly Detection Techniques and Applications
MethodsPruning · Knowledge Distillation
