Dense Pruning of Pointwise Convolutions in the Frequency Domain
Mark Buckler, Neil Adit, Yuwei Hu, Zhiru Zhang, and Adrian Sampson

TL;DR
This paper introduces a novel frequency domain pruning method for pointwise convolutions in CNNs, transforming activations with DCT and learning channel-specific thresholds to reduce computation without significant accuracy loss.
Contribution
It unifies frequency domain techniques with pointwise convolutions by transforming activations, enabling dense, efficient pruning based on learned frequency thresholds.
Findings
Reduced MobileNetV2 computation time by 22%
Achieved less than 1% accuracy degradation
Introduced a novel learned threshold for frequency pruning
Abstract
Depthwise separable convolutions and frequency-domain convolutions are two recent ideas for building efficient convolutional neural networks. They are seemingly incompatible: the vast majority of operations in depthwise separable CNNs are in pointwise convolutional layers, but pointwise layers use 1x1 kernels, which do not benefit from frequency transformation. This paper unifies these two ideas by transforming the activations, not the kernels. Our key insights are that 1) pointwise convolutions commute with frequency transformation and thus can be computed in the frequency domain without modification, 2) each channel within a given layer has a different level of sensitivity to frequency domain pruning, and 3) each channel's sensitivity to frequency pruning is approximately monotonic with respect to frequency. We leverage this knowledge by proposing a new technique which wraps each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Advanced Vision and Imaging · Model Reduction and Neural Networks
MethodsPruning · Pointwise Convolution · Depthwise Convolution · Depthwise Separable Convolution · Batch Normalization · 1x1 Convolution · Convolution · Discrete Cosine Transform · Inverted Residual Block · Average Pooling
