SMOF: Squeezing More Out of Filters Yields Hardware-Friendly CNN Pruning
Yanli Liu, Bochen Guan, Qinwen Xu, Weiyi Li, and Shuxue Quan

TL;DR
SMOF introduces a novel CNN pruning framework that reduces both kernel size and filter channels, leading to hardware-friendly, efficient models suitable for edge devices without requiring specialized hardware modifications.
Contribution
It is the first to explore kernel size reduction in CNN pruning, enhancing hardware compatibility and efficiency compared to existing channel-only pruning methods.
Findings
Significant reduction in model size and inference time.
Effective pruning across various CNN architectures.
Compatibility with standard hardware without custom implementations.
Abstract
For many years, the family of convolutional neural networks (CNNs) has been a workhorse in deep learning. Recently, many novel CNN structures have been designed to address increasingly challenging tasks. To make them work efficiently on edge devices, researchers have proposed various structured network pruning strategies to reduce their memory and computational cost. However, most of them only focus on reducing the number of filter channels per layer without considering the redundancy within individual filter channels. In this work, we explore pruning from another dimension, the kernel size. We develop a CNN pruning framework called SMOF, which Squeezes More Out of Filters by reducing both kernel size and the number of filter channels. Notably, SMOF is friendly to standard hardware devices without any customized low-level implementations, and the pruning effort by kernel size reduction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Machine Learning and ELM
MethodsPruning
