LeanConvNets: Low-cost Yet Effective Convolutional Neural Networks
Jonathan Ephrath, Moshe Eliasof, Lars Ruthotto, Eldad Haber, Eran, Treister

TL;DR
LeanConvNets are a new class of CNNs that sparsify convolution operators to significantly reduce computational costs while maintaining accuracy, outperforming some existing lightweight architectures on benchmark tasks.
Contribution
Introduction of sparsified convolution operators in CNNs, enabling more efficient networks with minimal accuracy loss, applicable to residual networks and other architectures.
Findings
Achieve near state-of-the-art accuracy with reduced computational cost.
Outperform MobileNets and ShuffleNets in most benchmark tests.
Demonstrate effectiveness across image classification and semantic segmentation tasks.
Abstract
Convolutional Neural Networks (CNNs) have become indispensable for solving machine learning tasks in speech recognition, computer vision, and other areas that involve high-dimensional data. A CNN filters the input feature using a network containing spatial convolution operators with compactly supported stencils. In practice, the input data and the hidden features consist of a large number of channels, which in most CNNs are fully coupled by the convolution operators. This coupling leads to immense computational cost in the training and prediction phase. In this paper, we introduce LeanConvNets that are derived by sparsifying fully-coupled operators in existing CNNs. Our goal is to improve the efficiency of CNNs by reducing the number of weights, floating point operations and latency times, with minimal loss of accuracy. Our lean convolution operators involve tuning parameters that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAverage Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling · Residual Connection
