# OICSR: Out-In-Channel Sparsity Regularization for Compact Deep Neural   Networks

**Authors:** Jiashi Li, Qi Qi, Jingyu Wang, Ce Ge, Yujian Li, Zhangzhang Yue, and, Haifeng Sun

arXiv: 1905.11664 · 2019-07-02

## TL;DR

This paper introduces OICSR, a novel channel pruning method that considers correlations between successive layers to create more compact neural networks without sacrificing accuracy.

## Contribution

The paper proposes Out-In-Channel Sparsity Regularization (OICSR), which leverages inter-layer correlations for more effective channel pruning in deep neural networks.

## Key findings

- Reduces 37.2% FLOPs on ResNet-50 with improved accuracy.
- Outperforms baseline models on CIFAR and ImageNet datasets.
- Effectively transfers discriminative features into fewer channels.

## Abstract

Channel pruning can significantly accelerate and compress deep neural networks. Many channel pruning works utilize structured sparsity regularization to zero out all the weights in some channels and automatically obtain structure-sparse network in training stage. However, these methods apply structured sparsity regularization on each layer separately where the correlations between consecutive layers are omitted. In this paper, we first combine one out-channel in current layer and the corresponding in-channel in next layer as a regularization group, namely out-in-channel. Our proposed Out-In-Channel Sparsity Regularization (OICSR) considers correlations between successive layers to further retain predictive power of the compact network. Training with OICSR thoroughly transfers discriminative features into a fraction of out-in-channels. Correspondingly, OICSR measures channel importance based on statistics computed from two consecutive layers, not individual layer. Finally, a global greedy pruning algorithm is designed to remove redundant out-in-channels in an iterative way. Our method is comprehensively evaluated with various CNN architectures including CifarNet, AlexNet, ResNet, DenseNet and PreActSeNet on CIFAR-10, CIFAR-100 and ImageNet-1K datasets. Notably, on ImageNet-1K, we reduce 37.2% FLOPs on ResNet-50 while outperforming the original model by 0.22% top-1 accuracy.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.11664/full.md

## Figures

40 figures with captions in the complete paper: https://tomesphere.com/paper/1905.11664/full.md

## References

39 references — full list in the complete paper: https://tomesphere.com/paper/1905.11664/full.md

---
Source: https://tomesphere.com/paper/1905.11664