# Parameterized Structured Pruning for Deep Neural Networks

**Authors:** Guenther Schindler, Wolfgang Roth, Franz Pernkopf, Holger Froening

arXiv: 1906.05180 · 2019-06-13

## TL;DR

This paper introduces Parameterized Structured Pruning (PSP), a method that learns structured sparsity in DNNs to improve hardware efficiency without sacrificing accuracy.

## Contribution

PSP is a novel approach that dynamically learns structured sparsity in DNNs, enabling efficient mapping to parallel hardware while maintaining performance.

## Key findings

- PSP achieves high sparsity with minimal accuracy loss.
- Structured sparsity improves hardware efficiency.
- Validated on CIFAR and ImageNet datasets.

## Abstract

As a result of the growing size of Deep Neural Networks (DNNs), the gap to hardware capabilities in terms of memory and compute increases. To effectively compress DNNs, quantization and connection pruning are usually considered. However, unconstrained pruning usually leads to unstructured parallelism, which maps poorly to massively parallel processors, and substantially reduces the efficiency of general-purpose processors. Similar applies to quantization, which often requires dedicated hardware. We propose Parameterized Structured Pruning (PSP), a novel method to dynamically learn the shape of DNNs through structured sparsity. PSP parameterizes structures (e.g. channel- or layer-wise) in a weight tensor and leverages weight decay to learn a clear distinction between important and unimportant structures. As a result, PSP maintains prediction performance, creates a substantial amount of sparsity that is structured and, thus, easy and efficient to map to a variety of massively parallel processors, which are mandatory for utmost compute power and energy efficiency. PSP is experimentally validated on the popular CIFAR10/100 and ILSVRC2012 datasets using ResNet and DenseNet architectures, respectively.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.05180/full.md

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/1906.05180/full.md

## References

43 references — full list in the complete paper: https://tomesphere.com/paper/1906.05180/full.md

---
Source: https://tomesphere.com/paper/1906.05180