Grow-Push-Prune: aligning deep discriminants for effective structural   network compression

Qing Tian; Tal Arbel; James J. Clark

arXiv:2009.13716·cs.CV·October 5, 2021

Grow-Push-Prune: aligning deep discriminants for effective structural network compression

Qing Tian, Tal Arbel, James J. Clark

PDF

Open Access

TL;DR

This paper introduces Grow-Push-Prune, an iterative method for creating task-specific compact deep neural networks by aligning discriminants and pruning unnecessary neurons, demonstrated to outperform existing models on multiple datasets.

Contribution

It proposes a novel Grow-Push-Prune approach that combines discriminant alignment and neuron pruning, along with a network growing strategy, for effective model compression.

Findings

01

Achieves higher accuracy than comparable compact models on ImageNet.

02

Effectively reduces model complexity while maintaining or improving performance.

03

Demonstrates superiority over residual and other compact networks at similar sizes.

Abstract

Most of today's popular deep architectures are hand-engineered to be generalists. However, this design procedure usually leads to massive redundant, useless, or even harmful features for specific tasks. Unnecessarily high complexities render deep nets impractical for many real-world applications, especially those without powerful GPU support. In this paper, we attempt to derive task-dependent compact models from a deep discriminant analysis perspective. We propose an iterative and proactive approach for classification tasks which alternates between (1) a pushing step, with an objective to simultaneously maximize class separation, penalize co-variances, and push deep discriminants into alignment with a compact set of neurons, and (2) a pruning step, which discards less useful or even interfering neurons. Deconvolution is adopted to reverse 'unimportant' filters' effects and recover…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI

MethodsPruning · 1x1 Convolution · Max Pooling · Convolution · Inception Module