Convolutional Neural Network Simplification with Progressive Retraining

D. Osaku; J.F. Gomes; A.X. Falc\~ao

arXiv:2101.04699·cs.LG·August 19, 2021

Convolutional Neural Network Simplification with Progressive Retraining

D. Osaku, J.F. Gomes, A.X. Falc\~ao

PDF

TL;DR

This paper introduces a novel CNN simplification method called progressive retraining, which selectively retrains layers after kernel elimination based on relevance criteria, leading to more effective model simplification and better performance than existing methods.

Contribution

The paper proposes a new layer-by-layer kernel elimination approach with progressive retraining guided by relevance criteria, improving CNN simplification effectiveness.

Findings

01

Outperforms two popular kernel pruning methods and a state-of-the-art approach.

02

Achieves significant model simplification while maintaining accuracy.

03

Demonstrates effectiveness on four challenging image datasets.

Abstract

Kernel pruning methods have been proposed to speed up, simplify, and improve explanation of convolutional neural network (CNN) models. However, the effectiveness of a simplified model is often below the original one. In this letter, we present new methods based on objective and subjective relevance criteria for kernel elimination in a layer-by-layer fashion. During the process, a CNN model is retrained only when the current layer is entirely simplified, by adjusting the weights from the next layer to the first one and preserving weights of subsequent layers not involved in the process. We call this strategy \emph{progressive retraining}, differently from kernel pruning methods that usually retrain the entire model after each simplification action -- e.g., the elimination of one or a few kernels. Our subjective relevance criterion exploits the ability of humans in recognizing visual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsPruning