Training CNNs faster with Dynamic Input and Kernel Downsampling

Zissis Poulos; Ali Nouri; Andreas Moshovos

arXiv:1910.06548·cs.LG·October 16, 2019

Training CNNs faster with Dynamic Input and Kernel Downsampling

Zissis Poulos, Ali Nouri, Andreas Moshovos

PDF

Open Access

TL;DR

This paper introduces a method to accelerate CNN training by intermittently downsampling inputs and filters, reducing computation and memory usage while maintaining accuracy.

Contribution

The authors propose a novel training approach combining input and kernel downsampling with interleaved passes, significantly reducing training time with minimal accuracy loss.

Findings

01

Achieved up to 23% reduction in training time.

02

Minimal loss in validation accuracy.

03

Effective for residual architectures.

Abstract

We reduce training time in convolutional networks (CNNs) with a method that, for some of the mini-batches: a) scales down the resolution of input images via downsampling, and b) reduces the forward pass operations via pooling on the convolution filters. Training is performed in an interleaved fashion; some batches undergo the regular forward and backpropagation passes with original network parameters, whereas others undergo a forward pass with pooled filters and downsampled inputs. Since pooling is differentiable, the gradients of the pooled filters propagate to the original network parameters for a standard parameter update. The latter phase requires fewer floating point operations and less storage due to the reduced spatial dimensions in feature maps and filters. The key idea is that this phase leads to smaller and approximate updates and thus slower learning, but at significantly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis

MethodsConvolution