Learn&Drop: Fast Learning of CNNs based on Layer Dropping

Giorgio Cruciata; Luca Cruciata; Liliana Lo Presti; Jan Van Gemert; Marco La Cascia

arXiv:2604.23403·cs.CV·April 28, 2026

Learn&Drop: Fast Learning of CNNs based on Layer Dropping

Giorgio Cruciata, Luca Cruciata, Liliana Lo Presti, Jan Van Gemert, Marco La Cascia

PDF

TL;DR

This paper introduces a novel training method for CNNs that reduces forward pass computations during training by dynamically dropping layers based on parameter change scores, significantly speeding up training.

Contribution

The method uniquely focuses on decreasing training forward operations through layer dropping, validated on VGG and ResNet architectures with substantial speed improvements.

Findings

01

Training time more than halved without significant accuracy loss

02

FLOPs reduction in forward pass ranges from 17.83% to 83.74%

03

Effective on MNIST, CIFAR-10, and Imagenette datasets

Abstract

This paper proposes a new method to improve the training efficiency of deep convolutional neural networks. During training, the method evaluates scores to measure how much each layer's parameters change and whether the layer will continue learning or not. Based on these scores, the network is scaled down such that the number of parameters to be learned is reduced, yielding a speed up in training. Unlike state-of-the-art methods that try to compress the network to be used in the inference phase or to limit the number of operations performed in the backpropagation phase, the proposed method is novel in that it focuses on reducing the number of operations performed by the network in the forward propagation during training. The proposed training strategy has been validated on two widely used architecture families: VGG and ResNet. Experiments on MNIST, CIFAR-10 and Imagenette show that, with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.