LayerDropBack: A Universally Applicable Approach for Accelerating   Training of Deep Networks

Evgeny Hershkovitch Neiterman; Gil Ben-Artzi

arXiv:2412.18027·cs.CV·December 25, 2024

LayerDropBack: A Universally Applicable Approach for Accelerating Training of Deep Networks

Evgeny Hershkovitch Neiterman, Gil Ben-Artzi

PDF

Open Access 1 Repo

TL;DR

LayerDropBack (LDB) is a simple method that accelerates training of deep networks by introducing randomness only in the backward pass, applicable across various architectures without modifying the network structure.

Contribution

LDB is a novel, architecture-agnostic training acceleration technique that maintains model integrity and improves training efficiency across diverse deep learning models.

Findings

01

Achieves 16.93% to 23.97% reduction in training time.

02

Maintains or improves model accuracy across multiple architectures.

03

Applicable to various network topologies without architectural modifications.

Abstract

Training very deep convolutional networks is challenging, requiring significant computational resources and time. Existing acceleration methods often depend on specific architectures or require network modifications. We introduce LayerDropBack (LDB), a simple yet effective method to accelerate training across a wide range of deep networks. LDB introduces randomness only in the backward pass, maintaining the integrity of the forward pass, guaranteeing that the same network is used during both training and inference. LDB can be seamlessly integrated into the training process of any model without altering its architecture, making it suitable for various network topologies. Our extensive experiments across multiple architectures (ViT, Swin Transformer, EfficientNet, DLA) and datasets (CIFAR-100, ImageNet) show significant training time reductions of 16.93\% to 23.97\%, while preserving or…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

neiterman21/ldb
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBrain Tumor Detection and Classification

Methods(FiLe@Against@Claim)How do I file a claim against Expedia? · Attention Is All You Need · Depthwise Convolution · Pointwise Convolution · Depthwise Separable Convolution · Linear Layer · *Communicated@Fast*How Do I Communicate to Expedia? · Average Pooling · Batch Normalization · Stochastic Depth