TL;DR
InPlace-ABN is a memory-efficient layer combining BatchNorm and activation, reducing memory usage by up to 50% during training with minimal computational overhead, enabling scalable deep learning.
Contribution
The paper introduces InPlace-ABN, a novel layer that reduces memory footprint in DNN training by combining BatchNorm and activation with inversion techniques.
Findings
Achieves up to 50% memory savings during training.
Maintains state-of-the-art accuracy on ImageNet-1k.
Sets new state-of-the-art results in semantic segmentation tasks.
Abstract
In this work we present In-Place Activated Batch Normalization (InPlace-ABN) - a novel approach to drastically reduce the training memory footprint of modern deep neural networks in a computationally efficient way. Our solution substitutes the conventionally used succession of BatchNorm + Activation layers with a single plugin layer, hence avoiding invasive framework surgery while providing straightforward applicability for existing deep learning frameworks. We obtain memory savings of up to 50% by dropping intermediate results and by recovering required information during the backward pass through the inversion of stored forward results, with only minor increase (0.8-2%) in computation time. Also, we demonstrate how frequently used checkpointing approaches can be made computationally as efficient as InPlace-ABN. In our experiments on image classification, we demonstrate on-par results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsIn-Place Activated Batch Normalization · Batch Normalization
