DIVISION: Memory Efficient Training via Dual Activation Precision

Guanchu Wang; Zirui Liu; Zhimeng Jiang; Ninghao Liu; Na Zou; Xia Hu

arXiv:2208.04187·cs.LG·May 23, 2023·1 cites

TL;DR

DIVISION is a memory-efficient DNN training method that compresses the high-frequency component of activation maps, shrinking them by more than 10x while maintaining accuracy and throughput.

Contribution

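The paper proposes an activation compression technique that splits activation maps into low- and high-frequency components, keeps the low-frequency component at high precision, and compresses the high-frequency component. This avoids the bit-width search used by prior work, making activation compressed training simpler while improving memory efficiency.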

Findings

01. Achieves over 10x activation map compression.

02. Maintains competitive model accuracy.

03. Offers better performance than existing methods.

Abstract

Activation compressed training provides a solution towards reducing the memory cost of training deep neural networks (DNNs). However, state-of-the-art work combines a search of quantization bit-width with the training, which makes the procedure complicated and less transparent. To this end, we propose a simple and effective method to compress DNN training. Our method is motivated by an instructive observation: DNN backward propagation mainly utilizes the low-frequency component (LFC) of the activation maps, while the majority of memory is for caching the high-frequency component (HFC) during the training. This indicates the HFC of activation maps is highly redundant and compressible during DNN training, which inspires our proposed Dual Activation Precision (DIVISION). During the training, DIVISION preserves the high-precision copy of LFC and compresses the HFC into a light-weight copy…
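The dual-precision idea in the abstract can be illustrated with a small NumPy sketch. It is not the authors' implementation: the abstract is truncated before it specifies the operators, so this sketch assumes block-wise averaging as the low-pass filter that yields the LFC, and uniform min-max quantization as the HFC compressor. The function names (`split_and_compress`, `reconstruct`, `compression_ratio`) and the defaults `block=4` and `bits=2` are hypothetical, chosen only for illustration.

```python
import numpy as np


def split_and_compress(act, block=4, bits=2):
    """Split a 2-D activation map into a float32 LFC and a low-bit HFC.

    Assumption: LFC = block-wise mean, HFC = residual, quantized uniformly.
    """
    h, w = act.shape
    assert h % block == 0 and w % block == 0, "shape must be divisible by block"
    # LFC: mean of each (block x block) tile, kept at full precision.
    lfc = act.reshape(h // block, block, w // block, block).mean(axis=(1, 3))
    lfc = lfc.astype(np.float32)
    # HFC: what remains after subtracting the upsampled LFC.
    upsampled = np.kron(lfc, np.ones((block, block), dtype=np.float32))
    hfc = act - upsampled
    # Uniform min-max quantization of the HFC to 2**bits levels.
    levels = 2 ** bits - 1
    lo = float(hfc.min())
    scale = (float(hfc.max()) - lo) / levels
    if scale == 0.0:
        scale = 1.0  # constant HFC: any scale reconstructs it exactly
    # Codes are stored one per byte here; real bit-packing is omitted.
    codes = np.round((hfc - lo) / scale).astype(np.uint8)
    return lfc, codes, lo, scale


def reconstruct(lfc, codes, lo, scale, block=4):
    """Rebuild an approximate activation map from the LFC and HFC codes."""
    upsampled = np.kron(lfc, np.ones((block, block), dtype=np.float32))
    hfc_hat = codes.astype(np.float32) * scale + lo
    return upsampled + hfc_hat


def compression_ratio(block=4, bits=2, full_bits=32):
    """Idealized ratio: full-precision bits over stored bits per element.

    Ignores the two scalars (lo, scale) and assumes perfectly packed codes.
    """
    return full_bits / (full_bits / block ** 2 + bits)
```

Under these assumptions, each element costs `full_bits / block**2` bits for its share of the LFC plus `bits` bits for its HFC code, so larger blocks and fewer bits both raise the ratio. The reconstruction error per element is bounded by half a quantization step, which `reconstruct` makes easy to check on random inputs.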

Code & Models

Repositories

guanchuwang/division · PyTorch · Official

Videos

DIVISION: Memory Efficient Training via Dual Activation Precision · SlidesLive

Taxonomy

Topics: Advanced Neural Network Applications · Image Enhancement Techniques · Speech and Audio Processing