A Novel Memory-Efficient Deep Learning Training Framework via Error-Bounded Lossy Compression

Sian Jin; Guanpeng Li; Shuaiwen Leon Song; Dingwen Tao

arXiv:2011.09017·cs.DC·November 24, 2020

TL;DR

This paper introduces a memory-efficient deep learning training framework that applies error-bounded lossy compression to activation data, so that larger models and batch sizes fit in accelerator memory with minimal accuracy loss.

Contribution

It designs a novel error-bounded lossy compression scheme with theoretical error control and adaptive configuration for memory reduction during DNN training.

Findings

01

Reduces training memory consumption by up to 13.5x over baseline training, with minimal accuracy loss.

02

Reduces training memory consumption by up to 1.8x over another state-of-the-art compression-based framework.

03

Bounds the compression error so that the reduced memory footprint comes with little or no loss in model accuracy.

Abstract

Deep neural networks (DNNs) are becoming deeper, wider, and more non-linear due to growing demands on prediction accuracy and analysis quality. When training a DNN model, the intermediate activation data must be saved in memory during forward propagation and then restored for backward propagation. However, state-of-the-art accelerators such as GPUs have very limited memory capacity due to hardware design constraints, which significantly limits the maximum batch size and hence the performance speedup when training large-scale DNNs. In this paper, we propose a novel memory-driven high-performance DNN training framework that leverages error-bounded lossy compression to significantly reduce the memory requirement for training, allowing larger networks to be trained. Different from the state-of-the-art solutions that adopt image-based lossy compressors…
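To make the term "error-bounded" concrete, here is a minimal sketch of a uniform scalar quantizer with an absolute error bound. It is not the paper's compressor; the function names (`compress`, `decompress`, `max_abs_error`), the random stand-in for activation data, and the bound of 0.01 are all assumptions chosen for illustration. It shows only the basic guarantee: each restored value differs from the original by at most the chosen bound.

```python
import random


def compress(values, error_bound):
    """Map each float to an integer bin index; bins are 2 * error_bound wide."""
    step = 2.0 * error_bound
    return [round(v / step) for v in values]


def decompress(codes, error_bound):
    """Restore each value as the center of its bin."""
    step = 2.0 * error_bound
    return [c * step for c in codes]


def max_abs_error(original, restored):
    """Largest pointwise difference between original and restored values."""
    return max(abs(a - b) for a, b in zip(original, restored))


# Hypothetical stand-in for activation data saved during forward propagation.
random.seed(0)
activations = [random.uniform(-1.0, 1.0) for _ in range(1000)]

error_bound = 1e-2  # assumed absolute error bound, chosen for illustration
codes = compress(activations, error_bound)       # stored in place of activations
restored = decompress(codes, error_bound)        # recovered for backward propagation

print("max abs error:", max_abs_error(activations, restored))
print("distinct codes:", len(set(codes)))
```

Because rounding to the nearest bin center moves a value by at most half a bin width, the reconstruction error stays within `error_bound` (up to floating-point rounding). The integer codes take far fewer distinct values than the original floats, which is what makes them cheaper to store or to pass to a further lossless encoding stage.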
