Training with Mixed-Precision Floating-Point Assignments

Wonyeol Lee; Rahul Sharma; Alex Aiken

arXiv:2301.13464·cs.LG·June 26, 2023·1 cites

Training with Mixed-Precision Floating-Point Assignments

Wonyeol Lee, Rahul Sharma, Alex Aiken

PDF

Open Access

TL;DR

This paper introduces a technique for assigning mixed-precision levels to tensors during neural network training, reducing memory usage while maintaining or improving accuracy compared to prior low-precision methods.

Contribution

It presents a novel method for generating precision assignments that optimize memory-accuracy tradeoffs in CNN training, outperforming existing approaches.

Findings

01

Typically achieves > 2x memory reduction

02

Maintains training accuracy with reduced precision

03

Avoids divergence issues seen in other baselines

Abstract

When training deep neural networks, keeping all tensors in high precision (e.g., 32-bit or even 16-bit floats) is often wasteful. However, keeping all tensors in low precision (e.g., 8-bit floats) can lead to unacceptable accuracy loss. Hence, it is important to use a precision assignment -- a mapping from all tensors (arising in training) to precision levels (high or low) -- that keeps most of the tensors in low precision and leads to sufficiently accurate models. We provide a technique that explores this memory-accuracy tradeoff by generating precision assignments for convolutional neural networks that (i) use less memory and (ii) lead to more accurate convolutional networks at the same time, compared to the precision assignments considered by prior work in low-precision floating-point training. We evaluate our technique on image classification tasks by training convolutional networks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Computational Physics and Python Applications · Tensor decomposition and applications