Collage: Light-Weight Low-Precision Strategy for LLM Training
Tao Yu, Gaurav Gupta, Karthick Gopalswamy, Amith Mamidala, Hao Zhou,, Jeffrey Huynh, Youngsuk Park, Ron Diamant, Anoop Deoras, Luke Huan

TL;DR
This paper introduces Collage, a low-precision training strategy for large language models that maintains accuracy while reducing memory and computation costs, enabling faster training with less hardware resources.
Contribution
Collage employs multi-component float representation to compensate for numerical errors in low-precision training, improving efficiency without sacrificing model performance.
Findings
Achieves similar or better training performance compared to mixed-precision methods.
Reduces memory usage by up to 23%.
Speeds up training by up to 3.7 times.
Abstract
Large models training is plagued by the intense compute cost and limited hardware memory. A practical solution is low-precision representation but is troubled by loss in numerical accuracy and unstable training rendering the model less useful. We argue that low-precision floating points can perform well provided the error is properly compensated at the critical locations in the training process. We propose Collage which utilizes multi-component float representation in low-precision to accurately perform operations with numerical errors accounted. To understand the impact of imprecision to training, we propose a simple and novel metric which tracks the lost information during training as well as differentiates various precision strategies. Our method works with commonly used low-precision such as half-precision (-bit floating points) and can be naturally extended to work with even…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Surface Polishing Techniques · Welding Techniques and Residual Stresses
