TL;DR
This paper introduces Four Over Six (4/6), an adaptive block scaling method for NVFP4 quantization that reduces error and improves performance in large language models.
Contribution
The paper proposes a novel adaptive scaling technique for NVFP4 quantization, enhancing accuracy and efficiency in large language model training and inference.
Findings
4/6 reduces quantization error compared to existing NVFP4 methods.
Training loss with 4/6 is closer to BF16 than previous NVFP4 recipes.
Efficient implementation of 4/6 yields performance gains on modern hardware.
Abstract
As large language models have grown larger, interest has grown in low-precision numerical formats such as NVFP4 as a way to improve speed and reduce memory usage. However, quantizing models to NVFP4 remains challenging as the lack of precision generally degrades model performance. In this work, we address this issue with Four Over Six (4/6), a modification to the block-scaled NVFP4 quantization algorithm that yields reduced quantization error. Unlike integer formats, floating point formats have non-uniform step sizes which create larger quantization error on larger values. 4/6 takes advantage of this by adaptively scaling some blocks to smaller FP4 values, making the distribution of representable values more uniform and reducing quantization error for near-maximal values. We show that 4/6 can be implemented efficiently on modern hardware accelerators, resulting in performance gains…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗DataSnake/Muse-12B-NVFP4-4over6model· 18 dl· ♡ 118 dl♡ 1
- 🤗DataSnake/Wayfarer-2-12B-NVFP4-4over6model· 11 dl· ♡ 211 dl♡ 2
- 🤗DataSnake/Wayfarer-12B-NVFP4-4over6model· 4 dl· ♡ 14 dl♡ 1
- 🤗DataSnake/Mistral-Nemo-Instruct-2407-NVFP4-FP8model· 48 dl48 dl
- 🤗DataSnake/Mistral-Nemo-Instruct-2407-NVFP4-4over6model· 12 dl12 dl
- 🤗DataSnake/Muse-12B-NVFP4-FP8model· 413 dl413 dl
- 🤗DataSnake/Wayfarer-12B-NVFP4-FP8model· 282 dl282 dl
- 🤗DataSnake/Wayfarer-2-12B-NVFP4-FP8model· 300 dl300 dl
- 🤗DataSnake/Mistral-Nemo-Instruct-2407-Down-4over6model· 7 dl7 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
