1.58-bit FLUX

Chenglin Yang; Celong Liu; Xueqing Deng; Dongwon Kim; Xing Mei,; Xiaohui Shen; Liang-Chieh Chen

arXiv:2412.18653·cs.CV·December 30, 2024

1.58-bit FLUX

Chenglin Yang, Celong Liu, Xueqing Deng, Dongwon Kim, Xing Mei,, Xiaohui Shen, Liang-Chieh Chen

PDF

Open Access 1 Models

TL;DR

This paper introduces 1.58-bit FLUX, a quantization method for text-to-image models that reduces storage and computation without sacrificing image quality, using self-supervision and custom hardware optimization.

Contribution

The paper presents the first quantization of FLUX model to 1.58-bit weights without data access, along with a custom kernel for efficient inference.

Findings

01

7.7x reduction in model storage

02

5.1x reduction in inference memory

03

Maintains image quality on benchmarks

Abstract

We present 1.58-bit FLUX, the first successful approach to quantizing the state-of-the-art text-to-image generation model, FLUX.1-dev, using 1.58-bit weights (i.e., values in {-1, 0, +1}) while maintaining comparable performance for generating 1024 x 1024 images. Notably, our quantization method operates without access to image data, relying solely on self-supervision from the FLUX.1-dev model. Additionally, we develop a custom kernel optimized for 1.58-bit operations, achieving a 7.7x reduction in model storage, a 5.1x reduction in inference memory, and improved inference latency. Extensive evaluations on the GenEval and T2I Compbench benchmarks demonstrate the effectiveness of 1.58-bit FLUX in maintaining generation quality while significantly enhancing computational efficiency.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
ugonfor/1.58bit-flux-reproduction
model

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptical Network Technologies · Photonic and Optical Devices · Neural Networks and Reservoir Computing

MethodsSeven Ways to Contact How can i speak to someone at Tangem Wallet Customer Support : A Step by Step Guide