LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Image and Video Generation

Lianwei Yang; Haokun Lin; Tianchen Zhao; Yichen Wu; Hongyu Zhu; Ruiqi Xie; Zhenan Sun; Yu Wang; Qingyi Gu

arXiv:2508.03485·cs.CV·September 24, 2025

Open Access

TL;DR

LRQ-DiT is a post-training quantization framework for diffusion transformers that reduces model size and computation while maintaining image and video generation quality.

Contribution

The paper proposes Twin-Log Quantization and an Adaptive Rotation Scheme to address the distribution and outlier challenges of low-bit PTQ for DiTs.
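This listing does not describe how the Adaptive Rotation Scheme works, so the following is only a generic sketch of why rotations help with outliers in general, not the paper's method. It assumes an orthogonal Hadamard rotation and a synthetic input with one outlier channel; the names `hadamard`, `x`, `w`, and `q` are illustrative. Rotating the input and counter-rotating the weights leaves the layer output unchanged while spreading the outlier across all channels.

```python
import numpy as np

def hadamard(n):
    """Orthogonal Hadamard matrix via the Sylvester construction (n must be a power of two)."""
    h = np.array([[1.0]])
    while h.shape[0] < n:
        h = np.kron(h, np.array([[1.0, 1.0], [1.0, -1.0]]))
    return h / np.sqrt(n)

rng = np.random.default_rng(0)
d = 64
x = rng.normal(size=(8, d))   # hypothetical layer input
x[:, 3] = 100.0               # assumption: a single outlier channel
w = rng.normal(size=(d, 16))  # hypothetical linear-layer weights

q = hadamard(d)
x_rot = x @ q                 # rotate the input
w_rot = q.T @ w               # counter-rotate the weights

# (x q)(q^T w) = x w because q is orthogonal, so the layer output is preserved.
same_output = np.allclose(x @ w, x_rot @ w_rot)
max_before = np.abs(x).max()
max_after = np.abs(x_rot).max()

print("output preserved:", same_output)
print("max |x| before rotation:", max_before)
print("max |x| after rotation:", max_after)
```

A smaller maximum magnitude means a quantizer's range is no longer dictated by one channel, which is the usual motivation for rotation-based PTQ.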

Findings

01. Achieves high-quality image and video generation after quantization

02. Reduces model size and inference cost substantially

03. Maintains performance comparable to full-precision models

Abstract

Diffusion Transformers (DiTs) have achieved impressive performance in text-to-image and text-to-video generation. However, their high computational cost and large parameter sizes pose significant challenges for usage in resource-constrained scenarios. Effective compression of models has become a crucial issue that urgently needs to be addressed. Post-training quantization (PTQ) is a promising solution to reduce memory usage and accelerate inference, but existing PTQ methods suffer from severe performance degradation under extreme low-bit settings. After experiments and analysis, we identify two key obstacles to low-bit PTQ for DiTs: (1) the weights of DiT models follow a Gaussian-like distribution with long tails, causing uniform quantization to poorly allocate intervals and leading to significant quantization errors. This issue has been observed in the linear layer weights of different…
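The first obstacle named in the abstract can be illustrated with a small experiment. This is a minimal sketch under stated assumptions, not the paper's Twin-Log Quantization: the weights are synthetic (a narrow Gaussian bulk plus a few large values standing in for the long tail), and `uniform_quantize` and `log2_quantize` are hypothetical helpers implementing plain symmetric uniform and plain log2 quantization at 4 bits.

```python
import numpy as np

def uniform_quantize(w, bits):
    """Symmetric uniform quantization: evenly spaced levels over [-max|w|, max|w|]."""
    qmax = 2 ** (bits - 1) - 1
    scale = np.abs(w).max() / qmax
    return np.clip(np.round(w / scale), -qmax, qmax) * scale

def log2_quantize(w, bits):
    """Sign-magnitude log2 quantization: magnitudes max|w| * 2^-k, so levels are dense near zero."""
    n_levels = 2 ** (bits - 1) - 1
    w_max = np.abs(w).max()
    with np.errstate(divide="ignore"):  # log2(0) = -inf is handled by the clip below
        exponent = np.round(-np.log2(np.abs(w) / w_max))
    exponent = np.clip(exponent, 0, n_levels - 1)
    return np.sign(w) * w_max * 2.0 ** (-exponent)

rng = np.random.default_rng(0)
bulk = rng.normal(0.0, 0.02, size=4096)      # assumption: Gaussian-like bulk
tail = np.array([1.0, -1.0, 0.8, -0.8])      # assumption: a few long-tail weights
w = np.concatenate([bulk, tail])

w_uniform = uniform_quantize(w, bits=4)
w_log = log2_quantize(w, bits=4)

mse_uniform = np.mean((w - w_uniform) ** 2)
mse_log = np.mean((w - w_log) ** 2)
zero_fraction = np.mean(w_uniform == 0)

print("fraction of weights the uniform grid rounds to zero:", zero_fraction)
print("MSE uniform:", mse_uniform)
print("MSE log2:   ", mse_log)
```

Because the tail sets the range, the uniform grid's step is wider than almost the entire bulk, so most weights collapse to zero. The log-spaced grid places more of its levels near zero, where the weights actually are.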

Taxonomy

Topics: Advanced Memory and Neural Computing · Neural Networks and Reservoir Computing