Frequency-Aware Error-Bounded Caching for Accelerating Diffusion Transformers

Guandong Li

arXiv:2603.05315·cs.CV·March 6, 2026

Frequency-Aware Error-Bounded Caching for Accelerating Diffusion Transformers

Guandong Li

PDF

Open Access

TL;DR

This paper introduces SpectralCache, a novel caching framework that leverages frequency-aware, error-bounded strategies to significantly accelerate diffusion transformers during inference without sacrificing output quality.

Contribution

It identifies non-uniformities in DiT denoising and proposes a unified, training-free caching method with frequency decomposition and error management for faster inference.

Findings

01

Achieves 2.46x speedup on FLUX.1-schnell at 512x512 resolution.

02

Maintains comparable image quality with LPIPS difference less than 1%.

03

Outperforms existing caching methods like TeaCache by 16% in speed.

Abstract

Diffusion Transformers (DiTs) have emerged as the dominant architecture for high-quality image and video generation, yet their iterative denoising process incurs substantial computational cost during inference. Existing caching methods accelerate DiTs by reusing intermediate computations across timesteps, but they share a common limitation: treating the denoising process as uniform across time,depth, and feature dimensions. In this work, we identify three orthogonal axes of non-uniformity in DiT denoising: (1) temporal -- sensitivity to caching errors varies dramatically across the denoising trajectory; (2) depth -- consecutive caching decisions lead to cascading approximation errors; and (3) feature -- different components of the hidden state exhibit heterogeneous temporal dynamics. Based on these observations, we propose SpectralCache, a unified caching framework comprising…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Enhancement Techniques · Generative Adversarial Networks and Image Synthesis · Image and Video Quality Assessment