Sortblock: Similarity-Aware Feature Reuse for Diffusion Model

Hanqi Chen; Xu Zhang; Xiaoliu Guan; Lielin Jiang; Guanzhong Wang; Zeyu Chen; Yi Liu

arXiv:2508.00412·cs.CV·January 8, 2026

Sortblock: Similarity-Aware Feature Reuse for Diffusion Model

Hanqi Chen, Xu Zhang, Xiaoliu Guan, Lielin Jiang, Guanzhong Wang, Zeyu Chen, Yi Liu

PDF

Open Access 1 Video

TL;DR

Sortblock is a training-free method that accelerates diffusion models by adaptively caching and skipping redundant features based on their similarity, doubling inference speed with minimal quality loss.

Contribution

It introduces a novel similarity-aware caching framework that dynamically determines which features to reuse or recompute during diffusion model inference.

Findings

01

Over 2x inference speedup achieved.

02

Minimal degradation in generated output quality.

03

Applicable across various diffusion transformer architectures.

Abstract

Diffusion Transformers (DiTs) have demonstrated remarkable generative capabilities, particularly benefiting from Transformer architectures that enhance visual and artistic fidelity. However, their inherently sequential denoising process results in high inference latency, limiting their deployment in real-time scenarios. Existing training-free acceleration approaches typically reuse intermediate features at fixed timesteps or layers, overlooking the evolving semantic focus across denoising stages and Transformer blocks.To address this, we propose Sortblock, a training-free inference acceleration framework that dynamically caches block-wise features based on their similarity across adjacent timesteps. By ranking the evolution of residuals, Sortblock adaptively determines a recomputation ratio, selectively skipping redundant computations while preserving generation quality. Furthermore, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Sortblock: Similarity-Aware Feature Reuse for Diffusion Model· underline

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Music Technology and Sound Studies · Advanced Memory and Neural Computing