H2-Cache: A Novel Hierarchical Dual-Stage Cache for High-Performance Acceleration of Generative Diffusion Models
Mingyu Sung, Il-Min Kim, Sangseok Yun, Jae-Mo Kang

TL;DR
H2-Cache introduces a hierarchical dual-stage caching mechanism for diffusion models, significantly accelerating inference while preserving image quality, by separating denoising into structure and detail stages with a novel similarity estimation technique.
Contribution
The paper presents H2-Cache, a novel hierarchical caching method with dual thresholds and pooled feature summarization, improving speed and quality in diffusion model inference.
Findings
Achieves up to 5.08x acceleration on Flux architecture.
Maintains image quality nearly identical to baseline.
Outperforms existing caching methods in speed and fidelity.
Abstract
Diffusion models have emerged as state-of-the-art in image generation, but their practical deployment is hindered by the significant computational cost of their iterative denoising process. While existing caching techniques can accelerate inference, they often create a challenging trade-off between speed and fidelity, suffering from quality degradation and high computational overhead. To address these limitations, we introduce H2-Cache, a novel hierarchical caching mechanism designed for modern generative diffusion model architectures. Our method is founded on the key insight that the denoising process can be functionally separated into a structure-defining stage and a detail-refining stage. H2-cache leverages this by employing a dual-threshold system, using independent thresholds to selectively cache each stage. To ensure the efficiency of our dual-check approach, we introduce pooled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neuroimaging Techniques and Applications · Cell Image Analysis Techniques
