Ditto: Accelerating Diffusion Model via Temporal Value Similarity

Sungbin Kim; Hyunwuk Lee; Wonho Cho; Mincheol Park; and Won Woo Ro

arXiv:2501.11211·cs.AR·January 22, 2025

Ditto: Accelerating Diffusion Model via Temporal Value Similarity

Sungbin Kim, Hyunwuk Lee, Wonho Cho, Mincheol Park, and Won Woo Ro

PDF

Open Access

TL;DR

Ditto introduces a novel algorithm and hardware accelerator that leverage temporal value similarity in diffusion models to significantly reduce computation and energy costs in image generation tasks.

Contribution

The paper proposes the Ditto algorithm and hardware, exploiting temporal similarity and quantization to accelerate diffusion models with reduced energy consumption.

Findings

01

Achieves up to 1.5x speedup over existing accelerators.

02

Saves 17.74% energy compared to other hardware.

03

Effectively reduces computation by leveraging temporal value similarity.

Abstract

Diffusion models achieve superior performance in image generation tasks. However, it incurs significant computation overheads due to its iterative structure. To address these overheads, we analyze this iterative structure and observe that adjacent time steps in diffusion models exhibit high value similarity, leading to narrower differences between consecutive time steps. We adapt these characteristics to a quantized diffusion model and reveal that the majority of these differences can be represented with reduced bit-width, and even zero. Based on our observations, we propose the Ditto algorithm, a difference processing algorithm that leverages temporal similarity with quantization to enhance the efficiency of diffusion models. By exploiting the narrower differences and the distributive property of layer operations, it performs full bit-width operations for the initial time step and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsDiffusion