Cached Adaptive Token Merging: Dynamic Token Reduction and Redundant   Computation Elimination in Diffusion Model

Omid Saghatchian; Atiyeh Gh. Moghadam; Ahmad Nickabadi

arXiv:2501.00946·cs.CV·January 3, 2025

Cached Adaptive Token Merging: Dynamic Token Reduction and Redundant Computation Elimination in Diffusion Model

Omid Saghatchian, Atiyeh Gh. Moghadam, Ahmad Nickabadi

PDF

Open Access 1 Repo

TL;DR

This paper introduces cached adaptive token merging (CA-ToMe), a method that reduces computational costs in diffusion models by adaptively merging similar tokens with caching, achieving faster inference without sacrificing quality.

Contribution

The paper proposes a novel adaptive token merging technique with caching for diffusion models, improving speed while maintaining image quality, and is training-free.

Findings

01

Achieves 1.24x speedup in denoising process

02

Maintains the same FID scores as existing methods

03

Operates without additional training

Abstract

Diffusion models have emerged as a promising approach for generating high-quality, high-dimensional images. Nevertheless, these models are hindered by their high computational cost and slow inference, partly due to the quadratic computational complexity of the self-attention mechanisms with respect to input size. Various approaches have been proposed to address this drawback. One such approach focuses on reducing the number of tokens fed into the self-attention, known as token merging (ToMe). In our method, which is called cached adaptive token merging(CA-ToMe), we calculate the similarity between tokens and then merge the r proportion of the most similar tokens. However, due to the repetitive patterns observed in adjacent steps and the variation in the frequency of similarities, we aim to enhance this approach by implementing an adaptive threshold for merging tokens and adding a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

omidiu/ca_tome
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Storage Technologies · Distributed systems and fault tolerance