FlexCache: Flexible Approximate Cache System for Video Diffusion

Desen Sun; Henry Tian; Tim Lu; Sihang Liu

arXiv:2501.04012·cs.MM·January 9, 2025

TL;DR

FlexCache is a novel cache system for video diffusion that compresses caches and decouples objects from backgrounds, significantly improving throughput and reducing costs in text-to-video generation.

Contribution

We introduce FlexCache, a flexible approximate cache system with cache compression and object-background decoupling, tailored for efficient video diffusion.

Findings

1. Achieves 1.26x higher throughput
2. Reduces cache storage by 6.7x on average
3. Cuts costs by 25% compared to existing systems

Abstract

Text-to-video applications are receiving increasing public attention. Among the available approaches, diffusion models have emerged as the most prominent, offering impressive quality in visual content generation. However, they still suffer from substantial computational cost, often requiring several minutes to generate a single video. While prior research has addressed the computational overhead of text-to-image diffusion models, those techniques are not directly suitable for video diffusion models, because video generation has significantly larger cache requirements and higher computational demands. We present FlexCache, a flexible approximate cache system that addresses these challenges with two main designs. First, we compress the caches before saving them to storage; this compression strategy reduces storage consumption by 6.7x on average. Then we find that the…
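The abstract does not say how FlexCache compresses its caches or how it matches requests to them, so the following is only a generic sketch of an approximate cache with lossy compression, not the authors' method. It assumes cached states are flat lists of floats in [-1, 1], that compression is 8-bit quantization followed by zlib, and that lookup uses cosine similarity over prompt embeddings. The names `ApproxCache`, `compress_latent`, `decompress_latent`, and the `threshold` parameter are all hypothetical.

```python
import math
import zlib


def compress_latent(values):
    """Lossy compression (assumed scheme): quantize floats in [-1, 1] to 8 bits, then deflate."""
    quantized = bytes(min(255, max(0, round((v + 1.0) * 127.5))) for v in values)
    return zlib.compress(quantized)


def decompress_latent(blob):
    """Inverse of compress_latent, up to quantization error."""
    return [b / 127.5 - 1.0 for b in zlib.decompress(blob)]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class ApproxCache:
    """Hypothetical approximate cache: returns the stored state whose key is
    most similar to the query, provided the similarity reaches the threshold."""

    def __init__(self, threshold=0.9):
        self.threshold = threshold
        self.entries = []  # list of (key_embedding, compressed_blob)

    def put(self, key, latent):
        # Compress before storing, mirroring the "compress, then save" order in the abstract.
        self.entries.append((list(key), compress_latent(latent)))

    def get(self, query):
        best = max(self.entries, key=lambda e: cosine(query, e[0]), default=None)
        if best is None or cosine(query, best[0]) < self.threshold:
            return None  # miss: the caller would run the full generation
        return decompress_latent(best[1])


cache = ApproxCache(threshold=0.9)
cache.put([1.0, 0.0], [0.0, 0.5, -1.0, 1.0])
hit = cache.get([0.99, 0.1])   # similar key: approximate hit
miss = cache.get([0.0, 1.0])   # dissimilar key: miss
```

In this sketch a hit returns a slightly perturbed copy of the stored state, which is the trade-off an approximate cache accepts: a smaller footprint and reuse across similar requests in exchange for exactness.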

Taxonomy

Topics: Video Coding and Compression Technologies · Multimedia Communication and Technology · Image and Video Quality Assessment

Methods: Softmax · Attention Is All You Need · Diffusion