FlexCache: Flexible Approximate Cache System for Video Diffusion
Desen Sun, Henry Tian, Tim Lu, Sihang Liu

TL;DR
FlexCache is a novel cache system for video diffusion that compresses caches and decouples objects from backgrounds, significantly improving throughput and reducing costs in text-to-video generation.
Contribution
We introduce FlexCache, a flexible approximate cache system with cache compression and object-background decoupling, tailored for efficient video diffusion.
Findings
Achieves 1.26x higher throughput
Reduces cache storage by 6.7x on average
Cuts costs by 25% compared to existing systems
Abstract
Text-to-Video applications receive increasing attention from the public. Among these, diffusion models have emerged as the most prominent approach, offering impressive quality in visual content generation. However, it still suffers from substantial computational complexity, often requiring several minutes to generate a single video. While prior research has addressed the computational overhead in text-to-image diffusion models, the techniques developed are not directly suitable for video diffusion models due to the significantly larger cache requirements and enhanced computational demands associated with video generation. We present FlexCache, a flexible approximate cache system that addresses the challenges in two main designs. First, we compress the caches before saving them to storage. Our compression strategy can reduce 6.7 times consumption on average. Then we find that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Coding and Compression Technologies · Multimedia Communication and Technology · Image and Video Quality Assessment
MethodsSoftmax · Attention Is All You Need · Diffusion
