LatentBox: Storing AI-Generated Images at Scale via a Latent-First Design
Zirui Wang, Yunjia Zheng, Tingfeng Lan, Zhaoyuan Su, Haoran Ni, Juncheng Yang, Yue Cheng

TL;DR
LatentBox is a storage system for AI-generated images that significantly reduces storage needs by storing compressed latent representations and reconstructing images on demand, based on extensive access pattern analysis.
Contribution
It introduces a latent-first storage architecture for AI images, leveraging large-scale access analysis to optimize storage and retrieval, reducing storage by 78.7%.
Findings
Reduces persistent storage by 78.7%.
Maintains competitive latency with image-based storage.
Uses on-demand GPU reconstruction for efficiency.
Abstract
The explosive growth of AI-generated images has created a sustainability challenge for storage infrastructure. Platforms like Midjourney and Adobe Firefly already host billions of generative images, yet conventional object stores persist them as blobs with full-resolution pixels, consuming huge amounts of storage capacity and bandwidth. Unlike natural photos, however, AI-generated images can be deterministically reconstructed from compact, model-native latent tensors, making persistent image storage fundamentally redundant. This paper presents LatentBox, a latent-first storage system for AI-generated images. LatentBox treats compressed latents as durable storage objects and uses on-demand GPU reconstruction on the read path to trade inexpensive compute for large persistent storage savings. Our design is guided by the first large-scale analysis of AI-generated image access we are aware…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
