Efficient Spatially Sparse Inference for Conditional GANs and Diffusion   Models

Muyang Li; Ji Lin; Chenlin Meng; Stefano Ermon; Song Han; and Jun-Yan; Zhu

arXiv:2211.02048·cs.CV·September 15, 2023·5 cites

Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models

Muyang Li, Ji Lin, Chenlin Meng, Stefano Ermon, Song Han, and Jun-Yan, Zhu

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Spatially Sparse Inference (SSI), a technique that accelerates image editing in generative models by reusing features and selectively computing only edited regions, significantly reducing inference time.

Contribution

The paper proposes SSI and SIGE, enabling efficient, incremental inference for conditional GANs and diffusion models, with extensions for attention layers and support for new hardware.

Findings

01

SIGE accelerates DDPM by up to 4.6x on GPU

02

SIGE reduces Stable Diffusion inference time by up to 7.2x

03

Supports large, sequential edits with significant speedups

Abstract

During image editing, existing deep generative models tend to re-synthesize the entire output from scratch, including the unedited regions. This leads to a significant waste of computation, especially for minor editing operations. In this work, we present Spatially Sparse Inference (SSI), a general-purpose technique that selectively performs computation for edited regions and accelerates various generative models, including both conditional GANs and diffusion models. Our key observation is that users prone to gradually edit the input image. This motivates us to cache and reuse the feature maps of the original image. Given an edited image, we sparsely apply the convolutional filters to the edited regions while reusing the cached features for the unedited areas. Based on our algorithm, we further propose Sparse Incremental Generative Engine (SIGE) to convert the computation reduction to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lmxyy/sige
pytorchOfficial

Videos

Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models· slideslive

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image and Video Retrieval Techniques · Advanced Neural Network Applications

MethodsDiffusion · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings