HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level and Fidelity-Rich Conditions in Diffusion Models

Shengkai Zhang; Nianhong Jiao; Tian Li; Chaojie Yang; Chenhui Xue; Boya Niu; Jun Gao

arXiv:2410.22901·cs.CV·October 31, 2024

TL;DR

This paper introduces HelloMeme, a method that enhances diffusion models with spatial knitting attentions, enabling complex downstream tasks like meme video generation while maintaining the models' generalization capabilities.

Contribution

The paper presents an adapter insertion technique that optimizes the attention mechanisms operating on 2D feature maps in diffusion models, improving adapter performance on high-level, fidelity-rich tasks.

Findings

01 Effective meme video generation demonstrated

02 Significant performance improvements observed

03 Good compatibility with SD1.5 models

Abstract

We propose an effective method for inserting adapters into text-to-image foundation models, which enables the execution of complex downstream tasks while preserving the generalization ability of the base model. The core idea of this method is to optimize the attention mechanism related to 2D feature maps, which enhances the performance of the adapter. This approach was validated on the task of meme video generation and achieved significant results. We hope this work can provide insights for post-training tasks of large text-to-image models. Additionally, as this method demonstrates good compatibility with SD1.5 derivative models, it holds certain value for the open-source community. Therefore, we will release the related code (https://songkey.github.io/hellomeme).
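The abstract does not spell out how attention over 2D feature maps is restructured. As an illustration only, the sketch below assumes that "spatial knitting" means attending within each row of the feature map and then within each column, so the 2D layout is never flattened into one long token sequence. The function names, the identity query/key/value projections, and the single head are all simplifying assumptions made here; none of this is taken from the released code.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Single-head self-attention over a list of vectors.

    Assumption: queries, keys, and values are the tokens themselves
    (identity projections), to keep the sketch dependency-free.
    """
    dim = len(tokens[0])
    out = []
    for q in tokens:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(dim) for k in tokens]
        weights = softmax(scores)
        out.append([sum(w * v[c] for w, v in zip(weights, tokens)) for c in range(dim)])
    return out


def row_then_column_attention(fmap):
    """Hypothetical row-then-column attention on an H x W x C nested list."""
    # Pass 1: each row is treated as a sequence of W tokens.
    rows = [self_attention(row) for row in fmap]
    height, width = len(rows), len(rows[0])
    # Pass 2: each column of the row-attended map is a sequence of H tokens.
    cols = [self_attention([rows[y][x] for y in range(height)]) for x in range(width)]
    # cols is W x H x C; transpose back to H x W x C.
    return [[cols[x][y] for x in range(width)] for y in range(height)]
```

Under this assumption, each pass attends over only W or H tokens at a time rather than all H×W positions at once, and the output keeps the same H x W x C shape as the input.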

Code & Models

Repositories

HelloVision/HelloMeme (PyTorch, official)

Taxonomy

Topics: Textile materials and evaluations

Methods: Softmax · Attention Is All You Need · Balanced Selection · Latent Diffusion Model