HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level and Fidelity-Rich Conditions in Diffusion Models
Shengkai Zhang, Nianhong Jiao, Tian Li, Chaojie Yang, Chenhui Xue,, Boya Niu, Jun Gao

TL;DR
This paper introduces HelloMeme, a method that enhances diffusion models with spatial knitting attentions, enabling complex downstream tasks like meme video generation while maintaining the models' generalization capabilities.
Contribution
It presents a novel adapter insertion technique that optimizes attention mechanisms in diffusion models, improving performance on high-level, fidelity-rich tasks.
Findings
Effective meme video generation demonstrated
Significant performance improvements observed
Good compatibility with SD1.5 models
Abstract
We propose an effective method for inserting adapters into text-to-image foundation models, which enables the execution of complex downstream tasks while preserving the generalization ability of the base model. The core idea of this method is to optimize the attention mechanism related to 2D feature maps, which enhances the performance of the adapter. This approach was validated on the task of meme video generation and achieved significant results. We hope this work can provide insights for post-training tasks of large text-to-image models. Additionally, as this method demonstrates good compatibility with SD1.5 derivative models, it holds certain value for the open-source community. Therefore, we will release the related code (\url{https://songkey.github.io/hellomeme}).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗songkey/hm_animatediffmodel· 6 dl6 dl
- 🤗songkey/hm_controlmodel· 12 dl12 dl
- 🤗songkey/hm_referencemodel· 11 dl11 dl
- 🤗songkey/hello_group_facemodelmodel
- 🤗songkey/hm_animatediff_frame12model· 11 dl· ♡ 211 dl♡ 2
- 🤗songkey/pd_fgc_motionmodel· 34 dl34 dl
- 🤗songkey/hm_control2model· 17 dl17 dl
- 🤗songkey/hm2_animatediff_frame12model· 12 dl12 dl
- 🤗songkey/hm2_referencemodel· 13 dl13 dl
- 🤗songkey/hm2_controlmodel· 7 dl7 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTextile materials and evaluations
MethodsSoftmax · Attention Is All You Need · Balanced Selection · Latent Diffusion Model
