FloAt: Flow Warping of Self-Attention for Clothing Animation Generation
Swasti Shreya Mishra, Kuldeep Kulkarni, Duygu Ceylan, Balaji Vasan, Srinivasan

TL;DR
FloAtControlNet introduces a flow warping technique for self-attention in diffusion models to generate realistic clothing animations from text prompts and normal maps, improving visual quality and reducing artifacts.
Contribution
The paper presents a novel flow-based self-attention manipulation method for clothing animation generation, leveraging normal maps and a training-free ControlNet backbone.
Findings
Outperforms baselines in visual quality and user studies.
Reduces background flickering in clothing animations.
Achieves better RMSE and PSNR scores compared to existing methods.
Abstract
We propose a diffusion model-based approach, FloAtControlNet to generate cinemagraphs composed of animations of human clothing. We focus on human clothing like dresses, skirts and pants. The input to our model is a text prompt depicting the type of clothing and the texture of clothing like leopard, striped, or plain, and a sequence of normal maps that capture the underlying animation that we desire in the output. The backbone of our method is a normal-map conditioned ControlNet which is operated in a training-free regime. The key observation is that the underlying animation is embedded in the flow of the normal maps. We utilize the flow thus obtained to manipulate the self-attention maps of appropriate layers. Specifically, the self-attention maps of a particular layer and frame are recomputed as a linear combination of itself and the self-attention maps of the same layer and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation
MethodsDiffusion · Contrastive Language-Image Pre-training · Focus
