Dance Your Latents: Consistent Dance Generation through Spatial-temporal   Subspace Attention Guided by Motion Flow

Haipeng Fang; Zhihao Sun; Ziyao Huang; Fan Tang; Juan Cao; Sheng Tang

arXiv:2310.14780·cs.CV·October 24, 2023·1 cites

Dance Your Latents: Consistent Dance Generation through Spatial-temporal Subspace Attention Guided by Motion Flow

Haipeng Fang, Zhihao Sun, Ziyao Huang, Fan Tang, Juan Cao, Sheng Tang

PDF

Open Access

TL;DR

This paper introduces Dance-Your-Latents, a novel framework that enhances the spatiotemporal consistency of dance video generation by using spatial-temporal subspace attention guided by motion flow, reducing artifacts like flickering.

Contribution

The paper proposes a new spatial-temporal subspace attention mechanism and motion flow guided alignment to improve dance video coherence in generative models.

Findings

01

Significantly improves spatiotemporal consistency in generated dance videos

02

Reduces artifacts such as ghosting and flickering

03

Demonstrates effectiveness on TikTok dataset

Abstract

The advancement of generative AI has extended to the realm of Human Dance Generation, demonstrating superior generative capacities. However, current methods still exhibit deficiencies in achieving spatiotemporal consistency, resulting in artifacts like ghosting, flickering, and incoherent motions. In this paper, we present Dance-Your-Latents, a framework that makes latents dance coherently following motion flow to generate consistent dance videos. Firstly, considering that each constituent element moves within a confined space, we introduce spatial-temporal subspace-attention blocks that decompose the global space into a combination of regular subspaces and efficiently model the spatiotemporal consistency within these subspaces. This module enables each patch pay attention to adjacent areas, mitigating the excessive dispersion of long-range attention. Furthermore, observing that body…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Human Pose and Action Recognition · Generative Adversarial Networks and Image Synthesis

MethodsALIGN