Bidirectional Temporal Diffusion Model for Temporally Consistent Human   Animation

Tserendorj Adiya; Jae Shin Yoon; Jungeun Lee; Sanghun Kim; Hwasup Lim

arXiv:2307.00574·cs.CV·March 25, 2024

Bidirectional Temporal Diffusion Model for Temporally Consistent Human Animation

Tserendorj Adiya, Jae Shin Yoon, Jungeun Lee, Sanghun Kim, Hwasup Lim

PDF

Open Access 1 Video

TL;DR

This paper presents a bidirectional temporal diffusion model that enhances the realism and temporal consistency of human animation generated from various inputs by suppressing motion ambiguity through bidirectional modeling.

Contribution

The paper introduces a novel bidirectional temporal diffusion framework for human animation, improving over unidirectional methods in temporal coherence and realism.

Findings

01

Outperforms existing unidirectional methods in temporal coherence

02

Generates realistic human animations from single images, videos, or noise

03

Effectively suppresses motion artifacts and appearance distortions

Abstract

We introduce a method to generate temporally coherent human animation from a single image, a video, or a random noise. This problem has been formulated as modeling of an auto-regressive generation, i.e., to regress past frames to decode future frames. However, such unidirectional generation is highly prone to motion drifting over time, generating unrealistic human animation with significant artifacts such as appearance distortion. We claim that bidirectional temporal modeling enforces temporal coherence on a generative network by largely suppressing the motion ambiguity of human appearance. To prove our claim, we design a novel human animation framework using a denoising diffusion model: a neural network learns to generate the image of a person by denoising temporal Gaussian noises whose intermediate results are cross-conditioned bidirectionally between consecutive frames. In the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Bidirectional Temporal Diffusion Model for Temporally Consistent Human Animation· slideslive

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Human Motion and Animation · 3D Shape Modeling and Analysis

MethodsDiffusion