CFSynthesis: Controllable and Free-view 3D Human Video Synthesis

Liyuan Cui; Xiaogang Xu; Wenqi Dong; Zesong Yang; Hujun Bao; Zhaopeng; Cui

arXiv:2412.11067·cs.CV·December 19, 2024

CFSynthesis: Controllable and Free-view 3D Human Video Synthesis

Liyuan Cui, Xiaogang Xu, Wenqi Dong, Zesong Yang, Hujun Bao, Zhaopeng, Cui

PDF

Open Access 1 Models

TL;DR

CFSynthesis is a new framework for generating high-quality, controllable 3D human videos with customizable attributes, addressing limitations of 2D methods in complex poses and backgrounds.

Contribution

It introduces a texture-SMPL-based representation and a foreground-background separation strategy for stable, customizable 3D human video synthesis.

Findings

01

Achieves state-of-the-art performance in complex human animations

02

Effectively adapts to 3D motions in free-view scenarios

03

Enables seamless integration of user-defined backgrounds

Abstract

Human video synthesis aims to create lifelike characters in various environments, with wide applications in VR, storytelling, and content creation. While 2D diffusion-based methods have made significant progress, they struggle to generalize to complex 3D poses and varying scene backgrounds. To address these limitations, we introduce CFSynthesis, a novel framework for generating high-quality human videos with customizable attributes, including identity, motion, and scene configurations. Our method leverages a texture-SMPL-based representation to ensure consistent and stable character appearances across free viewpoints. Additionally, we introduce a novel foreground-background separation strategy that effectively decomposes the scene as foreground and background, enabling seamless integration of user-defined backgrounds. Experimental results on multiple datasets show that CFSynthesis not…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
lycui/CFSynthesis
model· 5 dl
5 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Human Pose and Action Recognition · 3D Shape Modeling and Analysis