Control4D: Efficient 4D Portrait Editing with Text

Ruizhi Shao; Jingxiang Sun; Cheng Peng; Zerong Zheng; Boyao Zhou,; Hongwen Zhang; Yebin Liu

arXiv:2305.20082·cs.CV·December 1, 2023·2 cites

Control4D: Efficient 4D Portrait Editing with Text

Ruizhi Shao, Jingxiang Sun, Cheng Peng, Zerong Zheng, Boyao Zhou,, Hongwen Zhang, Yebin Liu

PDF

Open Access

TL;DR

Control4D introduces a novel 4D representation and a generator-based approach to enable efficient, consistent, and high-quality text-driven editing of dynamic 4D portraits, overcoming limitations of previous methods.

Contribution

The paper presents GaussianPlanes, a structured 4D representation, and leverages a 4D generator to improve efficiency and consistency in 4D portrait editing from text instructions.

Findings

01

Reduced training time significantly

02

Achieved high-quality 4D rendering

03

Maintained spatial-temporal consistency

Abstract

We introduce Control4D, an innovative framework for editing dynamic 4D portraits using text instructions. Our method addresses the prevalent challenges in 4D editing, notably the inefficiencies of existing 4D representations and the inconsistent editing effect caused by diffusion-based editors. We first propose GaussianPlanes, a novel 4D representation that makes Gaussian Splatting more structured by applying plane-based decomposition in 3D space and time. This enhances both efficiency and robustness in 4D editing. Furthermore, we propose to leverage a 4D generator to learn a more continuous generation space from inconsistent edited images produced by the diffusion-based editor, which effectively improves the consistency and quality of 4D editing. Comprehensive evaluation demonstrates the superiority of Control4D, including significantly reduced training time, high-quality rendering,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Advanced Image and Video Retrieval Techniques