Rotate Your Character: Revisiting Video Diffusion Models for High-Quality 3D Character Generation

Jin Wang; Jianxiang Lu; Comi Chen; Guangzheng Xu; Haoyu Yang; Peng Chen; Na Zhang; Yifan Xu; Longhuang Wu; Shuai Shao; Qinglin Lu; Ping Luo

arXiv:2601.05722·cs.CV·January 12, 2026

Rotate Your Character: Revisiting Video Diffusion Models for High-Quality 3D Character Generation

Jin Wang, Jianxiang Lu, Comi Chen, Guangzheng Xu, Haoyu Yang, Peng Chen, Na Zhang, Yifan Xu, Longhuang Wu, Shuai Shao, Qinglin Lu, Ping Luo

PDF

Open Access

TL;DR

This paper introduces RCM, a diffusion-based framework for high-quality 3D character generation from images, enabling consistent view synthesis, high resolution, and multi-view conditioning, advancing digital content creation.

Contribution

RCM is a novel diffusion framework that achieves pose transfer, high-resolution view synthesis, and multi-view conditioning for 3D character generation from images.

Findings

01

Outperforms state-of-the-art in view synthesis quality

02

Supports high-resolution 1024x1024 video generation

03

Enables multi-view conditioning with up to 4 images

Abstract

Generating high-quality 3D characters from single images remains a significant challenge in digital content creation, particularly due to complex body poses and self-occlusion. In this paper, we present RCM (Rotate your Character Model), an advanced image-to-video diffusion framework tailored for high-quality novel view synthesis (NVS) and 3D character generation. Compared to existing diffusion-based approaches, RCM offers several key advantages: (1) transferring characters with any complex poses into a canonical pose, enabling consistent novel view synthesis across the entire viewing orbit, (2) high-resolution orbital video generation at 1024x1024 resolution, (3) controllable observation positions given different initial camera poses, and (4) multi-view conditioning supporting up to 4 input images, accommodating diverse user scenarios. Extensive experiments demonstrate that RCM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · 3D Shape Modeling and Analysis