ExpPortrait: Expressive Portrait Generation via Personalized Representation
Junyi Wang, Yudong Guo, Boyang Guo, Shengming Yang, Juyong Zhang

TL;DR
This paper introduces a personalized, high-fidelity head representation and an expression transfer module to improve the generation of expressive, coherent, and controllable portrait videos using diffusion models, surpassing previous methods in detail and stability.
Contribution
The paper proposes a novel personalized head representation and an expression transfer module, enabling more expressive and accurate portrait video synthesis with diffusion models.
Findings
Outperforms previous models in identity preservation.
Achieves higher expression accuracy in generated videos.
Demonstrates superior temporal stability and detail capture.
Abstract
While diffusion models have shown great potential in portrait generation, generating expressive, coherent, and controllable cinematic portrait videos remains a significant challenge. Existing intermediate signals for portrait generation, such as 2D landmarks and parametric models, have limited disentanglement capabilities and cannot express personalized details due to their sparse or low-rank representation. Therefore, existing methods based on these models struggle to accurately preserve subject identity and expressions, hindering the generation of highly expressive portrait videos. To overcome these limitations, we propose a high-fidelity personalized head representation that more effectively disentangles expression and identity. This representation captures both static, subject-specific global geometry and dynamic, expression-related details. Furthermore, we introduce an expression…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Multimodal Machine Learning Applications
