LES-Talker: Fine-Grained Emotion Editing for Talking Head Generation in Linear Emotion Space
Guanwen Feng, Zhihao Qian, Yunan Li, Siyu Jin, Qiguang Miao, Chi-Man, Pun

TL;DR
LES-Talker introduces a highly interpretable fine-grained emotion editing model for talking head generation, leveraging a Linear Emotion Space based on Facial Action Units to enable detailed and controllable facial expression transformations.
Contribution
The paper proposes LES-Talker, a novel model that uses a Linear Emotion Space and a Cross-Dimension Attention Net to achieve interpretable, fine-grained emotion editing in talking head synthesis.
Findings
Outperforms mainstream methods in visual quality and emotion control.
Provides high interpretability and fine-grained emotion editing capabilities.
Successfully models complex emotion transformations with detailed facial control.
Abstract
While existing one-shot talking head generation models have achieved progress in coarse-grained emotion editing, there is still a lack of fine-grained emotion editing models with high interpretability. We argue that for an approach to be considered fine-grained, it needs to provide clear definitions and sufficiently detailed differentiation. We present LES-Talker, a novel one-shot talking head generation model with high interpretability, to achieve fine-grained emotion editing across emotion types, emotion levels, and facial units. We propose a Linear Emotion Space (LES) definition based on Facial Action Units to characterize emotion transformations as vector transformations. We design the Cross-Dimension Attention Net (CDAN) to deeply mine the correlation between LES representation and 3D model representation. Through mining multiple relationships across different feature and structure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSocial Robot Interaction and HRI · Emotion and Mood Recognition
MethodsSoftmax · Attention Is All You Need
