LaughTalk: Expressive 3D Talking Head Generation with Laughter
Kim Sung-Bin, Lee Hyun, Da Hye Hong, Suekyeong Nam, Janghoon Ju,, Tae-Hyun Oh

TL;DR
This paper introduces LaughTalk, a novel approach for generating 3D talking heads that can articulate speech and authentically express laughter, supported by a new dataset and a two-stage training scheme.
Contribution
The paper presents a new task, dataset, and baseline model for expressive 3D talking head generation including laughter, advancing social interaction realism in virtual avatars.
Findings
Model effectively generates talking heads with laughter expressions.
Proposed dataset enables training of laughter-aware talking head models.
Method outperforms existing approaches in both speech and laughter expression.
Abstract
Laughter is a unique expression, essential to affirmative social interactions of humans. Although current 3D talking head generation methods produce convincing verbal articulations, they often fail to capture the vitality and subtleties of laughter and smiles despite their importance in social context. In this paper, we introduce a novel task to generate 3D talking heads capable of both articulate speech and authentic laughter. Our newly curated dataset comprises 2D laughing videos paired with pseudo-annotated and human-validated 3D FLAME parameters and vertices. Given our proposed dataset, we present a strong baseline with a two-stage training scheme: the model first learns to talk and then acquires the ability to express laughter. Extensive experiments demonstrate that our method performs favorably compared to existing approaches in both talking head generation and expressing laughter…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
LaughTalk: Expressive 3D Talking Head Generation With Laughter· youtube
Taxonomy
TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Speech and Audio Processing
