LaughTalk: Expressive 3D Talking Head Generation with Laughter

Kim Sung-Bin; Lee Hyun; Da Hye Hong; Suekyeong Nam; Janghoon Ju; Tae-Hyun Oh

arXiv:2311.00994 · cs.CV · November 3, 2023 · 1 citation

TL;DR

This paper introduces LaughTalk, a novel approach for generating 3D talking heads that can articulate speech and authentically express laughter, supported by a new dataset and a two-stage training scheme.

Contribution

The paper presents a new task, dataset, and baseline model for expressive 3D talking head generation including laughter, advancing social interaction realism in virtual avatars.

Findings

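01 The model generates 3D talking heads that both articulate speech and express laughter.

02 The proposed dataset, which pairs 2D laughing videos with pseudo-annotated and human-validated 3D FLAME parameters and vertices, enables training of laughter-aware talking head models.

03 The method performs favorably compared to existing approaches in both talking head generation and laughter expression.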

Abstract

Laughter is a unique expression, essential to affirmative social interactions of humans. Although current 3D talking head generation methods produce convincing verbal articulations, they often fail to capture the vitality and subtleties of laughter and smiles despite their importance in social context. In this paper, we introduce a novel task to generate 3D talking heads capable of both articulate speech and authentic laughter. Our newly curated dataset comprises 2D laughing videos paired with pseudo-annotated and human-validated 3D FLAME parameters and vertices. Given our proposed dataset, we present a strong baseline with a two-stage training scheme: the model first learns to talk and then acquires the ability to express laughter. Extensive experiments demonstrate that our method performs favorably compared to existing approaches in both talking head generation and expressing laughter…
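
The two-stage scheme described in the abstract (learn to talk first, then learn to laugh) can be illustrated with a toy sketch. This is not the paper's architecture or training code: the linear model, the synthetic "speech" and "laughter" datasets, and every name below (`train`, `make_data`, `w_stage1`, `w_stage2`) are assumptions made purely for illustration. It shows only the general pattern of initializing the second stage from the weights learned in the first.

```python
import random

def predict(w, x):
    # Toy linear map from an audio feature vector to one expression coefficient.
    return sum(wi * xi for wi, xi in zip(w, x))

def mse(w, data):
    # Mean squared error of the model over a list of (features, target) pairs.
    return sum((predict(w, x) - y) ** 2 for x, y in data) / len(data)

def train(w, data, lr, epochs):
    # Plain full-batch gradient descent on mean squared error.
    w = list(w)
    n = len(data)
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for x, y in data:
            err = predict(w, x) - y
            for i, xi in enumerate(x):
                grad[i] += 2 * err * xi / n
        w = [wi - lr * gi for wi, gi in zip(w, grad)]
    return w

def make_data(true_w, n, rng):
    # Synthetic pairs generated from a known weight vector (hypothetical data).
    data = []
    for _ in range(n):
        x = [rng.uniform(-1, 1) for _ in true_w]
        data.append((x, predict(true_w, x)))
    return data

rng = random.Random(0)
speech = make_data([1.0, -0.5, 0.0], 64, rng)    # stand-in for talking data
laughter = make_data([1.0, -0.5, 0.8], 64, rng)  # stand-in for laughing data

w0 = [0.0, 0.0, 0.0]
# Stage 1: learn to talk.
w_stage1 = train(w0, speech, lr=0.1, epochs=200)
# Stage 2: start from the stage-1 weights and learn to laugh.
w_stage2 = train(w_stage1, laughter, lr=0.1, epochs=200)
```

In this toy setup the laughter targets share the speech-related weights and add one extra component, so the second stage mostly has to learn that extra component. How the actual method balances the two abilities is not specified in the abstract.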

Videos

LaughTalk: Expressive 3D Talking Head Generation With Laughter · YouTube

Taxonomy

Topics: Face recognition and analysis · Generative Adversarial Networks and Image Synthesis · Speech and Audio Processing