DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations

Ziqiao Peng; Yanbo Fan; Haoyu Wu; Xuan Wang; Hongyan Liu; Jun He; Zhaoxin Fan

arXiv:2505.18096·cs.CV·May 27, 2025

DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations

Ziqiao Peng, Yanbo Fan, Haoyu Wu, Xuan Wang, Hongyan Liu, Jun He, Zhaoxin Fan

PDF

1 Datasets

TL;DR

DualTalk introduces a unified framework for realistic 3D talking head conversations that seamlessly handle speaking and listening roles, improving naturalness and expressiveness in dual-speaker interactions.

Contribution

The paper presents a novel task and a unified model that captures dynamic speaker-listener behaviors in 3D talking head generation, along with a new dataset for multi-round conversations.

Findings

01

Enhanced naturalness of 3D talking heads in dual conversations

02

Significant improvement in expressiveness and realism

03

Effective modeling of speaker-listener role interplay

Abstract

In face-to-face conversations, individuals need to switch between speaking and listening roles seamlessly. Existing 3D talking head generation models focus solely on speaking or listening, neglecting the natural dynamics of interactive conversation, which leads to unnatural interactions and awkward transitions. To address this issue, we propose a new task -- multi-round dual-speaker interaction for 3D talking head generation -- which requires models to handle and generate both speaking and listening behaviors in continuous conversation. To solve this task, we introduce DualTalk, a novel unified framework that integrates the dynamic behaviors of speakers and listeners to simulate realistic and coherent dialogue interactions. This framework not only synthesizes lifelike talking heads when speaking but also generates continuous and vivid non-verbal feedback when listening, effectively…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

ZiqiaoPeng/DualTalk_Dataset
dataset· 96 dl
96 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocus