TranSTYLer: Multimodal Behavioral Style Transfer for Facial and Body   Gestures Generation

Mireille Fares; Catherine Pelachaud; Nicolas Obin

arXiv:2308.10843·cs.MM·August 22, 2023

TranSTYLer: Multimodal Behavioral Style Transfer for Facial and Body Gestures Generation

Mireille Fares, Catherine Pelachaud, Nicolas Obin

PDF

Open Access

TL;DR

This paper introduces TranSTYLer, a multimodal transformer model that transfers behavioral expressivity styles across modalities like speech, gestures, and facial expressions, without requiring style labels, and outperforms existing methods in style transfer tasks.

Contribution

The paper presents a style-content disentanglement approach in a multimodal transformer that enables style transfer without style labels and generalizes to unseen styles.

Findings

01

Outperforms state-of-the-art in style transfer accuracy

02

Effective style transfer for both seen and unseen styles

03

Proposes a methodology to evaluate style and content preservation

Abstract

This paper addresses the challenge of transferring the behavior expressivity style of a virtual agent to another one while preserving behaviors shape as they carry communicative meaning. Behavior expressivity style is viewed here as the qualitative properties of behaviors. We propose TranSTYLer, a multimodal transformer based model that synthesizes the multimodal behaviors of a source speaker with the style of a target speaker. We assume that behavior expressivity style is encoded across various modalities of communication, including text, speech, body gestures, and facial expressions. The model employs a style and content disentanglement schema to ensure that the transferred style does not interfere with the meaning conveyed by the source behaviors. Our approach eliminates the need for style labels and allows the generalization to styles that have not been seen during the training…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSocial Robot Interaction and HRI · Speech and dialogue systems · Human Motion and Animation