TranSTYLer: Multimodal Behavioral Style Transfer for Facial and Body Gestures Generation
Mireille Fares, Catherine Pelachaud, Nicolas Obin

TL;DR
This paper introduces TranSTYLer, a multimodal transformer model that synthesizes a source speaker's behaviors in the expressivity style of a target speaker. Style is assumed to be encoded across text, speech, body gestures, and facial expressions. The model requires no style labels and is reported to outperform existing methods on style transfer tasks.
Contribution
The paper presents a style-content disentanglement approach in a multimodal transformer that enables style transfer without style labels and generalizes to unseen styles.
Findings
Outperforms state-of-the-art in style transfer accuracy
Effective style transfer for both seen and unseen styles
Proposes a methodology to evaluate style and content preservation
Abstract
This paper addresses the challenge of transferring the behavior expressivity style of one virtual agent to another while preserving the shape of the behaviors, as they carry communicative meaning. Behavior expressivity style is viewed here as the qualitative properties of behaviors. We propose TranSTYLer, a multimodal transformer-based model that synthesizes the multimodal behaviors of a source speaker with the style of a target speaker. We assume that behavior expressivity style is encoded across various modalities of communication, including text, speech, body gestures, and facial expressions. The model employs a style and content disentanglement schema to ensure that the transferred style does not interfere with the meaning conveyed by the source behaviors. Our approach eliminates the need for style labels and allows generalization to styles that have not been seen during training…
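The idea of separating style from content can be illustrated with a toy sketch. This is not the TranSTYLer architecture, which the abstract describes as a multimodal transformer. It swaps in a much simpler statistics-matching stand-in, assuming that the "style" of a one-dimensional motion signal is its mean and spread and that its "content" is the normalized shape. The function names `encode_style`, `encode_content`, and `transfer` are hypothetical.

```python
from statistics import mean, pstdev


def encode_style(seq):
    """Toy 'style' (assumption): mean and population std of the signal."""
    return mean(seq), pstdev(seq)


def encode_content(seq):
    """Toy 'content' (assumption): the signal's shape with style removed."""
    mu, sigma = encode_style(seq)
    scale = sigma if sigma > 0 else 1.0  # avoid dividing by zero on flat signals
    return [(x - mu) / scale for x in seq]


def transfer(source_seq, target_seq):
    """Re-render the source's content using the target's style statistics."""
    content = encode_content(source_seq)
    mu_t, sigma_t = encode_style(target_seq)
    return [c * sigma_t + mu_t for c in content]


# Hypothetical signals: a small gesture (source) and a larger, offset one (target).
source = [0.0, 1.0, 0.0, -1.0]
target = [10.0, 14.0, 10.0, 6.0]
out = transfer(source, target)
```

In this sketch the output keeps the source's normalized shape while taking on the target's mean and spread, and no style label is needed because the style is read directly from the target signal. The learned model in the paper would replace these hand-written statistics with encoders trained on multimodal data.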
Taxonomy
Topics: Social Robot Interaction and HRI · Speech and dialogue systems · Human Motion and Animation
